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| PREFACE 
TO THE TENTH EDITION 


Thé book has been revised keeping in mind the comments and 
suggestions received from the readers. An attempt is made to eliminate 
the misprints/errors in the last edition. Further suggestions and criticism 
for the improvement of the book will be most welcome and thank‘ully 
acknowledged. oe 


August 2000 S.C. GUPTA 
V.K. KAPOOR 


TO THE NINTH EDITION 

The book originally written twenty-four years ago has, during the 
intervening period, been revised and reprinted several times. The 
authors have, however, been thinking, for the last few years that the 
book needed not only a thorough revision but rather a complete 
rewriting. They now take great pleasure in presenting to the readers the 
ninth completely revised and enlarged edition of the book. The subject- 
matter in the whole book has been rewritten in the light of numerous 
criticisms and suggestions received from the users of the previous 
editions in-India and abroad. 

Some salient features of the new edition are: 

e The entire text, especially Chapter 5 (Random Variables), Chapter 
6 (Mathematical Expectation), Chapters 7 and 8 (Theoretical Discrete 
and Continuous Distributions), Chapter 10 (Correlation and 
Regression), Chapter 15 (Theory of Estimation), has been restructured, 
rewritten and updated to cater to the revised syllabi of Indian 
universities, Indian Civil Services and various other competitive 
examinations. 

e During the course of rewriting, it has been specially borne in 
mind to retain all the basic features of the previous editions especially 
the simplicity of presentation, lucidity of style and analytical approach 
which have been appreciated by teachers and students all over India 
and abroad. 

e A number of typical problems have been added as solved 
examples in each chapter. These will enable the reader to have a better 
and thoughtful understanding of the basic concepts of the theory and 
its various applications. 

¢ Several new topics have been added at appropriate places to 
make the treatment more comprehensive and complete. Some of the 
obvious ADDITIONS are: 

§ 8-1.5 Triangular Distribution p. 8-10 to 8-12 

§ 8-8.3 Logistic Distribution p. 8-92 to 8-95 

§ 8-10 Remarks 2, Convergence in Distribution of Law p. 8-106 

§ 8-10.3. Remark 3, Relation between Central Limit Theorem and 

Weak Law of Large Numbers p. 8-110 
§ 8-10.4 Cramer’s Theorem p . 8-111-8.112, 8-114-8-115 — 
Example 8.46 


§ 8-14 to | Order Statistics — Theory, Illustrations and 
§ 8-14-6 J Exercise Set p. 8-136 to 8-151 
§ 8-15 ‘Truncated Distributions—with Illustrations 
p. 8-151 to 8-156 \ 
§ 10-6-1 Derivation of Rank Correlation Formula for Tied Ranks 
p. 10-40-10-41 
§ 10-7-1 Lines of Regression—Derivation (Aliter) 
p. 10-50-10-51. Example 10-21 p. 10-55 
§ 10-10-2 Remark to § 10-10-2 — Marginal Distributions of 
Bivariate Normal Distribution p. 10-88-10-90 
Theorem 10-5, p. 10-86. and Theorem 10-6, p. 10-87 on 
Bivariate Normal Distribution. 
Solved Examples 10-31, 10-32, pages 10-96-10-97 on 
BVN Distribution. 
Theorem 13-5 Alternative Proof of Distribution of (X, s) 
using m.g.f. p.13-19 to 13-21 
13-11 y?-Test for pooling of Probabilities (P, Test) p. 13-69 
15-4-1 Invariance property of Consistent Estimators—Theorem 
15-1, pp 15-3 
15-4-2 Sufficient Conditions for Consistency—Theorem 15-2, 
p. 15-3 
15-5-5 MVUE : Theorem 15-4, p. 15-12-15-13 
15-7 Remark 1. Minimum Variance Bound (MVB), Estimator, 
p.15-24 
15-7-1 Conditions for the equality sign in Cramer-Rao (CR) 
inequality, p. 15-25 to 15-27 
15-8 Complete family of Distributions (with illustrations), 
p. 15-31 to 15:34 
Theorem 15-10 (Blackwellisation), p. 15-36. 
Theorems 15-16 and 15-17 on MLE, p. 15-55. 
§ 16-5-1 Unbiased Test and Unbiased Critical Region. 
Theorem 16-2-pages 16-9-16-10 
§ 16-5-2 Optimum Regions and Sufficient Statistics, 
p.16-10-16-11 
Remark to Example 16-6, p. 16-17-16-18 and Remarks 
1, 2 to Example 16-7, p. 16-20 to 16-22; Graphical 
Representation of Critical Regions. 

e Exercise sets at the end of each chapter are substantially 
reorganised. Many new problems are included in the exercise sets. 
Repetition of questions of the same type (more than what is necessary) 
has been avoided. Further in the set of exercises, the problems have 
been carefully arranged and properly graded. More difficult problems 
are put in the miscellaneous exercise at the end of each chapter. 

¢ Solved examples and unsolved problems in the exercise sets 


have been drawn from the latest examination papers of various Indian 
Universities, Indiarr Civil Services, etc. 
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e An attempt has been made to rectify the errors in the previous 
editions. 

e The present edition Incorporates modern viewpoints. In fact 
with the addition of new topics, rewriting and revision of many 
others and restructuring of exercise sets, altogether a new book, 
covering the revised syllabi of almost all the Indian universities, 
is being presented to the réader. It is earnestly hoped that, in the 


new form, the book will prove of much greater utility to the students 
as well as teachers of the subject. 


We express our deep sense of gratitude to our Publishers M/s 
Sultan Chand & Sons and printers DRO Phototypesetter for their untiring 
efforts, unfailing courtesy, and co-operation in bringing out the book, 
in such an elegant form. We are also thankful to our several colleagues, 
friends and students for their suggestions and encouragement during 
the preparing of this revised edition. 

Suggestions and criticism for further improvement of the book as 
well as intimation of errors and misprints will be most gratefully received 
and duly acknowledged. 


SC. GUPTA & V.K. KAPOOR 


TO THE FIRST EDITION 


Although there are a iarge number of books available covering 
various aspects in the field of Mathematical Statistics, there is no 
comprehensive book dealing with the various topics on Mathematical 
Statistics for the students. The present book is a modest though 
determined bid to meet the requirements of the students of Mathematical 
Statistics at Degrée, Honours: and Post-graduate levels. The book will 
also be found’ of use by the students preparing for various competitive 
examinations. While writing this book our goal has been to present a 
clear, interesting, systematic and thoroughly teachable treatment of 
Mathematical Statistics ahd to provide a textbook which should not 
only serve as an introduction to the study of Mathematical Statistics 
but also carry the student on to such a level that he can read with 
profit the numercus special monographs which are available on the 
subject. In any branch of Mathematics, it is certainly the teacher who 
holds the key to successful learning, Our aim in writing this book has 
been simply to assist the teacher in conveying to the students more 
effectively a thorough understanding of Mathematical Statistics. 

The book contains sixteen chapters (equally divided between two 
volumes). The first chapter is devoted to a concise and logical 
development of the subject. The second and third chapters deal with 
the frequency distributions, and measures of average and dispersion. 
Mathematical treatment has been given to .the proofs of various articles 
included in these chapters in a very logical and simple manner. The 
theory of probability which has been developed by the application of 
the set theory has been discussed quite in detail. A large number of 
theorems have been deduced using the simple tools of set theory. The 
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simple applications of probability are also given. The chapters on 
mathematical expectation and theoretical distributions (discrete as well 
as continuous) have been written keeping the latest ideas in mind. A 
new treatment has been given to the chapters on correlation, regression 
and bivariate normal distribution using the concepts of mathematical 
expectation. The thirteenth and fourteenth chapters deal mainly with 
the various sampling distributions and the various tests of significance 
which can be derived from them. In chapter 15, we have discussed 
concisely statistical inference (estimation and testing of hypothesis). 
Abundant material is given in the chapter on finite differences and 
numerical integration. The whole of the relevant theory is arranged in 
the form of serialised articles which are concise and to the. point 
without being insufficient. The more difficult sections will, in general, 
be found towards the end of each chapter. We have tried our best to 
present the subject so as to be within the éasy grasp of students with 
varying degrees of intellectual attainment. 


Due care has been taken of the examination reeds of the students 
and, wherever possible, indication of the year, when the articles and 
problems were set in the examination as been given. While writing this 
text, we have gone through the syllabi and examination papers of 
almest all Indian universities where the subject is taught so as to 
make it as comprehensive as possible. Each chapter contains a large 
number of carefully graded worked problems mostly drawn from 
university papers with a view to acquainting the student with the typical 
questions pertaining to each topic. Furthermore, to assist the student 
to gain proficiency in the subject, a large number of properly graded 
problems mainly drawn from examination papers of various. universities 
are given at the end of each chapter. The questions and problems 
given at the end of each chapter usually require for their solution a 
thoughtful use of concepts. During the preparation of the text we have 
gone through a vast body of literature available on the subject, a list 
of which is given at the end of the book. It is expected that the 
bibliography given at the end of the book will considerably help those 
who want to make a detailed study of the subject 


The lucidity of style and simplicity of exptession have been our 
twin objects to remove the awe which is usually associated with most 
mathematical and statistical textbooks. 


While every effort has been made to avoid printing and other 
mistakes, we crave for the indulgence of the readers fof the errors that 
might have inadvertently crept in. We shall consider our efforts amply 
rewarded if those for whom the book is intended are benefited by. it. 
Suggestions for the improvement of the book will be highly appreciated 
and will be duly incorporated. 


SEPTEMBER 10, 1970 S.C. GUPTA & V.K. KAPOOR 
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CHAPTER ONE 


Introduction — Meaning and Scope 


1-1. Origin and Development of Statistics. Statistics, in a sense, 1S as old 
as the human socicty itself. .Jts origin can be uraced to the old days when it was 
regarded as the ‘science of State-craft’ and.was the by-product of the administrative 
activity of the State. The word ‘Statistics’ seems to have been-derived from the 
Latin word ‘status’ or the ltalian word ‘statista’ or the German word ‘statistik’ each 
of which means a ‘political state’. In ancient times, the government used to collect 
-the information regarding the population and ‘property or wealth’ of the country — 
the farmer enabling the government to have an idea of the manpower of the country 
(to safeguard itself against external aggression, if any), and the latter providing it 
a basis for introducing news taxes and levies. 

In India, an efficient system of collecting official and administrative statistics 
existed even more than 2,000 years ago, in particular, during the reign of Chandra 
Gupta Maurya ( 324 -300 B.C.). From Kautilya’s Arthshastra it is known that 
even before 300 B.C. a very good system of collecting ‘Vital Statistics’ and 
registration of births and deaths was in vogue. During Akbar’s reign (1556 - 1605 
A.D.), Raja Todarmal, the then. land and revenue minister, maintained good 
records of land and agricultural statistics. In Aina-e-Akbari written by Abul Fazl 
(in 1596 - 97 ), one of the nine gems of Akbar, we find detailed accounts of the 
administrative and statistical surveys conducted during Akbar’s reign. 

In Germany, the systematic collection of official statisucs originated towards 
the end of the 18th:century when, in order to have an idea of the relative strength 
of different German States, information regarding population and output — in- 
dustrial and agricultural — was collected. In England, statistics were the outcome 
of Napoleonic Wars. The Wars necessitated the systematic collection of numerical 
data to enable the government to assess the revenues and expenditure with greater 
precision and then to levy new taxes in order to mect the cost of war. 

Seventeenth century saw the.origin of the ‘Vital Statistics.’ Captain John 
Grant of London (1620 - 1674 ) , known as the ‘father’ of Vital Statistics, was the 
first man to study the statistics of births and deaths. Computation of moruality tables 
and the calculation of expectation of life at different ages by a number of persons, 
viz., Casper Newman, Sir William Petty (1623 « 1687 ), James Dodson, Dr. Price, 
to mention only a few, led to the idea of ‘life insurance’ and the first life insurance 
insUtution was founded in London in 1698. 

The theorctical deveiopment of the so-called modem statistics came during 
the mid-seventeenth century with the introduction of ‘Theory of Probability’ and 
‘Theory of Games and Chance’, the chief contributors being mathematicians and 
gamblers of France, Germany and England. The French mathematician Pascal 
(1623 - 1662 ), after lengthy correspondence with another French mathematician 
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P. Fermat (1601 - 1665 ) solved the famous ‘Problem of Points’ posed by the 
gambler Chevalier de - Mere. His study of the problem laid the foundation of the 
theory of probability which is the backbone of the modern theory of statistics. 
Pascal also investigated the properties of the co-efficients of binomial expansions 
and also invented mechanical computation machine. Other notable contributors in 
this field are : James Bernouli ( 1654 - 1705 ), who wrote the first treatise on the 
“Theory of Probability’; De-Moivre (1667: - 1754 ) who also worked on prob- 
abilities and annuities arid published his important work "The Doctrine of Chances” 
in 1718, Laplace (1749 - 1827 ) who published in 1782 his monumental work on 
the theory of ‘probability, and Gauss (1777 - 1855 ), perhaps the most original of 
all writers on statistical subjects, who gave the principle-of least squares and the 
normal law of errors. Later on, most of the prominent mathematicians of 18th, 19th 
and 20th centuries, viz., Euler, Lagrange, Bayes, A. Markoff, Khintchin, Kol- 
mogoroff, to mention only a few, added to the contributions in. the field of 
probability. 

Modern veterans in the development of the subject are Englishmen. Francis 
Galton (1822- 1921), with his works on ‘regressian’ , pioneered the use of statistical 
methods in the field of Biometry. Karl Pearson (1857-1936), the founder of the 
greatest statistical laboratory in England (1911), is the pioneer in, correlational 
analysis. His discovery of the ‘chi square test’, the first and the most important of 
modern tests of significance, won for Statistics a place as a science, In 1908 the 
discovery of Student's ‘/’ distribution by W.S. Gosset who wrote under the, 
pseudonym of ‘Student’ ushered in an era of exact.sample tests (small samples). 

Sir Ronald A. Fisher (1890 - 1962), known as the ‘Father of Statistics’ , placed 
Statistics on a very sound footing by applying it to various diversified fields, such 
as genetics; biometry, education, agriculture, etc. Apart from enlarging the existing 
theory, he is the pioneer in introducing the concepts.of “Poirtt Estumation’ (efficien- 
cy, sufficiency, principle of maximum likelihood,.etc.), ‘Fiducial Inference’ and 
‘Exact Sampling) Distributions.’ He also pioneered the study of ‘Analysis: of 
Variance’ and ‘Destgn of Experiments.’ His contributions won for Statistics a very 
responsible position among sciences. 

1-2. Definition of Statistics. Statistics has been defined differently by 
different authors from time to time. The reasons for a variety of definitions are 
primarily two. First, in modern times the field of utility of Statistics has widened 
considerably. In ancient times Statistics was confined only to the affairs of State 
but now it embraces almost every sphere of human activity. Hence a number of 
old definitions which were confined to a very narrow field of enquiry were replaced 
by new definitions which are much more comprehensive and exhaustive. Secondly, 
Statistics has been defined in two ways. Some writers define it as ‘statistical data’, 
i.e., numerical statement of facts, while others define it as ‘statistical methods’ ,i.e., 
complete body of the principles and techniques used in,collecting and ae 
such data. Some of the important definitions are given below. 
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Statistics as ‘Statistical Data’ 

Webster defines Statistics as "classified facts representing the conditions of 
the people in a State ... especially those facts which can be stated in numbers or in 
any other tabular or classified arrangement.” This definition, since it confines 
Statistics only to the data pertaining to State, is inadequate as the domain of 
‘Statistics is much wider. 

Bowley defines Statistics as ” numerical statements of facts in any department 
of enquiry placed in-relation to each other." ; 

A more exhaustive definition is given by Prof. Horace Secrist as follows : 

" By Statistics we mean aggregates of facts affected to a marked extent by 
multiplicity of causes numerically expressed, enumerated or estimated according 
to reasonable standards of accuracy, collected in a systematic manner for a 
pre-determined purpose and placed in relation te each other." 

Statistics as Statistical.Methods 
Bowley himself defines Statistics in. the following three different ways : 

(i) Statistics may be called the scivhce of counung. 

(ii) Statistics may rightly be called the science of averages. 

(iti) Statistics is the scierice of the measurement of social organism, regarded 
as a whole in all its manifestations. ; 

But none of the above definitions is adequate. The first because-Statisticssis 
not merely confined to the collection of data as other aspects like presentation, 
analysis and interpretation, etc., are also covered by it. The second, because 
averages are only a part of the statistical tools used in the analysis of the data, others’ 
being dispersion, skewness, kurtosis, correlation, regression, etc. The third, be-- 
Cause it restricts the application of Statistics to sociology alone while in modern 
days Statistics is used in almost all sciences — social as well as physical. 

According to Boddington, " Statistics is the. science of estimates and prob- 
abilities." This also is an inadequate definition since probabilities and estimates: 
constitute only a part of the statistical methods. 

Some other definitions are : 

" The science of Statistics is the method of judging collective, natural or social 
phenomenon from the resuits obtained from the analysis or enumeration or 
collection of estimates." — King. 

" Statistics is the science which deals with collection, classification and 
tabulation of numerical facts as the basis for explanation, description and com-' 
parison of phenomenon." = Lovitt. 

Perhaps the best definition-seems to be one given by- Croxton and Cowden, 
according to whom Statistics may be defined as “ the science which deals with.the 
collection, analysis and interpretation of numerical data.” 
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1:3. Importance and Scope of Statistics. In modcrn umes, Statistics is 
viewed not as a mere device for collecting numerical data but as a méans of 
developing sound techniques for their handling and analysis and drawing valid 
inferences from them. As such it is not confined to the affairs of the State but is 
intruding constantly into various diversified spheres of life — social, economic and 
political. [tis now finding wide applications in almost all sciences — social as well 
as physical - such as biology, psychology, education, economics, business manage- 
ment, etc. It is hardly possible to enumerate even a single department of human 
activity where statistics does not creep in. It has rather become indispensable in 
all phases of human endeavour. 

Statistics aud Planning. Statistics is indispensable to planning. In the 
modem age which is termed as ‘the age of planning’, almost all over the world, 
goernments, particularly of the budding economies, are resorting to planning for 
the economic development. In order that planning is successful, it must be based 
soundly on the correct analysis of complex statistical data. 

Statistics and Economics. Statistical data and technique of statistical analysis 
have proved immensely useful in solving a variety of economic problems, such as 
wages, prices, analysis of time series and demand analysis. It has also facilitated 
the development of economic theory. Wide applications of mathematics and 
statistics in the study of economics have led to the development of new disciplines 
called Economie Statistics and Econometrics. 

Statistics and Business. Statistics is an indispensable tool of production 
control also. Business executives are relying more and more on statistical techni- 
ques for studying the needs and the desires of the consumers and for many other 
purposes. The success of a businessman more or less depends upon the accuracy 
and precision of his statistical forecasting. Wrong expectations, which may be the 
result of faulty and inaccurate analysis of, various causes affecting a particular 
phenomenon, might lead to his disaster. Suppose a businessman wants to manufac- 
ture readymade garments. Before starting with the production process he must 
have an overall idea as to ‘how many.garments are to be manufactured’, ‘how much 
raw material and labour is needed for that’ , ‘and ‘what is the quality, shape, colour, 
size, etc., of the garments to be manufactured’. Thus the formulation of a produc- 
tion plan in advance is a must which cannot be done without having quantitative 
facts about the details mentioned above. As such most of the large industrial and 
commercial enterprises are employing trained and efficient statisticians. 

Statistics and Industry. In industry, Statistics is very widely used in ‘Quality 
Control’. in production engineering, to find whether the product is conforming to 
specifications or not, staustical tools, viz., inspection plans, control charts, etc., are 
of extreme importance. In inspection plans we have to resort to some kind of 
sampling — a very important aspect of Statistics. 
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Statistics and Mathematics. Staustics and mathematics are very intimately 
related. Recent advancements in statistical techniques are the outcome of wide. 
applications of advanced mathemaucs. Main contributors to statistics, namely,. 
Bernouli, Pascal, Laplace, De-Moivre, Gauss, R. A. Fisher, to mention only a few, 
were primarily talented and skilled mathematicians. Statistics may be regarded as 
that branch of mathematics which provided us with systematic methods of analys- 
ing a large number of related numerical facts. According to Connor, ” Statistics is 
4 branch of Applied Mathematics which specialises in data." Increasing role of 
mathematics in statistical analysis has resulted in a new branch of Statistics called 
Mathematical Statistics. ; 

Statistics and Biology, Astronomy and Medical Science. The association 
between statistical methods and biological theories was first studied by Francis 
Galton in his work in ‘Regression’. According to Prof. Karl Pearson, the whole 
‘theory of heredity’ rests on statistical basis. He says, ” The whole problem of 
evolution is a problem of vital statistics, a problem of longevity, of fertility, of 
health, of disease and it is impossible for the Registrar General to discuss the 
national mortality without an enumeration of the population, a classification of 
deaths and knowledge of Statistical theory." 

In astronomy, the theory of Gaussian ‘Normal Law of Errors’ for the study. 
of the movement of stars and planets is developed by using the ‘Principle of Least 
Squares’. 

In medical science also, the statistical tools for the collection, presentation 
and analysis of observed facts relating to the causes and incidence of diseases and 
the results obtained from the use of various drugs and medicines, are of great 
importance. Moreover, the efficacy of a manufacutured drug or injection or 
medicine is tested by using the ‘tests of significance’ — (1-test). 

Statistics and Psychology and Education. In education and psychology, too, 
Statistics has found wide applications, e.g., to determine the reliability and validity 
of atest, ‘Factor Analysis’, etc.,so much so that a new subject called ‘Psychometry’ 
has come into existence. 

Statistics and War. In war, the theory of ‘Decision Functions’ can be of great 
assistance to military and technical personnel to plan ‘maximum destruction with 
minimum effort’. 

Thus, we see that the science of Statistics is: associated with almost all thé 
sciences — social as well as physical. Bowley has rightly said, " A knowledge of 
Statistics is like a knowledge of foreign language or of algebra, it may prove of.use 
at any time under any circumstance.” 

1-4, Limitations of Statistics. Statistics, with its wide applications in almost 
every sphere of human activity; is not without limitations. The following are some 
of its important limitations : 
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(i) Statistics is not suited-to the study of qualitative phenomenon. Statistics, 
being a science dealing with a set of numerical data, is applicable to the study of 
only those subjects of enquiry which are capable of quantitative measurement. As 
such; qualitative phenomena like honesty, poverty, culture, etc., which cannot be 
expressed numerically, are not capable of direct statistical analysis. However, 
statistical techniques may be applied indirectly by first reducing the qualitative 
expressions to precise quantitative terms. For example, the intelligence.of-a group 
of candidates can be studied on the basis of their scores in a certain test. 

(ii) Statistics does not study individuals. Statistics deals with an aggregate of 
objects and does not give any specific recognition to the individual items of a series. 
Individual items, taken separately, do'not constitute statistical data and are mean- 
ingless for any statistical enquiry. For example, the individual figures of agricul- 
tural production, industrial output or national income of any country fora particular 
year are meaningless unless, to facilitate comparison, similar figures of other 
countries or of the same country for different years are given. Hence, statistical 
analysis is Stxod to only those problems where group characteristics are to be 

_ Studied. 

(iii) Statistical laws are not exact. Unlike the laws of physical and natural 
sciences, Statistical laws are only approximations and not exact. On the basis of 
Statistical analysis we can talk only in terms of probability and chance and not in 
terms of certainty. Statistical conclusicns are not universally true — they aré true 
only onan average. For example, let us consider the statement :" It has been found 
that 20 % of-a certain surgical operations by a particular doctor are successful." 

+ The statement does not imply that if the doctor is to operate on 5 persons on any 
day and four of the operations have proved fatal, the fifth must be a success. It may 
happen that fifth man also dies of the operation or it may also happen that of the 
five operations on any day, 2 or 3 oreven moré may be successful. By the statement 
we mean that as number of operations becomes larger and larger we should expect, 
on the average, 20 % operations to be successful. 

(iv) Statistics is liable to be misused. Perhaps the most important limitation 
of Statistics is that it must be used by experts. As the saying goes, “ Statistical 
methods are the most dangerous tools in the hands of the inexperts. Statistics is 
one of those sciences whose adepts must exercise the self-restraint of an artist.” 
The use of statistical tools by inexperienced and untrained persons might lead to 
very fallacious conclusions. One of the greatest shortcomings of Statistics is that 
they do not bear on their face the ‘label of their quality and as such can be moulded 
and manipulated in any manner to support one’s way of argument and reasoning. 
As King says, ” Statistics are like clay of which one can make a god or devil as one 
pleases.” The requirement of experience and skill for judicious use of statistical 
methods restricts their use to experts only and limits the chances of the mass 
popularity of this useful and important science. 
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1-5. Distrust of Statistics. We often hear the following interesting comments 
on Statistics : 

(i) ‘An ounce of truth will produce tons of Statistics’, 

(ii) ‘Statistics can prove anything’, 

(iii) ‘Figures do not lie. Liars figure’, 

(iv) ‘If figures say so it can’t be otherwise’, 

(v) ‘There are three type of liés — lies, demand lies, and Statistics — wicked in 
the order of their naming, and so on. 

Some of the reasons for the existence of such divergent views regarding the 
nature and function of Statistics are as follows : 

(i) Figures are innocent, easily believable and more convincing. The facis 
supported by figures are psychologically more appealing. 

(ii) Figures put forward for arguments may be inaccurate or incomplete and 
thus might, lead to wrong inferences. 

(iii) Figures, though accurate, might be moulded and manipulated by selfish 
persons to conceal the truth and present a distorted picture of facts to the public to 
meet their selfish motives. When the skilled talkers, writers or politicians through 
their forceful writings and speeches or the business and commercial enterprises 
through advertisements in the press mislead the public or belie their expectations 
by quoting wrong statistical statements or manipulating statistsical data for per- 
sonal motives, the public loses its faith and belief in the science of Statistics and 
starts condemning it. We cannot blame the layman for his distrust of Statistics, as 
he, unlike statistician, is not in a position to distinguish between valid and invalid 
conclusions from statistical statements and analysis. 

It may be pointed out that Statistics neither proves anything nor disproves 
anything. It is only a tool which if rightly used may prove extremely useful and if 
misused, might be disastrous. According to Bowley, “Statistics only furnishes a 
tool, necessary though imperfect, which is dangerous in the hands of those who do 
not know its use and its deficiencies.” It is not the subject of Statistics that is to be 
blamed but those people who twist the numerical data and misuse them either due 
to ignorance or deliberately for personal selfish motives. As King points out, 
"Science of Statistics is the most useful servant but only of great value to those who 
understand its proper use.” 

We discuss below a few interesting examples of misrepresentation of statistical 
data. 

(i) A statistical report : "The number of accidents taking place in the middlc 
of the road is much less than the number of accidents taking place on its side. Hence 
it is safer to walk in the middle of the road.” This conclusion is obviously wrong 
since we are not given the proportion of the number of accidents to the number of 
persons walking in the two cases. 
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(ii) “The number ofstudents taking up Mathematics Honours in a University 

has increased 5 times during the last 3 years. Thus, Mathematics is gaining 
‘popularity among the students of the university." Again, the conclusion is faulty 

since we are not given any such details about the other subjects and hence 
comparative study is not possible. 

(iit) "99% of the people who drink alcohol die before attaining the age of 100 
years. Hence drinking is harmful for longevity of life." This statement, too, is 
incorrect since nothing is mentioned about the number of persons who do not drink 
alcohol and die before attaining the age of 100 years. 

Thus, statistical arguments based on incomplete data often lead to fallacious 


_ conclusions. . 


FREQUENCY DISTRIBUTIONS AND 
MEASURES OF CENTRAL TENDENCY 


2-1. Frequency Distributions. When observations, discrete or continuous, 
are available on a single characteristic of a large number of individuals, often it 
becomes necessary to condense the data as far as possible without losing any 
‘nformation of interest. Lei us consider the marks in Statistics obtained by 250 
candidates selected at random from among those appearing in a certain examina- 
tron. 

TABLE 1: MARKS IN STATISTICS OF 250 CANDIDATES 


32 47 41 51 41 30 39 18 48 53 
54 32 31 46 15 37 32 56 42 48 
38 26 50 40 38 42 35 22 62 51 
44 21 45 31 37 41 44 18 37 = 47 
68 41 30 52 52 60 42 38 38 34 
41 53 48 21 28 49 42 36 41 29 
30 33 37 35 29 37 38 40 32 49 
43 32 24 38 38 22 41 50 17 46 
46 50 26 15 23 42 25 52 38 46 
4) 38 40 37 40 48 45 30 28 31 
40 33 42 36 51 42 56 44 35 38 
31 51 45 41 50 53 50 32 45 48 
40 43 40 34 34 44 38 58 49 28 
40 45 19 24 34 47 37 33 37 36 


48 50 43 55 43 39 41 48 53. 34 
32 31 42 34 34 32 33 24 43 39 
40 30 27 47 34 44 34 33 47 42 
17 42 57 35 38 ? 33 46 36 2B 
48 30 31 38 33 44 26 29 31 37 
47 35 57 37 41 54 42 45 47 43 
37 52 47 46 44 50 44 38 42 19 
32 45 23 41 47 33 42 24 48 39 


This representation of the data does not furnish any useful information and is 
rather confusing to mind. A better way may be to express the figures in an 
ascending or descending order of magnitude, commonly termed as array. But this 
does not reduce the bulk of the data. A much better representation is givén on the 
next page. 

A bar (|) called tally mark is put against the number when it occurs. Having 
occurred four times, the fifth occurrence is represented by putting a cross tally (/) 


on the first four tallies. This technique facilitates the counting of the tally marks 
at the end. 
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The representation of the data as above is known as frequency distribution. 
Marks are called the variable (x ) and the ‘number of students’ against the marks 
is known as the frequency (f ) of the variable. The word ‘frequency’ is derived 
from ‘how frequently’ a variable occurs. For example, in the above case the 
frequency of 31 is 10 as there are ten students getting 31 marks. This repre- 
sentation, though better than an array’ , does not condense the data much and it is 
quite cumbersome to go through this huge mass of data. 

TABLE 2 


Marks No.of Students Total Marks No.of Students Total 


Tally Marks Frequency 


Tally Marks Frequency 


15 =2 40 Wi un tf =11 
17 ill =3 41 =10 
18 =2 42 wi wl it =13 
19 Il =2 43 wh Ill = 
21 tl =2 44 ul Ut Il =12 
22 ll =2 45 Wi Il =7 
23 =3 46 wil tl =7 
24 HHI =4 47 UA ilk =8 
25 | =1 48 wi Wl it = 12 
26 HT =3 49 | oe =3 
27 =1 50 UA. - Uf = 10 
28 iN =3 51 fil =4 
29 | =2 52 un =5 
30 wi =5 53 UU) =4 
31 ui wh =10 54 tH =3 
32 ~ ul uf =10 55 il =2 
33 WA ill =8 56 ll =2 
34 wi wi! =11 57 il =2 
35 un =5 58 il =2 
36 ul =5 60 UII =3 
37 Ul wl Il =12 61 | =] 
3 wi Ww Ww It =17 ‘62 | =1 
39 ul | =6 68 | =1 


If the identity of the individuals about whom a particular information is taken 


is not relevant, nor the order in which the observations arise, then the first real step 
of condensation is to divide the observed range of variable into a suitable number 
of class-intervals and to record the number of observations in each class. For 


example, in the above case, the data may be expressed as shown in Table 3. 
Such a table showing the distribu- —TABLE3:FREQUENCY TABLE __ 


tion of the- frequencies, in the different ; Marks No. of students. 
classes is called a frequency table and (x) (f) 
the manner in which the class frequen- 15 — 19 9 
cies are distributed over the class inter- 20 — 24 1 
vals is called the grouped frequency a = an re 
distribution of the variable. 35 —39 45 
Remark. The classes of the type i aa oe 
15—19, 20—24, 25—29 etc., in which 50 — 54 26 
both the upper and lower limits are 55 —59 8 
included are called ‘inclusive classes’. eee ; 
For example the class 20—2A, includes Total 750 
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all the values from;20 to 24, both inclusive ‘and the classification is termed as 
inclusive type classification. 

In spite of great importance of classification in statistical analysis, no hard and 
fast rules can be laid down for it. The following points may be kept in mind for 
classification : 

(i) The classes should be clearly defined and should not lead to any ambiguity. 

(ii) The classes should be exhaustive, i.e., each of the given values should be 
included in one of the classes. 

(iii) The classes should be mutually exclusive and non-overlapping. 

(iv) The classes should be of equal width. The principle, however, cannot be 
rigidly followed. If the classes are of varying width, the different class frequencies 
will not be comparable. Comparable figures can be obtained by dividing the value 
of the frequencies by the corresponding widths of the class intervals. The ratios 
thus obtained are called ‘frequency densities’ . 

(v) Indeterminate classes, e.g., the open-end classes, less than ‘a’ or greater 
than ‘b’ should be avoided as far as possible since they create difficulty in analysis 
and interpretation. 

(vi) The number of classes should neither be too large nor too small. It should 
preferably lie between 5 and 15. However, the number of classes may be more* 
than 15 dependin g upon the total frequency and the details required, but it is 
desirable that it is not less than 5 since in.that case the classification may not reveal 
the essential characteristics of the population. The following formula due to 
_ Struges may be used to determine an approximate number k of classes : 

k=1+ 3-322 logio N, 
where WN is the total frequency. 


The Magnitude of the Class Interval 


Having fixed the number of classes, divide the range (the difference between 
the greatest and the smallest observation) by it and the nearest integer to this value 
gives the magnitude of the class interval. Broad class intervals (i:e., less number 
of classes) will yield only rough estimates while for high degree of accuracy small 
class intervals ( i.e., large number of classes) are desirable. 

Class Limits 

The class limits should be chosen in such a way that the mid-value of the class 
interval and actual average of the observations in that class interval are as near'to 
each other as possible. If this is not the case then the classification gives a distorted 
Picture of the characteristics of the data. If possible, class limits stiould bé located 
at the points which are multiple of 0, 2, 5, 10,... etc., so that the midpoints of the 


Classes are the Common figures, viz., 0, 2, 5, 10..., etc., the figures capable of easy 
and simple analysis. 
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2-1-1. Continuous Frequency Distribution. If we deal with a continuous 
variable, it is not possible to arrange the data in the class intervals of above type. 
Let us consider the distribution of age in years. If Class intervals are 15—19, 
20—24 then the persons with ages between 19 and 20 years are not taken into 
consideration. In such a case we form the class intervals as shown below. 

Age in years 
Below 5 
5 or more but less than 10 
10 or more but less than 15 
15 or more but less than 20 
20 or more but less than 25 
and so on. 
Here all the persons with any fraction of age are included in one group or the 
other. For practical purpose we re-write the above classes as 


0—5 
5— 10 
10 — 15 
15 — 20 
20 — 25 


This form of frequency distribution is known as continuous s-equency distribu- 
tion. 

It should be clearly understood that in, the above classes, the upper limits of 
each class are excluded from the respective classes. Such classes in which the upper 
limits are excluded from the respective classes and are included in the immediate 
next class are known as ‘exclusive classes’ and the classification is termed as 
‘exclusive type classification’ . 

2-2. Graphic Representation of a Frequency Distribution. It is often useful 
to represent a frequency distribution by means of a diagram which makes the 
unwieldy data intelligible and conveys to the eye the general run of the observa- 
tions. Diagrammatic representation also facilitates the comparison of two or more 
frequency distributions. We consider below some important types of graphic 
representation. 

2-2-1. Histogram. In drawing the histogram of a given continuous frequency 
distribution we first mark off along the x-axis all the class intervals on a suitable 
scale. On each class interval erect rectangles with heights proportional to the 
frequency of the corresponding class interval so that the area of the rectangle is 
Proportional to the frequency of the class. If, however, the classes are of unequal 
width then the height of the rectangle wili be proportional to the ratio of Ahe 
frequencies to the width of the classes. The diagram of continuous rectangles so 
obtained ts called histrogram. 
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Remarks. 1. To draw the histogram for an ungrouped fréquency distribution 
ofa variable we shall have to assume that the frequency corresponding to the variate 
value x is spread.over the interval x—- h#/2 to x+h/2, where A is the jump from 
one value to the next. 

2. If the grouped’ frequency distribution is not continuous, first it is to be 
converted into continuous distribution arid then the hisfrogam is drawn. 

3. Although the height of each rectangle is proportional to the frequency of 
the corresponding class, the height of a fraction of the rectangle is not proportional 
to the frequency of the corresponding fraction of the class, so that histogram cannot 
be directly used to read frequency over a fraction of a class interval. 

4, The histogram of the distribution of marks of 250 students in Table 3 (page 
2:2) is obtained as follows. 

Since the grouped frequency distribution is not continuous, we first convert it 
into a continuous distribution as follows: pistoGRAM FOR FREQ. DISTRIBUTION 


Marks No. of Students 36 
14-5—-19-5 9 48 
195-245 ! I 
24-5-29. 7 
29-5-34-5 44 yee 
34-5-39-5 45 O 92 
39-5,-44-5 54 z 
49.5-54-5 26 Ss 
54-5-59-5 8 © 16 
59-5-64-5 5 u. 
64:5-69-5 1 8: 


MARKS 
Remark. The upper and ‘lower class limits of the new exclusive type classes 
are known as class boundaries. 
If dis the gap between the upper limit of any class and the lower limit of the 
Succeeding class, the class boundaries for any class are then given by : 


Upper class boundary = Upper class limit + 5 


Lower class boundary = Lower class limit — s 


2-2-2, Frequency Polygon. For an ungrouped distribution, the frequency 
Polygon is obtained by plotting points with abscissa as the variate. values and the 
Ordinate as the corresponding frequencies and joining the plotted points by means 
of Straight lines. For a grouped frequency distribution, the abscissa-of points are 
mid-values of the class intervals. For equal class intervals the frequency polygon 
can be obtained by joining the middle points of the upper sides of the adjacent 
rectangles of the histogram by mzans of straight lines. If the class intervals aré of 
Small width the polygon can be approximated by a smooth curve. The frequency 
Curve can be obtained by drawing a smooth freehand curve through the vertices of 
the frequency polygon. 
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2-3. Averages or Measures of Central Tendency or Measures of Location. 
According to Professor Bowley, averages are "statistical constants which enable 
us to comprehend in a single effort the significance of the whole.” They give us 
an idea about the concentration of the values in the central part of the distribution. 
Plainly speaking, an average of a statistical series is the value of the variable which 
is representative of the entire distribution. The following are the ave measures of 
central tendency that are in common use: 

(i) Arithmetic Mean or simply Mean , (ii) Median, 

(iti) Mode, (iv) Geometric Mean, and (v) Harmonic Mean. 

2-4. Requisites for'an Ideal Measure of:Central Tendency. According to 
Professor Yule, the following are the characteristics to be satisfied by an ideal 
measure of central tendency : 

(i) It should be rigidly defined. 

(ii) It should be readily comprehensible and easy to calculate. 

(iii) It shoulé be based on all the observations. 

(iv) It should be suitable for further mathematical treatment. By this we mean 
that if we are given the averages and sizes of a number of series, we should be.able 
to calculate the average of the composite series obtained on combining the given 
series. 

(v) It should be affected as little as possible by fluctuations of sampling. 

In addition to the above criteria, we may add the following (which is not due 
to Prof. Yule) : 

(vi) It should not be affected much by extreme values. 

2-5. Arithmetic Mean. Arithmetic mean of aset of observastions is their sum 
divided by the number of observations, e.g., the anthmetic mean x of n observa- 
tiONS X, , X2,..., XeiS given by 

ee | I: = 
X= al Mt X2t 1+ Mp )= —- Lx, 

n nM j=1 
In case of frequency distribution x; | f;, i= 1,2,...,, where f; is the frequency of 
the variable x; ; 


fix 
= 1Xit+ f2 Xat... t+ [aXe i=l Le 
x= = = —— Lfx, | = N (21 
fit fat een 7 Zf = 2d [ he ( 


In case of grouped or continuous eile distribution, x is taken as the mid-value 
of the corresponding class. 

Remark. The symbol & is the letter capital sigma of the Greek alphabet and 
is used in mathematics to denote the sum of values. 
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Example 2-1. (a) Find the arithmetic mean of the following frequency dis- 


tribution: 
es I 2 3 4 5 6 7 
5 9 12 17 14 10 6 
() Calculate the arithmetic mean of the marks from the following table : 
Marks : 0-10 10-20 20-30 30-40 40-50 50-60 
No. of students : 12 18 27 20 17 6 


Solution.. (a) 


$F OD 
x f fx 
ne cS 
l 5 5 
2 9 18 
3 12 36 
4 17 68 
5 14 70 =e 
6 10 60 
7 6 42 
73 299 
sat _ 299 _ 
r= a LZ fx= 73 = 4-09 
(b) 
Marks No. of students Mid - point fx 
(f) (x) 
0-10 12 5 
10-20 18. 15 270 
20-30 27 25 675 
30-40 20 35 7 
40-50 17 45 765 
50-60 6 55 330 
Total 100 2,800 
Arithmetic mean or x= — ~ E fre = x 2,800= 28 


It may be noted that if Pi values of x or (and) f are large, the calculation of 
mean by formula (2-1) is quite time-consuming and tedious. The arithmetic is 
reduced to a great extent by taking the deviations of the given values from any 
arbitrary point ‘A’, as explained below. 

Let d= x- A, then. fidi=fi(i—- A= fixie Afi ' 

Summing a sides sh from Ito n, we get . 


E fide E fixn- AE f= E fin- A.N. 


i= | i= 1 t= 1 ~gx l 
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1 2 I 


where x is the arithmetic mean of the distribution. 


nh 
= LX fixi- A= x-A, 


i= 


X= At x (= fidi (2-2) 


This formula is much more convenient to apply than formula (2:1). 

Any number can serve the purpose of arbitrary point ‘A’ but, usually, the 
value of x corresponding to the middle part of the distribution will be much more 
convenient. 

In case of grouped: or continuous frequency distribution, the arithmetic is 
reduced to a sull greater extent by taking 
x, —-A 

A’ 
where A is an arbitrary point and A is the common magnitude of class interval. In 
this case, we have 


d; = 


hd;= xi- A, 
and proceeding exactly similarly as above, we get 
R 
Z=A+" & fid, (2-3) 
N jn) 
Example 2-2. Calculate the mean for the following frequency distribution. 
Class-interval : 0-8 816 16-24 24-32 32-40 40-48 


Frequency : 8 7 16 24 15 7 
Solution. 
Class~interval mid-value Frequency d=(x-A)/h fd 
(x) (f) 

0-8 4 8 —3 7 
8-16 12 7 ~2 ~14 
16-24 20 16 —l, -16 
24-32 28 24 0 0 
32-40 36 I5 1 15 
40-48 44 To 2 14 
77 -25 


Here we take A = 28 andA=8. 
r= A Aefe 28 + Sx 8) | 28 - = 25-404 


2-5-1. Properties of Arithmetic Mean 

Property 1. Algebraic sum of the deviations of a set of values from 
their arithmetic mean is zero. If xi\fi, i= 1,2,..,n is the frequency distribu- 
tion, then 
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n 
x fi (4%i- X) = 0, x being the mean of distribution. 
tol 


Proof. = fi(xi-X)= = fpx—-x Tf= = fixi-x.N 
i t t 


= fi Xi 
Also ¥=- => f fix=eNx 
t 


N 


nv 
Hence 2 fi(xa- x)= N.x-x.N=0 
| 
Property 2. Fhe sum of the squares of the deviations of a set of values is 
minimum when taken about mean. 


Proof. For the frequency distribution x;|f;, i=-1,2,...,, let 


” 
Z= = f(i- A), 
t=] 
be the sum of the squares of the deviations of given values from any arbitrary point 
‘A’. We have to prove that Z is minimum when A= x. 
Applying the principle of maxima and minima from differential calculus, Z 
will be minimum for variations in A if 


2 
92 oO and Te > 0 


dA 
dZ ‘ 
Now aA Ot fila-A)= 0 => &F fi(xj-Aj)= 0 
i i 
=> Lfixi- ALfi= 0 or An *hh 5 


 ) 


Again -- 2E f(-= 25 f= WO 
t t 


Hence Z is minimum at the point A = x. This establishes the résult. 
Property 3. (Mean of the composite series). If x;, (i= 1,2,..,k) are 
the means of k-component series of sizes nj, (i = 1, 2, ..., k) respectively, then the 
mean X of the composite series obtained on combining the component series ts 
given by the formula: 
My Xt NgM2+ Lt Me 


x= = In x; / In; Be 2-4) 
Nyt Met oot Ny, . 4 : 


Proof. Let x1) ,12, ...Xint be m members of the first series ; x3), 4, 
Xa, be yw. members of the second series, Xi , Xt2, -.-, Xin, DC me members of the 
kth series. Then, by def., 
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ae | 
x1* m (x11 + X12+ ... + Xing ) 


- 1 
x2= moot X22 + ... + X2n,) 


..(*) 


ee ] 
Xh mm Ont Xk2+ ... + Xi, ) 


The mean x of composite series of size 7m, + m2 + ... + My iS given-by 

(X11 + X12 +... + Xt) + (Kor + X22 + 02. HXony) + oe. + (Nat + Xk2 + 00. + Xm) 
a Ny + Nyt... My 
_ mM x: + Ny X2+ cee tM Xp 


> 


ny +ne+ ... Mm [From (*)] 
Thus, X= = jx; | (Zn;) 
8 2 

Example 2:3. The average salary of male employees in a firm was Rs.520 
and that of females was Rs.420. The mean salary of all the employees was Rs.500. 
Find the percentage of male and femdle employees. 

Solution. Let m, and m2 denote respectively the number of male and female 
employees in the concern and x, and x. denote respectively their average salary 
(in rupees). Let x denote the avarage salary of all the workers in the firm. 

We are given that : 

x, =520, X.=420 and x=500 
Also we know 
ca ny x + N2 x2 


y+ N2 
=> 500 (m + n2) = 520 ny, + 420 n2 y 
=> (520 - 500) m, = (500 — 420) n, 
=> 20 n= 80 nr 
a m4 
Mm +1 

Hence the percentage of male employees in the firm 

- resi x 100 = 80 
and percentage of female employees in the firm 

7 a x 100 = 20 


2:5:2. Merits and Demerits of Arithmetic. Mean 
Merits. (i) It is rigidly defined . 
(it) It is easy to understand and easy to calculate. 
(11t) It is based upon all the observations. 
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(iv) It is amenable to algebraic treatment. The mean of the composite series 
in terms of the means and sizes of the component Series is given by 
k k 
c— ap» njx;/ (2 n; ) 
i=! i 

(v) Of all the averages, arithmetic mean is affected least by fluctuations of 
sampling. This property is sometimes described by saying that arithmetic mean is. 
a stable average. 

Thus, we see that arithmetic mean satisfies all the properties laid down by Prof. 
Yule for an ideal average. 

Demerits. (i) It cannot be determined by inspection nor it can be located 
graphically. 

(ii) Arithmetic mean cannot be used if we are dealing with qualitative charac- 
teristics which cannot be measured quantitively; such as, intelligence, honesty, 
beauty, etc. In such cases median (discussed later) is the only average to be used. 

(iii) Arithmetic mean cannot be obtained tf a single observation is missing or 
lost or is illegible unless we drop it out and compute the arithmetic mean of the 
remaining values. . 

(iv) Arithmetic mean is affected very much by extreme values. In case of 
extreme items, arithmetic mean gives a distorted picture of the distribution and no 
longer remains representative of the distribution. 

(v) Arithmetic mean may lead to wrong conclusions if the details of the data 
from which it is computed are not given. Let us consider the following marks 
obtained by two students A and B in three tests, viz., terminal test, half-yearly 
examination and annual examination respectively. 


Marksin:— ITest Il Test Ill Test Average marks 
A 50% 60% 70% 60% 
B 20% 60% 50% 60% 


Thus average marks obtained by each of the two students at the end of the year 
are 60%. If we are given the average marks alone we conclude that the level of 
intelligence of both the students at the end of the year is same. This is a fallacious 
conclusion since we find from the data that student A has improved consistently 
while student B has deteriorated consistently. 

(vi) Arithmetic mean cannot be calculated if the extreme class is open, e.g., 
below 10 or above 90, Morever, even if a single observation is missing mean cannot 
be calculated. 

(vii) In extremely asymmertrical (skewed) distribution, usually arithmetic 
mean is not a suitable measure of location. 

2-5-3. Weighted Mean. In calculating arithmetic: mean we suppose that all 
the items in the distribution have equal importance. But in practice this may not 
be so. If some items in a distribution are more important than others, then this 
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point must be borne in mind, in order that avcrage computed is representative of 
the distribution. In such cases, proper weightage is to be given to various items 
— the weights attached to cach item being proportional to the importance of the 
item in the distribution. For example, if we want to have an idea of the change 
in cost of living of a certain group of pcople, then the simple mean of the prices 
of the commodities consumed by them will not do, since all the commodities are 
not equally important, e.g., wheat, rice and pulscs are more important than 
cigarettes, tea, confectionery, etc. 


Let w; be the weight attached to the item x;, i= 1,2, ..., 2. Then we define : 


Weighted arithmetic mean or weighted mean = ¥ w;4;/ Dw; ... (2-5) 
i i 


It may be observed that the formula for weighted mean is the samc as tlic 
formula for stmple mean with fj, (i= 1, 2, ..., 2), the frequencies replaced by 
w,, (i= 1, 2, ..., a), the weights. 


Weighted mean gives the result cqual to the simple mean if the weights 
assigned to each of the variate valucs are cqual. It results in higher value than the 
simple mean if smaller weights are given to smaller items and larger weights to 
larger items. If the weights attached to larger items are smaller and those attached 
to smaller items are larger, then the weighted mean results in smaller value than 
the simple mean. 


Example 2:4. Find the simple and weighted arithmetic mean of the first n 
natural numbers, the weights being the corresponding ntimbers. 


Solution. The first natural numbers are 1, 2 


sD casuals 
We know that 
+ 
142434. ¢ns8@t 
(24592 32... a 


Simple A.M. is 


Weighted A.M. is 


dYwX 174+274+.., +1? 
""Yw  1L4+e2e.. tan 


_an(u+1) Qn +1) 2 
7 6 "n (nt 1) 


xX 
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9.6. Median. Median of a distribution is the value of the variable which 
divides it into two equal parts. It,is the value which exceeds and is exceeded by 
the same number of observations, ie., it is the value such that the number of 
observations above it is equal to the number of observations below it. The median 
is thus a positional average. 

In case of ungrouped data, if the number of observations is odd then median 
is the middle value after the valdes nave been arranged in ascending or descending 
order of magnitude. In case of even number of observations, there are two middle 
terms and median is obtained by taking the arithmetic mean of the middle terms. 
For example, the median of the values 25, 20, 15, 35, 18, i.e., 15, 18, 20, 25; 35 is 
0 and the median of 8, 20, 50, 25, 15, 30, i.e., of 8, 15, 20, 25, 30, 50 is 
(204 25 )= 22:5. 


Remark. In case of even number of observations, in fact any value lying 
between the two middle values can be taken as median but conventionally we take 
it to be the mean of the middle terms. 

In case of discrete frequency distribution median is obtained by considering 
the cumulative frequencies. The steps for calculating median are given below: 

(i) Find N/2, wheteN= & f,. 


t 
(ii) See the (less than) cumulative frequency (c/f.) just greater than N/2. 
(iii) The corresponding value of x is median. 
Example 2-5. Obtain the median for the following frequency distribution: 


5 ae I 2 3 4 5 6 7 8 9 
ie 8 10 WU 16 2 .%g I5 9 6 
Solution. 

x f Sea OOF 

1 8 g 

2 10 18 

3 11 29 

4 16 45 

5 20 65 

6 25 90 

7 15 105 

g 9 114 

9 6 120 

120 


Hence N=120 => N/2=60 

Cumulative frequency (c,f.) just greater than N/2 , is 65 and the value of x 
corresponding to 65 is 5. Therefore, median is 5. ; 

In the case of continuous frequency distribution, the class corresponding to the 
cf. just greater than N/2 is called the median class and the value of median is 
obtained by the following formula : 
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|, ACN | 
Median= I+ (3 c| (26) 


where is the lower limit of the median class, 
f is the frequency of the median class, 
h is the magnitude of the median class, 
‘c’ is the cf. of the class preceding the median class, 
and N= Lf. 
Derivation of the Median Formula (2-6). Let us consider the following 
continuous frequency distribution, ( x1 < x2< ...<2Xe+1): 


Class interval : x; — x2, X2 — x3, scGecee’s Xk — Akal 4 csovceee Xn — Xn+1 

Frequency : ii fa a | ee eee Sn 

The cumulaltive frequency distribution is given by : 

Class interval : x, — X2 , X23, cecccsee Mh May coccenes Xu — Xa+1 

Frequency : F, | Fy sitkice, Ps 
where F;= fit fat ...... + f;. The class x» — x41 is the median class if and only 


if Fy.:< N/2< Fy. 

Now, if we assume that the variate values are uniformly distributed over the 
median-class which implies that the ogive 
is a Straight line in the median-class, then 
we get from the Fig. 1, 


RS. AC 
ano= Bs ~ BC 
,,  RE-TS _ AQ-CO 
BS BC 
RT-BP AQ-BP 
cr 
BS PQ 
op N22= Pet Fam Fecs 
BS PO 
_f Oo P T Q 
A -— h ————__+} 


where f; is the frequency and h the magnitude of the median class. 
h 


N 
S Ae vt] 


Hence 
Median= OT= OP + PT= OP +BS 
h( N 
=l+—|>-— Fy- 
l fil 2 k ; 
which is the required formula. 


Remark. The median formula (2-6) can be used only for continuous classes 
without any gaps, i.e., for ‘exclusive type’ classification. If we are given a frequency 
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distribution in which classes are of ‘inclusive type’ with gaps, then it must be 
converted into a continuous ‘exclusive type’ frequency distribution without any 
gaps before applying (2-6). This will affect the value of / in (2-6). As an 
‘tlustration see Example 2-7. 

Example 2-6. Find the median wage of the following distribution : 

Wages (in Rs.) : 20—30 30—40 40—50 50—60 60—70 

No. of labourers : 3 5 20 10 5 
{Gorakhpur Univ. B. Sc. 1989] 


Solution. 

Wages (in Rs.) No. of labourers cf. 
20—30 S 3 
30—40 5 8 
40—S50 20 28 
50—60 10 38 
60—70 5 43 


Here N/2 = 43/2 = 21-5 . Cumulative frequency just greater than 21-5 is 28 
and the corresponding class is 40-50. Thus median class is 40-SO. . Hence using 
(2-6), we get 


Median = 40+ (215 8)= 40+ 6-75 =, 46-75 


Thus median wage is Rs. 46:75. 

Example 2-7. Jn a factory employing 3,000 persons, 5 per cent earn less than 
Rs. 3 per hour, 580 earn from Rs. 3-01 to Rs. 4-50 per hour, 30 percent earn from 
Rs.4-51 toRs. 6-00 per hour, 500 earn from Rs.6-01 to RS. 7-50 per hour, 20 percent 
earn from Rs. 7-51 to Rs. 9-00 per hour, and the rest earn Rs. 9-01 or more per 
hour. What is the median wage? [Utkal Univ. B.Sc.1992] 


Solution. The given information can be expressed in tabular form as follows. 
CALCULATIONS FOR MEDIAN. WAGE 


Earnings Percentage No. of Less than Class 

(in Rs.) of workers workers (f ) c.f. boundaries 
less than 3 5% = x 3000 = 150 150 Below 3-005 
3-:01—4-50 —_ 580 730 3-005—4-505 
4-51—6-00 30 % 2. x 3000 = 900 1630  4.505—6.005 
601—7-50 — $00 2130 6-005—7-505 
7519.00 20 % = x 3000 = 600 2730 —-'7-505—9-005 

9-01 and above 3000 - 2730 = 270 3000=N 9-005 and above 


a 


t 


2-16 Fundamentals Of Mathematical Statistics 


N/2 = 1500. ine cf. just greater than 1500 is 1630. The corresponding class 
4-51-6-00, whose class boundaries are 4-505-6-005, is the median class. Using the 
median formula, we get : 

h( N 1-5 
Median = | + fl 2 - C |= 4-505 + 900 (1500 = 730) 
= 4.505+ 1-283 = 5-79 
Hence median wage is Rs. 5-79. 


Example 2-8. An incomplete frequency distribution is given as follows’ © 


Variable Frequency Variable Frequency 
10—20 12 S50—60 ? 
20—30 30 60—70 25 
30—40 ? 70—80 18 
40—50 65 Total 229 
Given that the median value is 46, determine the missing frequencies using the 
median formula. [Delhi Univ. B. Sc., Oct. 1992] 


Solution. Let the frequency of the class 30—40 be f; and that of 50—60 
be fa. 

Thén § f:-+ f2=229 -(12 + 30 + 65 + 25 + i8)=79. 

Since median is given to be 46, the class 40—S0 is the median class. 

Hence using median formula (2-6), we get 


46 = 40 + 114-5 — G2) x10 
46 - 49 = “9=L +x 10 or 6-3 oh 


fi=72-5-—39 =33-5 = 4 
[Since frequency is never fractional] 
fr=79-—34= 45 [Since f, +f2= 79 ] 
2-6-1. Merits and Demerits of Median 
Merits. (i) It is rigidly defined. 
(ii) It is easily understood and is easy to calculate. In some cases it can be 
located merely by inspection. 
(iii) It is not at all affected by extreme values. 
(iv) It can be calculated for distributions with open-end classes. 
Demerits. (i) In case of even number of observations median cannot be 
determined exactly. We merely estimate it by taking the mean of two middle terms. 
(ii) It is not based on all the observations. For example, the median of 10, 25, 
50, 60 and’65 is 50. We can replace the observations 10 and 25 by any two values 
which are smaller than 50 and the observations 60 and 65 by any two values greater 
than 50 without affecting the value of median. This property is sometimes described 
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py saying that median is insensitive. 

(iii) It is not amenable to algebraic treatment. 

(iv) As compared with mean, it is affected much by fluctuations of sampling. 

Uses. (i) Median is the only average to be used while dealing with qualitative 
data which cannot be measured quantitatively but stl can be arranged in ascending 
or descending order of magnitude, e.g., to find the average intelligence or average 
honesty among a group of people. 

(ii) It. is to be-used for determining the typical value in problems concerning 
wages, distribution of wealth, etc. 

2:7, Mode. Let us cosider the following statements : 

(i) The average height of an Indian (male) is S’~6". 
(ii) The average size of the shoes sold in a shop is 7. 
(iti) An average student in a hostel spends Rs.150 p.m. 

In all the above cases, the average referred to is mode. Mode is the value which 
occurs most frequently in a set of observastions and around which thé other items 
of the set cluster densely. In other words, mode is the value of the variable which 
is predominant in the series. Thus in the case of discrete frequency distribution 
mode is the value of. x corresponding to maximum frequency. For example, in the 
following frequency distribution : 

ee 2 3 4 5 6 7 8 

f : 4 9 16 8 25 22 15 7 3 
the value of x corresponding to the maximum frequency, viz., 25 is 4. Hence mode. 
is 4, 

But in any one (or.more) of the following cases : 

(i) if the maximum frequency is repeated, 

(ii) if the maximum frequency occurs in the very beginning or at the end of 
the distribution, and | 

(iii) if there are irregularities in the distribution, 
the value of mode is determined by the method of grouping, which is illustrated 
below by an example. 

Example 2-9. Find the mode of the following frequency distribution : 

Size(x): 1 2 3 45 6 7 8 9 10 11 #=12 
Frequency(f): 3 8 15 23 35 40 32 28 20 45 14 6 

Solution. Here we see that the distribution is not regular since the frequencies 
are increasing steadily up to 40 and then decrease but the frequency 45 after 20 
does not seem to be consistent with the distribution. Here we cannot say that since 
maximum frequency is 45, mode is 10. Here we shall locate mode by the method 
of grouping as explained below : 

The frequencies in column (i) are the original frequencies. Column (ii) is 
obtained by combining the frequencies two by two. If we leave the first frequency 
and combine the remaining frequencies two by two we get column (iii). Combining 
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4 
&’ 
& 


Frequency - 
(x) (i) (ii) (iii) (iv) (v) (vi) 


26 
23 ie B 
58 i 
2 | 107 100 


. | 
Biola sel, 
10 45 65 79 


59 | 6s 
20 


the frequenciés two by two after leaving the first two frequencies results in a 
repetition of column (ii). Hence, we proceed to combine the frequencies three by 
three, thus getting column (iv). The combination of frequencies three by three after 
leaving the first frequency results in column (v) and after leaving the first two 
frequencies results in column (vi). 

The maximum frequency in each column is given in black type. To find mode 
we form the following table : 


ANALYSIS TABLE 


Oo CONAN & WH 
> Ww 
oS 
et eee eee SS een ee” 
~J] 
wr. 
—m_yeet eee eee eee ee’ 


Value or combination of 
- values of x giving max. 
frequency Bi (2) 


Column Number Maximurn Frequency 
(I) (2) 


” (i) 10 
(ii) 5,6 
{TS a) | ne? Star soeer rere ie neon eae ore 6,7 
(iv) 4,5, 6, 
(v) 5,6,7 
(vi) 6,7,8 


On examining the values in column (3) above, we find that the value 6 is 
repeated.the maximum number of times and hence the value of mode is 6 and not 
10 which is an irregular item. 
In case of continuous frequency distribution, mode is given by the formula : 
Mode = 14M fo 5, Mi=fo_ (2: 
0 GB G-W) R-h-h a 
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where ! is the lower limit, A the magnitude and fi the frequency of the modal class, 
fo and fo are the frequencies of the classes preceding and succeeding the modal 


class respectively. 
Derivation of the Mode Formula (2-7). Let us consider the continuous 


frequency distribution : 
Class > X1—X2, X2— As, ee ney AEA Nha cee eve y An —— An 
Frequency: fi Go art ‘fe aes fei 
If f is the maximum of all the frequencies, then the modal class is 
(Xe — Xe+1)- 


Let us further consider a portion of the histogram, namely, the rectangles 
erected on the modal class and the two adjacent classes. The mode is the value of 
x for which the frequency curve has a maxima. Let the modal point be Q. 


LM _ LD _ AL _ _ AD. 
~ BC 


LM ___ PD- AP 
LN- LM BR-CR 
LM__ fe- fe-1 
or ao ip) 
h-LM fi- fear’ where ‘h’ is the magnitude of the 
modal (class. Thus solving for LM , we get 


Le., 
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LM= h(fi- fe-1) = h( fi -fi-1) 
(fe-frsr + (fe-fi-1)  2ft-fe-1— fev 
Hence Mode=OQ=OP+PQ=OP+LM 
h(fe~ fe-1) 
2h ~ fe-1 nd ii +1 
Example 2:10. Find the mode for the following distribution : 
Class - interval: 0-10 10-20 20-30 30-40 40-50 50-60 60-70 70-80 
Frequency : § 8 7 12 28 20 10 10 
Solution. Here maximum frequency is 28. Thus the class 40—S0 is the modal 
class. Using (2-7), the value of mode is given by 
_ 10(28-12) = 
Mode = 40+ (2x 28-12-20) ~ 40 + 6-666 = 46-67 (approx.) 
Example 2-11. The Median and Mode of the following wage distribution are 
known to be Rs. 33:50 and Rs. 34 respectively. Find the values of fs , fa and fs . 


= 1+ 


Wages : 0-10 10-20 20-30 30-40 40-50 
(in Rs.) 
Frequency : 4 16 f fs fs 
Wages : 50-60 60-70 Total 
Frequency : 6 4 230 
[Gujarat Univ. B.Sc., 1991] 
Solution. 
CALCULATIONS FOR MODE AND MEDIAN 
Wages Frequency Less than 
(in Rs.) ff) cf. 
0—10 4 4 

10—20 16 20 

20—30 fs 20 + fs 

30—40 fa 20+ fa+fa 

40—50 fs UW+fAtfatfs 

50—60 — 6 UG+thtfhtfs 

60—70 4 30+fstfatfs  _ 

Total 230 =30+fAtfetfs 


From the above table, we get 
Lf =320+fh+fa+fs=230 
=> fitfatfs= 230-30 = 200 (0) 
Since median is 33-5, which lies in the class 30-40, 30-40 is the median class. 
Using the median formula, we get 


hi N 
Md= {I+ 4 F-¢] 
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- 33-5 = 30+ = [115- (204 6) 
33-5- 30_ 95- : 
= 10..———«S 
= 0-35fA=95-fp => fe= 95-035f, ii) 


Mode being 34, the modal class is also 30-40. Using mode formula we get : 
34= 30+ J0U4- A) 


2fs- — 
34-30 fo+ 035 fa- 
10. 2f.— (200- 7 [Using (i) and (ii) ] 
ve 1-35 fs — 95 
= 04 = FA — 200 
=> 12f,- 80= 1-35f,- 95 
__95- 80 15 _ 
= f= 735-1207 0157 [© Aili) 


Substituting in (ii) we get: 
fs= 95— 0:35 x 100= 60 

Substituting the values of fs and f, in (i) we get: 
fs= 2000-fp-fa= 200-60- 100= 40 

Hence f= 60, fa = 100 and fs= 40. 

Remarks. 1. In case of irregularities in the distribution, or the maximum 
frequency being repeated or the maximum frequency occurring in the very begin- 
ning or at the end of the distribution, the modal class is determined by the method 
of grouping and the mode is obtained by using (2-7). 

Sometimes mode is estimated from the mean and the median. For a symmetri- 
cal distribution, mean, median and mode coincide. If the distribution is moderately 
asymmetrical, the mean, median and mode obey the following empirical relation- 
ship (due to Karl Pearson) : 

Mean — Median = = (Mean — Mode) 
=> Mode = 3 Median — 2 Mean (2-8) 

2. If the method of grouping gives the modal class which does not correspond 
to the maximum frequency, i.e., the frequency of modal class is not the maximum 
frequency, then in some situations we may get, 2 ft —ft-1 — fs 1 = 0. In such cases, 
the value of mode can be obtained by the formula : 


z h(fi— fe-1) 
Mode = {+ mittee rag Ye 


2 22 Fundamentals Of Mathematical Statistics 


2-7-1. Merits and Demerits of Mode 

Merits. (i) Mode is readily comprehensible and easy to calculate. Like 
median, mode can be located in some cases merely by inspection. 

(ii) Mode is not at all affected by extreme values. 

(iti) Mode can be conveniently located even if the frequency distribution has 
class-intervals of unequal magnitude provided the modal class and the classes 
preceding and succeeding it are of the same magnitude. Open—end classes also do 
not pose any problem in the location of mode. 

Demerits. (i) Mode is ill-defined. It is not always possible to find a clearly 
defined mode. In somc. cases, we may come across distributions with two modes, 
Such distributions are called bi-modal. If a distribution has more than two modes, 
it is said to be multimodal. 

(ii) It is not based upon all the observations. 

(iti) It is not capable of further mathematical treatment. 

(iv) As compared with mean, mode is affected to a greater extent by fluctua- 
tions of sampling. 

Uses. Mode is the average to be used to find the ideal size, e.g., in business 
forecasting, in the manufacture of ready-made garments, shoes, etc. 

2-8. Geometric Mean. Geometric mean of a set of n observations is the 
nth root of their product. Thus the geometric mean G,’ of n observations 
x;,t=1,2,...,7 is 

G= (t%1.X2.... .%0) (2:9) 


The computation is facilitated by the use of logarithms. Taking loganthm of 
both sides, we get 


n 
“log G= = (log. x + log x2+...+ log x.) = Zz logx 
i=) 
: es i ee 
G= Antilog E x log x | . 
nM j=1 .-(2-9a) 


In case of frequency distribution x |-f.,, (i= 1, 2, ..., 2) geometric mean, G is 
given by 


e. wee n 
G= [ xf cf... x |, where N= £ ff 
i=] 


..(2°10) 
Taking logarithms of both sides, we get 


log G= 5 (fi logxu+ frlogxm+ ...+ fr log x.) 


...(2:10a) 
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Thus we see that logarithm of G is the anthmetic mean of the loganthms of 
the given values. From (2:10a), we get 


N. 


n 
G = Antilog Li x flog «| .-.(2°10b) 
t=] 

In the case of grouped or continuous frequency distribution, x is taken to be 
the value corresponding to the mid-point of the class-intervals. 

2-8-1. Merits and Demerits of Geometric Mean 

Merits. (i) It is ngidly defined. 

(ii) It is based upon all the.observations. 

(iii) It is suitable for further mathematical treatment. If m and nz are the 
sizes, G, and G2 the géometric means of two sefies respectively, the geometric 
mean G, of the combined series is given by 
n log G, + .n2 log G2 

m+ -{2°11) 

Proof. Let x; (f=1,2,...,m) and x, (j=1,2,..,m) bem, and 

items of two series respectively: Then by def., 


log G = 


4d | 


1 
Gi= (Xu .X1.. a) - => log G\= a x log Mi 
A i 
1 
'G2= (x21 XN ve X2n2) os => log Gi= ae z log x7 
zy a 
The geometric mean G of the combined series is given " 
1 
Ge (Xi X12 «2. Xin» X21. X22 vee X2q2) | (+n) » 
ny n2 
log G = z log x;+ 2 log x, 
m+mljep jel 


1 
= G log G 
i+ ti [ m4 log i+ Nz 10g 2] 


The result can be easily generalised to more than two series. 

(iv) It is not affected much by fluctuations of sampling. 

(v) lt gives comparatively more weight to small items. 

Demerits. (i) Because of its abstract mathematical character, geometric mean 
is not easy to understand and to calculate for a non-mathematics. person. 

(ii) If any one of the observations is zero, geometric mean becomes Zero and 


if any one of the observations is negative, geometric mean becomes imaginary 
regardless of the magnitude of the other items. 


Uses. Ceommetric mean is used - 
(i) To find the rate of populatién growth and the rate Of interest. 
(tt) In the construction of.index numbers. 
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Example 2°12. Show that in finding the arithmetic mean of a set of readings 
on thermometer it does not matter whether we measure temperature in Centigrade 
or Fahrenheit, but that in finding the geometric mean it does matter which scale 
we use. [Patna Univ. B.Sc., 1991] 

Solution. LetC, , C2, ..., C, be the m readings on the Centigrade thermometer, 
Then their arithmetic mean C is given by : 


Ge * (Cit Cot .. + Cy) 


If F and C be the readings in Fahrenheit and Centigrade respectively then we 
have the relation : 


F-32 C 9 
ao 0p. F= 32+ =C. 
Thus the Fahrenheit equivalents of C; , C2, ..., C, are 
9 9 9 
32 + race 32 + 5 © ey oe 5 Ons 


respectively. 
Hence the arithmetic mean of the readings in Fahrenheit is 


Fet J (324 2c,)4(324 20,)4... + (324 5 C; | 
n 5 5 5 
1 
n 


aa ye Z an C2 + =) 
Pe) n 
a 9 _ 
= 32+ 5C 


which is the Fahrenheit equivalent of C . 
Hence in finding the arithmetic mean of a set of n readings on a thermometer, 
it is immaterial whether we measure temperature in Centigrade or Fahrenheit. 
Geometric mean G, of n readings in Centigrade is 
Ge (Cy.C2... Cn)” 
Geometric mean Gy, (Say), of Fahrenheit equivalents of C;, C2, ..., C, is 


9 9 9 1,’n 
= 9 — ~~ >) See eat n 1 
G { (22+ 5 ci} [2+ 52] [32+ 5° }| 


which is not equa] to Fahrenheit equivalent of G, viz., 
fe (Cy.Cz oc. Cy) 4 32 


Hence. in finding the geometric mean of the n readings ona thermometer, the 
scale,(Centrigrade or Fahrenheit) is important. 
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2-9. Harmonic Mean. Harmonic mean of a number of observations is the 
reciprocal of the anthmetic mean of the'reciprocals of the given values. Thus, 
harmonic mean H, of n observations x;, (= 1, 2, ..., is 

| 


H = ————— ...(2:12) 
ee 
— 2 (1/%; ) 
Niel 
In case of frequency distribution x;|f;, (§= 1, 2,..., n), 
n 
H=———_, [N= x fi (2-124) 
1 A 


2-9-1. Merits and Demerits of Harmonic Mean 
Merits. Harmonic mean is rigidJy defined, based upon all the observations 
and is suitable for further mathematical treatment. Like geometric mean, it is not 
affected much by fluctuations of sampling. It gives greater importance to small 
items and is useful only when small items have to be given a greater weightage. 
Demerits. Harmonic mean is not easily understood and is difficult to compute. 
Example 2-13. A cyclist pedals from his houseto his college at a speed of 10 
m.p.h. and back from the college to his house at 15 m.p.h. Find the average speed. 
Solution. Let the distance from the house to the college be x miles. in going 


from house to college, the distance (x miles) is covered in aa haus, while in 
coming from college to house, the distance is covered in = hours. Thus a total’ 


distance of 2x miles is covered in (5+ dl hours. 


Total distance travelled 2X 
Hence average speed = ———__—____——_ = ———_ 
Total time taken (23.5 
10 «15 
2 
TP AY 12 mop.h. 
(73° 33] 


Remark. 1. In this case.the average speed is given by the harmonic méan of 
10 and 15 and not by the airthmetic mean. 

Rather, we have the following general result : 

If equal distances are covered (travelled) per unit of time with speeds equal to 
Vi, Vo, .... Va, say, then the average speed is given by the harnionic mean of - 
Vis. Vay aes Vie Lé., 


Average speed = ee ee eee ee 


1 1 1 l 
{y+ oe 7 z (5) 


Proof is left as an exercise to the reader. 
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: ; Distance : Distance 
Hint. Speed = ae => Time = “Speed 
Total distance travelled 

Total time taken 
2. Weighted Harmonic Mean. Instead of fixed (constant) distance being 

travelled with varying speed, let us now suppose that different distances, say, 
S,, S3, ..., S,, are travelled with different speeds, say, V;, V2, .... V,. respectively. In 
that case, the average specd is given by the weighted harmonic mean of the speeds, 
the weights being the corresponding distances travelicd, i.e., 


Average Speed = 


- S,+5S2+...+5, 2S 

Average speed = -————_____ = ——_ 
S152, a >> Ss 
V; V2 a“ V, “a 


~ Example 2-14. You can take a trip which entails travelling 900 km. by train 
atan average speed of 60 km. per hour, 3000 km. by boat at an average of 25 km. 
p.h., 400 km. by plane at 350 km. per hour and finally LS km. by taxi at 25 km. 
per hour, What ts your average speed for the entire distance ? 

Solution. Since different distances are covered with varying speeds , the 
required average speed for the entire distance is given by tbe weighted harmonic 
mean of the specds (in km.p.b.), the weights being the corresponding distances 
covercd (in kms.). 

COMPUTATION OF WEIGHTED H. M. 
Average spced 


Speed Distance 
(km. [ hr.) (inkm.) W/X : =Ww 
_*- W = (W/X) 
60 900. 15-00 : 4315 
25 3000_—. 120-00 137-03 
350 400 1-43 a 
95 15 0-60 31-489 kon.p.h. 
Total 2W=4315 = (W/X) = 137-03 


2-10. Selection of an Average. From the preceding discussion it is evident 
that no single average is suitable for al] practical purposes. Each one of the average 
has its own merits and demerits and thus its own particular field of importance and 
utilitv. We cannot use the averages indiscriminately. A judicious selection of the 
average depending on the nature of the «lata and the purpose of the enquiry is 
essential ‘for sound statistical analysis. Since arithmetic mean satisfies all the 
properties of an ideal average as laid down by Prof. Yule, is familiar to a layman 
and further has wide applications in statistical theory at large, it may be regarded 
as the best of all the averages. ' 

2-11. Partition Values. These are the values which divide the scries into a 
number of equa! parts. 7 
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The three points which divide the series into four equal parts are called 
quartiles. The first, second and third points are known as the first, second and third 
quartiles respectively. The first quartile, Q,, is the value which exceed 25% of the 
observations and is exceeded by 75% of the observations. The second quartile, 
Q>, coincides with median. The third quartile, Q, , is the point which has 75% 
observations before it and 25% observations after it. 

The nine points which divide the series into ten equal parts are called deciles 
whereas percentiles are the ninety-nine points which divide the series into hundred 
equal parts. For example, D,, the seventh decile, has 70% observations before it 
and Pe, the forty-seventh percentile, is the point which exceed 47% of the 
observations. The methods of computing the partition values are the same as those 
of locating the median in the case of both discrete and continuous distributions. 

Example 2-15. Eight coins were tossed together and the number of heads 
resulting was noted. The operation was repeated 256 times and the frequencies (f) 
that were obtained for different values of x, the number of heads, are shown in the 
following table. Calculate median, quartiles, 4th decile and 27th precentile. 


x: 0 1 2 3 4 5 6 7 8 
f: 1 9 26 59 72 52 29 7 1 
Solution. ‘ 


x 0 1 2 3 4 5 6 7 8 
f: 1 9 26 59 72 52 29 7 1 
opt 1 10 36 95 167 219 248 255 256 
Median : Here N/2 = 256/2 = 128 . Cumulative frequency (c.f just greater 
than 128 is 167. Thus, median = 4. 
Q, : Here N/4 = 64. cf. just greater than 64 is 95. Hence, Q: = 3. 
Q; : Here 3N/4 = 192 and c/f. just greater than 192 is 219. Thus Q; = 5. 


D, : as 4x256= 102-4 and cf, just greater than 102: 4 is 167. Hence 


Py: 100 = 27 X2:56= 69-12 and cf. just greater than 69-12 is 95. Hence 


2:11-1. Graphical Location of the Partition Values. The partition values, 
viz., quartiles, deciles and percentiles, can be conveniently located with the help of 
a curve called the ‘cumulative frequency curve’ or ‘Ogive’. The procedure is 
illustrated below. 

First form the cumulative frequency table. Take the class intervals (or the 
variate values) along the x~axis and plot the corresponding cumulative frequencies 
along the y-axis against the upper limit of the class interval (or against the variate 
value in the case of discrete frequency distribution). The curve obtained on joining 
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the points so obtained by means of free hand drawing is called the cumulative 
frequency curve of ogive. The graphical lucation of partition values from this 
curve is explained below by means of an example. 

Example 2-16. Draw the cumulative frequency curve for the following 
distribution showing the number of marks of 59 students in Statistics. 


Marks-group : O—10 10—20 20—30 30—40 40—50 50—60 60—70 


No. of Students: 4 8 11 15 12 6 3 
Solution. 
Marks-group | No. of Students Less than More than 

cf. cf. 

10 4 4 59 

10—20 8 12 55 
20—30 11 23 47 
30—40 15 38 36 
40—5S0 12 50 21 
50—60 6 56 9 
60—70' 3 39 3 


Taking the marks-group along #-axis and c/. along y-axis, we plot the 
cumulative frequencies, viz., 4, 12, 23, ..., 59 against the upper limits of the 
corresponding classes, viz., 10, 20,..., 70 respectively. The smooth curve obtained 
on joining these points is called ogive or more particularly ‘less than’ ogive. 


e ? 
“MORE THAN’ LESS THAN 
OGIVE ee 


NO.OF STUDENTS 
S 
° 


0 ~ 10 20 a0 ey 50 60 70 
: BD 4571 
ay MARKS ——» 


If we plot the ‘more than’ cumulative fréquencies, viz., 59, 55,..., 3 against the 
lower limits of the corresponding classes, viz., 0, 10, ..:, 60 and join the points by 
‘a smooth curve, we get cumulative frequency curve which is also known as ogive 
or more particularly ‘more than" ogive. - 
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To locate graphically the value of median, mark a point corresponding to , 
N/2 along y-axis. At this point draw a line parallel to x-axis meeting the ogive at 
the point. ‘A’ (say). From ‘A’ draw a line prependicular to x-axis meeting itin ‘M’ 
(say). Then abscissa of ‘M’ gives the value of median. 

To locate the values of Q,; (or Q3), we mark the points along y-axis correspond- 
ing to N/4 (or 3N/4) and proceed exactly similarly. 

In the above example, we get from ogive 

Median= 34-33, Q:= 22-50, and Q3= 45-21. 

Remarks. 1. The median can also be located as follows : 

From the point of intersection of ‘less than’ ogive and “more than’ ogive, draw 
perpendicular to OX. The abscissa of the point so obtained gives median. 

2. Other partition values, viz., deciles and percentiles, can be similarly located 
from ‘ogive’. 

EXERCISE 

1. (a) What are grouped and ungrouped frequency distributions? What are 
their uses? What are the considerations that one has to bear in mind while forming 
the frequency distnbution? 

(b) Explain the method of constructing Histogram and Frequency Polygon. 
Which, out of these two, is better representative of frequencies of (i) a parucular 
group, and (ii) whole group. 

2. What are the principles governing the choice of : 

(i) Number of class intervals, 
(ii) The length of the class interval, 
(iii) The mid-point of the class interval. 
3. Write short notes on : 
(i) Frequency distribution, 
(ii) Histogram, frequency. polygon and frequency curve, 
(iti) Ogive. 

4. (a) What are the properties of a good average? Seanaine these properties 
with reference to the Arithmetic Mean, the Geometric Mean and the Harmonic 
Mean, and give an example of situations in which each of them can be the 
appropriate measure for the average. 

(b) Compare mean, median- and mode as measures of location of a distribu- 
tion. 

fc) The mean is the most common measure of central tendency ot the data. It 
satisfies almost all the requirements of a good average. The median is also an 
average, but it does not statisfy all the requirements of a good average. However, 
it Carries certain merits and hence is useful in particular fields. Critically examine 
both the averages. 

__ (d)Describe the different measures of central tendency ofa frequency distribu- 
tion, mentioning their merits and demerits. 
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5. Define (i) arithmetic mean, (ti) geometric mean and (iii) harmonic mean of 
grouped and ungrouped data. Compare and contrast the merits and demerits of 
them. Show that the geometric mean is capable of further mathematical treatment. 

6. (a) When is an average a meaningful statistics? What are the requisites of 
a statisfactory average? In this light compare the relative merits and demerits of 
three well-known averages. 

(b) What are the chief measures of central tendency? Discuss their merits. 

7. Show that (i) Sum of deviations about arithmetic mean is zero. 

(ii) Sum of absolute deviations abodt median is ‘least. 
(iii) Sum of the squares of deviations about arithmetic mean is least. 

8. The following numbers give the weights of 55 students of a class. Prepare 
a suitable frequency table. 

422 74 40 6 82 115 41 #61 #$7 £483 ~~ 63 

533. 110 76 & 50 67 65 #7 7 S56 95 

68 69 104 80 79 779 54 73 #459 81 10 

6 4 #77 #90 8 76 42 & 6 7 80 

722 #850 79 #452 103 9% S51 86 #=$%7 94 ~ 7;i 

(i) Draw the histogram and frequency polygon of the above data. 
(ii) For the above weights, prepare a cumulative frequency table and draw 
the less than ogive. 

9. (a) What are the points to be bome in mind in the formation of. frequency 
table? 

Choosing appropriate class-intervals, form a frequency table for the following data: 
102 05° 52 61 31 67 89 72 849 
54 36 92 61 73 20 13 64 = 80 
43 47 124 86 131 32 95 76 40 
51 81 21 115 31 68 70 82 £20 
31 65 11:2 120 S121 109 112 85 23 
34 52 107 49 62 : 

(b) What are the considerations one has_to bear in mind while forming a 
frequency distribution? 

A sample consists of 34 observations recorded correct to the nearest integer, 
ranging in value from 2Q1 to 337. If itis decided to use seven classes of width 20 
integers and to begin the first clags at 199-5, find the class limits and class marks 
of the seven classes. 

(c) The class marks in a frequency table (of whole numbers) are given to be 
5, 10, 15, 20, 25, 30, 35, 40, 45 and 50. Find out the following : 

(i) the true classes. 
(ii) the true class limits. 
(iii) the true upper class limits. 


Frequency Distributions And Measures Of Central Tendency 2.31 


10. (a) The following table shows the distribution of the number of students 

per teacher in 750 colleges :- 
Students : | 4 7 1 13 1 19 22 #25 = #28 
Frequency: 7 46 165 195 189 89 28 19 9 3 
Draw the histogram for the data and superimpose on it the frequency polygon. 
(b) Draw the histogram and frequency curve for the following data. 
Monthly wages 
inRs. 10-13 13-15 15-17 17-19 19-21 21-23 23-25 
No.of workers 6 53 85 56 21 16 8 
(c) Draw a histogram for the following data : 
Age(in years): 2-5  S-11 11-12 12-14 14-15 15-16 
No. of boys : 6 2 5 ] 3 

11. (a) Three people A, B,C were given the job of finding the average of 
5000 numbers. Each one did his own simplification. A’s method : Divide the sets 
into sets of 1000 each, calculate the average in each set and then calculate the 
average of these averages. B’s method : Divide the set into 2,000 and 3,000 
numbers, take average in each set and then take the average of the averages. C’s 
method :500 numbers were unities. He averaged all other numbers and then added 
one. Are these methods correct? 

Ans. Correct, not correct, not correct. : 

(b) The total sale (in ’000 rupees) of a particular item in a shop, on 10 
consecutive days, is reported by a clerk as, 35-00, 29-60, 38-00, 30-00, 40-00, 41-00, 
42-00, 45-00, 3-60, 3-80. Calculate the average. Later it was found that there was 
a number 10-00 in the machine and the reports of 4th to 8th days were 10-00 more 
than the true values and in the last 2 days he put a decimal in the wrong place thus 
for example 3-60 was really 36-0. Calculate the true mean value. 

Ans. 30-8, 32-46, 

12. (a) Given below is the distribution of 140 candidates obtaining marks X 
or higher in a certain examination (all marks are given in whole numbers) : 

X: 10 20 30 40 $0 6& 7 80 90 100° 

cf.: 140 133 118 100 75 £45 ~~ 25 9. 2 0 


Calculate the mean, median and mode of the distribution. 


cf. 
(less than) 


140 - a 7 
133 — 118 = 15 
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Mean = 54-5 + 140 = 50-714 
10 f 140 
Median = 49-5 + 70 a -65 | = 51-167 
(b) The four parts of a distribution are as follows : 
Part Frequency Mean 
1 50 61 
2 100 710 
3 120 80 
“4 30 83 
Find the mean of the distribution. (Madurai Univ. B.Sc., 1988) 


13. (a) Define a ‘weighted mean’. If several sets of observations are combined 
into a single set, show that the mean of the combined set is the weighted mean of 
several sets. 

(b) The weighted geometric mean of three numbers 229, 275 and 125 is 203 
The weights for the first and second numbers are 2 and 4 respectively. Find the 
weightof third. Ans. 3. 

14. Define the weighted arithmetic mean of a set of numbers. Show that it is 
unaffecied if all weights are multiplied by some common factor. 

The following table shows some data collected for the regions of a country: 


Number of inhabitants 


Percentage of 
(million) 


literates 


Average annual 
income per 
person (Rs.) 


850 


10 52 
5 68 
18 


Obtain the overall figures for the three regions taken together. Prove the 
formulae you use. [Calcutta Univ. B.A.(Hons.), 1991] 


15. Draw the Ogives and hence estimate the median. 
Clas 09 10-19 20-29 30-39 40-49 50-59 60-69 70-79 
Frequency 8 32 142 216 240 206 143 13 
16. The following data relate to the ages of a group of workers in a factory. 


Draw the percentage cumulative curve and find from the graph the number of 
workers between the ages 28—48. 
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17. (a) The mean of marks obtained in an examination by a group of 100 
students was found to be 49-96. The mean of the marks obtained in the same 
examination by another group of 200 students was 52-32. Find the mean of the 
marks obtained by both the groups of students taken together. 

(b) A distribution consists of three components with frequencies 300, 200 and 
600 having their means 16, 8 and 4 respectively. Find the mean of the combined 
distribution. 

(c) The mean marks got by 300 students in the subject of Statistics are 45. 
The mean of the top 100 of them was found to be 70 and the mean of the last 100 
was known to be 20. What is the mean of the remaining 100 students? 

(d) The mean weight of 150 students in a certain class is 60 kilograms. The 
mean weight of boys in the class is 70 kilograms and that of the girls is 55 kilograms. 

Find the number of boys and number of girls in the class. 
ns. (a) 51-53, (b)8, (c)45, (d) Boys = 50, Girls = 100. 

18. From the following data, calculate the percentage of workers getting 
wages 

(a) more than Rs. 44, (b) between Rs. 22 and Rs. 58, (c) Find Q, and Q3. 
Wages (RS.) 0-10 10-20 20-30 30-40 40-50 50-60 60-70 70-80 
No. of workers 20 45 85 1680 #70 55 35 30 

Hint. Assuming that frequencies are uniformly distributed over the entire 
interval, 

(a) Number of persons with wages more than Rs. 44 is 


[Pip x70 +55 +35 + 30162 


10 
Hence the percentage of workers getting over Rs. 44 is 
162 
= S09 * 100 = 32-4% 
(&) Percentage of workers getting wages between Rs. 22 und Rs. 58 is 
( OD * 8s |+ 160 +70+( => x 55 |x 100 + 500 = 68-4% 
19. For the two frequency distributions give below the mean calculated from 
the first was 25-4 and that from the second was 32-5. Find the values of x and y. 


Distribution I 
Frequency 


20 
20—30 15 
30—40 10 


x 
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20. A number of particular articles has been classified according to their 
weights. After drying for two weeks the same articles have again ‘been weighted 
and similarly classified. It is known that the median weight in the first weighing 
was 20-83 oz. while in the second weighing it was 17-35 oz. Some frequencies g 
and 6 in the first weighing and x and y in the second are missing. It is known that 

=1x and b=1y. Find out the values of the missing frequencies. 


Class Frequencies Class Frequencies 
Ist weighing Ind weighing Ist weighing IInd weighing 
0—S a x 1S— 20 52 50 
5—10 b y 20—25 75 30 
10—15 ss 11 40 25—30 22 28 


Hint. We have x = 3a, y = 2b, 
N, = Total frequency in lst weighing = 160+ a+b. 
N.= Total frequency in 2nd weighing = 148 + x+ y = 148 + 3a+ 2b. 


Using Median formula, we shall get 
; 2083=20+5| Mt _@3+a+0)| 
=> 15 (2083-20) =*P4A4? 634445) 
=> 1245 = 17-25" 
= a+b=2(17—- 1245) =9-10 <9 -..(*) 


Since a and b, being frequencies are integral valued, a + is also integral 
valued. Now the median of 2nd weighing gives : 


5 [ 148+ 3a+2b 
1735 = 15+ 5| MB+34+20 _ G04 x+y | 
= 10x 235 =74 +942" _ 49-30-20 
mn 257 24 -235=105 
=> 3a + 2b =21 w.(#*) 
Multiplying (*) by 3, we get 
3a + 36=27 oo (#4#) 


Subtracting (**) from (***), we get b=6. Substituting in (*), we get 
a=9-6=3. 
ne a=3, b=6; x=3a=9, y=2b= 12, 
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41. From the following ‘table showing the wage distribution in a certain 

factory, determine : 

(a) the mean wage, 

(b) the median wage, 

(c) the modal wage, 

(d) the wage limits for the middle 50% of the wage earners, 

(e) the percentages of workers who earned between Rs.75 and Rs.125. 

(f) the percentage who eared more than Rs.150 per week, and 

(g) the percentage who earned less than Rs.100 per week., 


Weekly wages | No. of employees 
) 


Weekly wages 
(Rs.) 


No. of employees 


Ans. (a) X = 108-5, (6) Med. = 108-75, (c) Mo= 118-3, (d) 81-25 , 129-3 
(e) 48, (f) 12, (g) 40. 

22. (a) Explain how the ogives are drawn for any frequency distribution. Point 
out the method of finding out the values of median, mode, quartiles, deciles and 
percentiles graphically. Also, write down the formula for the computation of each 
of them for any frequency distribution. 

(b) The following table gives the frequency distribution of marks in a class of 
65 students. 


Marks No. of Students Marks No. of students 
0-4 10 14-18 3 
4-8 12 18-20 3 

8-12 18 20-25 4 

12-14 7 25 and over 6 

Total 65 


Calculate : (i) Upper and lower quartiles. 

(ti) No. of students who secured marks more than 17. 

(iii) No. of students who secured marks between 10 and 15. 

(c) The following table shows the age distribution of heads of families in a 
certain country during the year 1957. Find the median, the third quartile and the 
second decile of the distribution. Check your results by the graphical method. 
Age of head of family 

years Under25 25-29 30-34 35-44 45-54 55-64 65-74 above 74 
Number 
(million) 2-3 4-1 53 106 97 68 4-4 1-8 
Total 45 
Ans. Md = 45-2 yrs.; G3 = 57-5 yrs.; 22 = 32-5 yrs. 
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23. The following data represent travel expenses (other than transportation) 
for 7 trips made during November by a salesman for a small firm : 


‘An auditor criticised these expenses as excessive, asserting that the average 
expense per day is Rs. 10 (Rs. 70 divided by 7). The salesman replied that the 
average is only Rs. 4-20 (Rs. 105 divided by 25) and that in any event the median 
is the appropriate measure and is only Rs. 3. The auditor rejoined that the arithmetic 
mean is the appropriate measure, but that the median is Rs. 6. 

You are required to: 

(a) Explain the proper interpretation of each of the four averages mentioned. 

(b) Which average seems appropriate to you ? 

24. (a) Define Geometric and Harmonic means and explain their uses in 
Statistical analysis. 

You take a trip which entails travelling 900 miles by train at an average speed 
of 60 m.p.h., 300 miles by boat at an average of 25 m.p.h., 400 miles by plane at 
350 m.p.h. and finally 15 miles by taxi at 25 m.p.h. What is your speed for the 
entire distance? 

(b) A train runs 25 miles at a speed of 30 m.p.h., another 50 miles at a speed 
of 40 m.p.h., then due to repairs of the track travels for 6 minutes at a speed of 10 
m.p.h. and finally covers the remaining distance of 24 miles at a speed of 24 m.p.h. 
What is the average speed in m.p.h.? 

(c) Aman motors from A to B. A large part of the distance is uphill and he 
gets a mileage of only 10 per gallon of gasoline. On the return trip he makes 15 
miles per gallon. Find the harmonic mean of his mileage. Verify the fact that this 
is the proper average to be used by assuming that the distance from A to B is 60 
miles. 

(d) Calculate the average speed of a car running at the rate of 15 km.p.h. during 
the first 30 kms., at 20 km.p.h. during the second 30 kms. and at 25 kmp.h. during 
the third 30 kms. 

25. The following table shows the distribution of 100 families according to 
their expenditure per week. Number of families corresponding to expenditure 
groups Rs. (10—20) and Rs.(30—40) are massing from the table. The median and 
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mode are given to be Rs.25 and 24 Calculate the missing frequencies and then 
arithmetic mean of the data : 


Expenditure : 0—10 10—20 20—30 30—40 40—50 
No. of families : 14 ? 27 ? 15 
Hint. 
Expenditure | No. of Families Cumulative frequencies 
0O—10 14 14 
10—20 |, fi l4+f, 
20—30 27 41+f, 
30—40 hh 41+fitfe 
40—S0 15 S6+fithe 
+ 
25 = 20 + a x 10 
_ 27-fi é 
and 24=20+ 2x27 =fixhr x 10 
Simplying these equations, we get 
fi-fh=l 
and 3fi — 2f2 = 27. 
Ans. 25,24 


26. (a) The numbers 3-2, 5-8, 7-9 and 4-5 have frequencies x, (x + 2), (x — 3) 
and (x +6) respectively. If their arithmetic mean is 4-876, find the value of x. 

(b) If M,.x is the geometric mean of N x’s amd M,, is the geometric miean of 
N y’s, then the geometric mean M, of the 2N values is given by 

M, =M,.:M,.y- (Nagpur Univ. B.Sc., 1990). 

(c) The weighted geometric mean of the three numbers 229, 275 and 125 is 
203. The weights for the first and the second numbers are 2 and 4 respectively. 
Find the weight of the third. Ans. 3. 

27. The geometric mean of 10 observations on a certain variable was calcu- 
lated as 16-2. It was later discovered that one of the observations was wrongly 
recorded as 12-9; in fact it was 21-9. Apply appropriate correction and calculate 
the correct geometric mean. 

Hint. Correct value of the geometic mean,G’ is given by 

»_ (06-2)! x 21-9 
G'= 129 = 17-68 
28. A variate takes the values a, ar, ar’, ..., ar"~' each with frequency unity. 


If A, G and H are respectively the arithmetic mean, geometric mean and harmonic 
mean, show that ‘ 
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_aU-r) Gz = grit ~ V2, H= an (1 -r)r' 
n(l-ry’ (1 -—r") 
Prove that G2 = AH. Prove also that A > G > H unless r = 1, when all the 
three means coincide. 


n n+] n+2 


- 1 | _ | 
29.Ifx,=-— & x,x5=- ZX x:and x4=- Xx; 
nse on Se! 4 n 2% 


then show that 
ey ol Se i 
(a) x2= xX, + Ons 1—X;), and (b) x4= x2 + On 4+2— 2) 


30. A distribution x), x2, ..., x, with frequencies f,, fp ...,f,, transformed 
into the distribution X,, X2, ..., X, with the same corresponding frequencies by 
the relation X, = ax, + b, where a and b are constants. Show that the mean, 
inedian and mode of the new distribution are given in terms of those of the first 
distribution by the same transformation. [Kanpur Univ. B.Sc., 1992] 


Use the method indicated above to find the mean of the following 
distribution: x (duration of telephone conversation in seconds) 


49-5, 149-5, 249-5, 349-5, 449-5, 549-5, 649-5,749-5, 849-5, 949.5 
f (respective frequency) 


6 28 88 180 247 260 132 48 11 5 


31. If x,, is the weighted mean of x,’s with weights w;, prove that 


n n n a a 
(x w) (5 w;(x;~— Xy)* )= d x Ww; ww; (x; - x)”, where > w; #0. 
i= i i= 


i=l j>i 


= (Allahabad Univ. B.Sc., 1992)-' 


nt 


a 
Hint. . 2 Ww; Wj (Xj - X) », », WW = x)” 


_l 
i=] i> ~ 2 


= a xo WwW; Wj { (x;-%,) - (x;- iw)} | 


32. In a frequency table, the upper boundary of each class interval has a 
constant ratio to the lower boundary. Show that the geometric mean G may be 
expressed by the formula : 


log G= =Xy +5 id 


where x, is the logarithm of the mid-value of the first interval and ¢ is the 
logarithm of the ratio between upper and lower boundaries. 


[Delhi Univ. B.Sc. (Stat. Hons.), 1990, 1986] 
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33. Find the minimum value of : 
(i) f(x) =(x- 6) + (x +3)? + (x - 8) + (x + 4) +(x - 3) 
(ii) 8) =| 6|+]x+3|+|x-8[+|x+4/+|x-3]. 
[Delhi Univ. B.Sc.(Stat. Hons.), 1991] 
Hint. The sum of squares of deviations is minimum when taken from 
arithmetic mean and the sum of absolute deviations is minimum when taken from 
median. 
34. If A, G and H be the arithmetic mean, geometric mean and harmonic 
mean respectively of two positive numbers a and b, then prove that : 
()Az2GeH. 
When does the equality sign hold? 
(ii) G? = AH. 
35 Calculate simple and wéighted arithmetic averages from the following 
data and comment on them : 


Designation Monthly salary (in Rs.) Strength of the cadre 
Class I Officers 1,500 10 
Class JI Officers 800 20 
Subordinate staff 500 70 
Clerical staff 250 100 
Lower staff 100 150 


Ans. X = Rs. 630, X. = Rs. 302-86. Latter is more representative. 

36. Treating the number of letters in each word in the follow.ag passage as 
the variable x, prepare the frequency distribution table and obtain its mean, 
median, mode. 

"The reliability of data must always be examined before any attempt is made 
to base conclusions upon them. This is true of all data, but particularly so of 
numerical data, which do not carry theiz quality written large on them. It is a waste 
of time to apply the refined theoretica] methods of Statistics to data which are 
suspect from the beginning." 

Ans. Mean = 4-565, Median = 4, Mode = 3. 


OBJECTIVE TYPE QUESTIONS 
I. Match the correct parts to make a valid statement : 


(a) Arithmetic Mean (i) I+ [fo/ (fi + fr)| xt 
(b) Geometric Mean (ii) (X1 .X2.....Xn)" 

(c) Harmonic Mean (i) LfX/Tf 

(d) Median Th ee ace Cee 


f 
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(e) Mode (v) F (Retest yf 
WAX, XxX, x 
. fi = fo ; 
Oy aie 
II. Which measure of location will be suitable to compare: 
(i) heights of students in two classes; 
(ii) size of agricultural holdings; 
(iii) average sales for various years; 
(iv) intelligence of students; 
(v) per capita income in several countries; 
(vi) sale of shirts with collar size; 16”, 153”, 15”, 14”, 13%, 15%: 
(vii) marks obtained 10, 8, 12, 4, 7, 11, and X (X < 5). 
Ans. (i) Mean, (ii) Mode, (iii) Mean, (iv) Median, (v) Mean, (vi) Mode, 
(vit) Median. 
III. Which of the following are true for all sets of data? 
(i) Arithmetic Mean < median < mode, 
(if) Arithmetic mean 2 median 2 mode, 
(iii) Arithmetic mean = median = mode 
(iv) Nonc of these. 
IV. Which of the following are truc in respect of any distribution? 
(i) The percentile points are in the ascending order. 
(ii) The percentile points are equispaced. 
(iit) The median is the mid-point of the range and the distribution. 
(iv) A unique median value exists for cach and every distribution. 
V. Find out the missing figures : 
(a) Mean = ? (3 Median — Mode). 
(b) Mean — Mode = ? (Mean ~ Median). 
(c) Median = Mode + ? (Mean — Mode). 
(d) Mode = Mean —? (Mean ~ Median). 
Ans, (a) 1/2, (b) 3, (c) 2/3, (d) 3. 
VI. Fill in blanks : 
(i) Harmonic mean of a number of observations is ...... 
(ii) The geometric mean of 2, 4, 16, and 32 is ...... 
(ii) The strength of 7 colleges in a city are 385; 1,748; 1,343; 1,935; 786; 
2.874 and 2,108. Then the median strength is ...... 


(tv) The geometric mean of a set of values lies between arithmetic mean 
and... . 


(v) The mean and median of 100 items are 50 and 52 respectively. The 
value of the largest item is 100. It was later found that it is actually 110. 
Therefore, the true mean is ... and the truce median is 
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(vi) The algebraic sum of the deviations of 20 observations measured from 
30 is 9, Therefore, mean of these observations is ..... 

(vii) The relationship between A.M., G.M. and H.M. is . 

(viii) The mean of 20 observations is 15. On checking it: it-was found that two 
observations were wrongly copied as 3 and 6. If wrong observations are replaced’ 
py correct values 8 and 4, then the correct mean is ..... 

(ix) Median = ..... Quartile. 
(x) Median is the average suited for ..:.. classes. 
(xi) A distribution with two modes is called ..... and with more than two 
modes is called ..... 
(Xii) ...+ is not affected by extreme observations. 

Ans. (ti) 8; (ttt) 1,748 ; (iv) H.M.; -(v) 50-1,523; (vt) 30-1 ; 
(vii) A. M. 2 G.M. 2 H.M. ; (vuti) 15- 15; (t%) Second: (x) Open end ; (xi) Bimodal, 
multimodal ; (xii) Median or mode. Za E 

VII. For the questions given below, give correct answers. 

(i) The algebraic sum of the deviations of a setof nm values from their 
arithmetic mean is 
(a) n, (b) 0, (c) 1, (d) none of these. 
(ii) The most stable measure of central tendency is 
(a) the mean, (b) the median, (c):the mode, (d) none of these. 
(iti) 10 is the mean of a set of 7 observations and 5 is the mean of'a set of 3 
observations. The mean of a combined set is given by * 
(a) 15, (b) 10, (c) 8-5, (d) 7-5, (e) none of these. 
(iv) The mean of the distribution, in which the value of x are 1, 2, ..., m, the 
frequency of each being unity is: 
(a) n(n+1)/2, (b) n/2, (c) (n+ 1)/2, (d) none of these. 
(v) The arithmetic mean of the numbers 1, 2, 3, ..., n is 


(q)2@tU Grey) py 220. mins 


(iu) The most stable measure of een tendency 
(vi) The point of intersection of the ‘less than’ and the ‘greater than’ ogive 
corresponds to 
(a) the mean, (b) the median, (c) the geometric mean, (d) none of these. 
(wi) When x; and y; are two variab;_s (1 = 1, 2, ..., n) with G.M.’s G; and G> 


, (c) nine’) , (d) none of these. 


e Xj e 
respectively then the geometric mesn of G is 
‘ 


(a) a (b) antilog | 2), (c) n (logG, ~logG.): 
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(d) Antilog _— =e 


Ans. (i) (b) ; (ii) (a); (iii) (©); (iv) (©); () J); (vi) (Bb); (vii) ©). 
VII. State which of the following statements are True and which-are False, 
In case.of false statements give the correct statement. 
(y The harmonic mean of 7” numbers is the reciprocal of thé Arithmetic 
mean of the reciprocals of the numbers. 
(it) Forthe wholesale manufacturers interested in the type which is usually 
in demand, median is the most suitable average. 
(ut) The algebraic sum of the deviations of a senes of individual observat— 
ions from.their mean is always zero. 
(iv) Geometric mean is the appropriate average when emphasis in on the 
rate of change rather than the amount of change. 
(v) Harmonic mean becomes zero when one of the items is zero. 
(vi) Mean lies between median and mode. 
(vii) Cumulative frequency is not-decreasing. 
(vit) Geometric mean is the arithmetic mean of harmonic mean and arith- 
metic mean. 
(ix) Mean, median mode have the same unit. 
(x) One quintal of wheat was purchased at 0-8 kg. per rupee and another 
quintal at 1-2 kg. per rupee. The average rate ‘per rupee is ‘1kg. 
(xi) One limitation of the median is that it cannot be calculated from a fre- 
quency distribution with open-end classes. 
(xi) The arithmetic mean of a frequency distribution is always located i in the 
class which has the greatest number of frequencies. 
(xiii) Ina moderately asymmetrical distribution, the mean, median and mode 
are the same. 
(xiv) It is really immatenal in which class an item falling at the boundary 
between two classés is listed. 
(xv) The median is not affected by extreme items. 
(xvi) The median is the point about which the sum of squared deviations is 
minim. 
(xvii) In construction of the frequency distribution, the selection of the class 
interval is arbitrary. 
(xviii) Usual attendance of B.Sc. class is 35 perday. So for 100 working days 
total attendance is 3,500. 
(xix) A car travels 100 miles at a speed of 40 m.p.b. and another 400 miles at 
a speed of 30 m.p.h. So the aversee speed for the whole journey is either 35 m.p.b. 
or 33 10.p.h. 
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(xx) In Calculating the mean for grouped data, the assumption is made that 
the mean of the items in each class is equal to the mid-value of the class. 
(xxi) The geometnc mean ofa group of numbers is less than the arithmetic 
mean in all cases, except in the special case in which the numbers are all the same. 
(xxii) The geometric mean equals the antilog of the arithmetic mean of the 
jogs of the values. 
(xxiii) The median may be considered more typical than the mean because the 
median is not affected by the size of the extremes. 
(xxiv) The Harmonic Mean of a series of fractions is the same as the reciprocal 
of the arithmetic mean of the series. 
(xxv) Ina frequency distribution the true value of mode cannot be calculated 
exactly. ; 
IX. In each of the following cases, explain whether the description applies to 
mean, median or both. 
(i) it can be calculated from a frequency distribution with open-end ciasses. 
(ii) the values of all items are taken into consideration in the calculation. 
(iti) the values of extreme items do not intluence the average. 

(iv) Ina distribution with a single peak and moderate skewness to the nght 
it is closer to the concentration of the distribution. 

Ans. (i) median, (ii) mean, (iii) median, (iv) median, 
X. Be brief in your answer : 

(a) The production in an industrial unit was 10,000 units‘dunng 1981 and 
in 1980 the production was 25,000 units. Hence the production has declined by 
150 percent. Comment.. 

(b) A man travels by a car for 4 days. He travelled for 10 hours each day. 
He drove on the first day at the rate of. 45 km per hour, second day at 40 km. per 
hour, third day at the rate of 38 km. per hour and the fourth day at the rate of 
37 km. per hour. 

Which average, harmonic mean or arithmetic mean or median will give us his 
average speed? Why? 

(c) It is seen from records that a country does not export more than 5 % of 
its total production. Hence export trade is not vital to the economy of that country. 
Is the conclusion right? 

(d) A survey revealed that the children of engineers, doctors and lawyers 
have high intelligence quotients. It further revealed that the grandfathers of these 


children were also highly intelligent. Hence the inference is that intelligence is 
hereWitary. Do you agree? 
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X5. Do you agree with the following interpretations made on the basis of the 
facts piven. Explain briefly your answer. 

(a) The number of deaths in military in the recent war was 10 out of 1,000 
while the number of deaths in Hyderabad in the same period was 18 per 1,000. 
Hence it is safe to join military service than to live in the city of Hyderabad. 

(b) The examination result in a college Y was 70% in the year 1991. In the 
same vear and at the same examination only 500 out of 750 students were successful 
in college Y. Hence the teaching standard in college X was better. 

(c) The average daily production in a small-scale factory in January 1991 
was 4,000 candles and 3,800 candles in February 1981. So the workers were more 
efficient in January. 

(d) The increase in the price of a commodity was 25%. Then the price dec- 

sreased by 20% and again increased by 10%. So the resultant increase in the price 
was 25—20 + 10 = 15 % 

(e) The rate of tomato in the first week of January was 2 kg. for a rupee and 
in the 2nd week was 4 kg. for a rupee. Hence the average price of tomato is 
+ (2+ 4) =3 kg. fora rupee. 

XU. (a) The mean‘mark of 100 students was given to be 40. It was found 
later that a mark 53 was read as 83. What is the corrected mean mark? 

(6; The mean salary paid to 1,000 employees of an establishment was 
found to be Rs. 108-40. Later on, after disbursement of salary it was discovered 
that the salary of two employees was wrongly entered as Rs. 297 and Rs. 165. Their 
correct salaries were Rs. 197 and Rs. 185. Find the correct arithmetic mean. 

(c) Twelve persons gambled ona certain night. Seven of them lost at an 
average rate of Rs. 10-50 while the remaining five gained al an average of 
Rs. 13-00. Is the information given above correct? If not, why? 


CHAPTER THREE 
Measures of Dispersion, 
Skewness and Kurtosis 


3-1. Dispersion. Averages or the measures of central tendency give us an 
idea of the concentration of the observations abou the central part of the dis- 
tribution. If we know the average alone we cannot form a complete idea about 
the distribution as will be cear from the following example. 

Consider the series (1) 7, 8, 10, 11, (ii) 3, 6, 9, 12, 15, (iti) 1, 5, 9, 13, 17. 
In all these cases we see that n, the number of sbecratiens | is 5 and the mean 
is 9. If we are given that the mean of 5 observations is 9, we cannot form an 
idea as to whether it is the average of first series Or Second Series or third scries 
or of any other serics of 5 observations whose sum is 45. Thus we see that the 
measures of central tendency are inadequate to give us a complete idea of the 
distribution. Thev anust be supported and supplemented by some other measures. 
One such measure is Dispersion. 

Literal meaning of dispersion is ‘scatteredness’. We study dispersion to 
bave an idca about the homogeneity or heterogeneity of the distribution. In the 
above case we sav that senes (1) is more homogeneous (less dispersed) than the 
serics (tt) or (111) or We sa v that series (111) is more heterogeneous (more scattered) 
than the series (i) or (11). 

3-2. Characteristics for an Ideal Measure of Dispersion.. The desiderata 
for an idea) measure of dispersion are the same as those for an ideal measure 
of central tendency, viz, 

(1) It should he rigidly defined. 
(tt) It should be easv to calculate and easy to understand. 
(iti) 1t should be based on all the observations. 
(iv) It should be amenable to further mathematical treaiment. 
(v) It should be affected as little as possible by fluctuations of sampling. 

3-3. Measures of Dispersion. The following are the measures of dis- 
persion: 

(1) Range, 

(ii) Quartile deviation or Semi-interquartile range, 
(tit) Mean deviation, and 
(iv) Standard deviation. 


3-4. Range. The range is the difference between two extreme obsertations, 
of the distribution. If A and B are the grcatest and smallest observations respec- 
Gvely in a distnbution, then -its range is A ~ B. 

Range is the simplest but a crude measure of dispersion. Since it is based 
On {wo extreme observations which themselves are subject to chance fluctuations, 
il is not at all a reliable measure of: dispersion. 


35. Quartile Deviation. Quartile deviation or semi-interquartile range 
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Q is given by 
Q=1(Q:-)), BV) 
where Q, and Q; are the first and third quartiles of the distribution respectively, 


Quartile deviation is definitely a better measure than the range as it makes 
use of 50% of the data. But since it ignores the other 50% of the data, it cannot 
be regarded as a reliable measure. 


3-6. Mean Deviation. If x,|f,,7=1, 2, ..., 2 is the frequency distribution, 
then mean deviation from the average A, (usually mean, median or mode), is 
given by 


Mean deviation = 7 x fijxi-A|, Dff=N .(3°2) 


where \x;-A | represents the modulus or the absolute value of the deviation 
(x; - A), when the —-ive sign is ignored. 


Since mean deviation is based on all the observations. it is a better measure 
of dispersion than range or quartile deviation. But the step of ignoring the signs 
of the deviations (x, - A) creates artificialitv and rendcrs it useless for further 
mathematical treatment. 


It mav be pointed out here that mean deviation is least when taken from 
median. (The proof is given for continuous variable in Chapter 5) 

3-7. Standard Deviation and Root Mean Square Deviation. Standard 
deviation, usually denoted by the Greek letter small sigma (0), is the 
positive square root of the arithmetic mean of the squares of the deviations 
of the given values from théit arithmetic niean. For the frequency distribution 


wilfi,t= I..2; een MI, 
o -\V/~ rf Qi - xX) ...(3°3) 
Ni; 


where xv is the arithmetic mean of the distribution and 2 f, = N. 
t 
The step of squaring the deviations (x;~ x) overcomes the drawback of 
ignoring the signs in mean deviation. Standard deviation is also suitable for 
further mathematical treatment (§ 3-7-3). Moreover of all the measures, standard 
deviation is affected least by fluctuations of sampling. 


Thus we sce that standard deviation satisfies almost all the properties laid 
down for an ideal measure of dispersion except for the general nature of extracting 
the square root which is not readily comprehensible for a non-mathematical 
person. It mav also be pointed out that standard deviation gives greater weight 
10 extreme values and as such bas not found favour with economists or 
businessmen who are more interested in the results of the modal class. Taking 
into consideration the pros and cons and also the wide applications of standard 
deviation in statistical theory, we mav regard standard deviation as the best and 
the most powerful measure of dispersion! 


The square of standard deviation is called the variance and is given by 
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°% 
) 


gee Thi -F/ (3-34) 
N i 
Root mean square deviation, denoted by ‘s’ is given by 


sq Fifi Ay G4) 


where A is any arbitrary number. s* is called mean square.deviation. 
3-7-1. Relation between o and s. By definition, we have 


21 te AY et Thee Tae Ze AY 
s = Xfi Qi A) = Hy XSi x+x-A) 


| ale 


E fil (-¥P + (F- AP +2(F-A) Gi -% | 


wt yp (;-3 + (F-AP + Bh 42(F-AVEA =D. 
N ; N ; i 


(F-A), being constant is taken outside, the summation sign. But 2 f; (x; - x) = 0, 
t 


being the algebraic sum of the deviations of the given values from their mean. Thus 
s-xo'+(x-A) =o +d°, whered=x-A’ 

Obviously s~ will be least when d = 0, /.e., x = A. Hence mean square dévia- 

tion and consequently root mean square deviation is least when the deviations 


arc taken from A =x, é.e., Standard deviation is the least value of root mean 
square deviation. 


The same result could be obtained alternatively as follows: 
Mean square deviation is given by 


2e dt sry Ay 
soE Hy af (a; ~ A) 
lt has been shown in § 2:5-1 Property 2 that Z fii- A)’ is minimum when 


A=x. Thus mean square deviation is minimum when A= X and its minimum 
value is 


(s?) min = 4 EfiGi-X Yeo" 


Hence variance ts the minimum value of mean square deviation or standard 
deviation is the minimum value of root mean square deviation. 


37-2. Different Formulae For Calculating Variance. Bv definition, 
we have 
o* = 


LfiQi-x) 


e e e 2 es e a 
More precisely we write if as Oy , Le, variance of x. Thus 
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2 t a 2 
Ox =F = fiQi-x) .-.(3-5) 


If x is not a whole number but cones out to be in fractions, the calculation 
of a,” by using (3-5) is very cumbersome and time consuming. In order to 
overcome this difficulty, we shall develop different forms of the formula (3:5) 
which reduce the arithmetic to a great extent and are very useful for com- 
putational work. In the following sequence the summation is extended over j 
from 1 to nv. 


ae } ees eee <7 oe 
Ox =H Zh r= ay TAGE +x -2x;x) 
Sha a Spe tee Ss 
N r N 1 N. ; 
=H Lfixz +X oe. “5 efi xf-X ...(3°6) 
=>- gee r fix; - -(¥ E fixi| ...(3-6a) 
N jj 


If the values of-x and f are large the calculation of fi, fir is quite tedious. 
In that case we take the deviations from anv arbitrary point. ‘A’. Generally the 
point in the middle of the distribution is much convenient though the formula 
is true in general. We have 


2_1 ay eer a 0 wen TY 
Ox =F % fir x= ay Zh Gs A+A-xX) 


= Efidi+A-¥), where die xi-A. 
os E fide t(A-¥)?+2(A-¥) di] 
== Efid?+(A-¥)+2@-¥).— ni Efids 


We know that if dj=x;-A then x=A +E fidi 
$ 


2-1 5,,72,(_1ys-4) .9f{_lys-g)\(1sey. 
oF = Efi? +| ne fidi| +2| ye || ay Ea 


= Efidi ae Eha| 7) 


= Ox = od {On comparison with (3-6a)| 
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Hence variance and consequently standard deviation is independent of change 


of origin. 
If we take dj = (4j-A)/h so that (4; - A) =/id;, then 


og =t Sf (x,-¥P =s Tfiji-A+A-xX) 
N : Nos 
lL Sfh(id4A—vy 
as Xfi (hdi+A x) 
ohh S fide +(A-¥) +2(A-X) ho Sfidi 
i 4 * 4 
E fid; 
N 


Using x=Ath , we get 


1 ypd?- (3 E fi |" 
NX A 


which shows that variance is not independent of change of scale. 


-A 


Aliter. If a; = a then 


=A+thlid; and YeAth.x Lfidj=Arthd 


Obviously x=h(di—d) 
Ox == Lfai-x) ah? Lfidi-dy =h" og 


=h* Ou : ...(3°8) 


Ge = h 7 


Hence variance is independent of change of origin but not of scale. 
Example 3-1. Calculate the mean and standard deviation for the following 
table giving the age distribution of 542 members. 
Age inyears: 20—30 30—40 40—50 50—60 60—70 70—80 80—90- 
No. of members: 3 61 132 153 140 31 2 
—4 


Solution. Here we take d= 


20 — 30 -9 27 
30 — 40 —122 244 
40 — 50 -132 132 
350 — 60 0 0 

60 — 70 140 140 
70 — 80 102 204 
80 — 90) 6 18 


Lfd = -15| Sfd* = 765 


pee) = 55 - 0-28 = 54-72 vears. 


-(5 2} 7 = 100 | —_ (0-28)*| 


3-6 Fu ndamentals of Mathematical Statistics 


= 100 x 1-333 = 133-3 
oO (standard deviation) = 11-55 vears 
Example 3-2. Prove that for any discrete distribution standard deviation 
is not less than mean deviation from mean. 
[Delhi Univ. B.Sc. (Stat. Hons.), 1989) 
Solution. Let x;|f;, i= 1, 2,3, ..., be any discrete distribution. Then we 
have to prove that 
S.D. ¢ Mean deviation from mean 
=> (S.D.)° ¢ (Mean deviation from mean)* 
=> (S.D.)* = (M. D. from mean)* 
n n 2 
=> NE fei-FY 2 [fF fils Fl) 
If we put | x;-x| =z;, then we have to prove that 


i: 2 2 1 2 ) 2 
ae 7 > ee » ye 
Nae? i Ef | 


1 n : 12 2 
Lé., Neo (ay Fh] 2 0 


‘on Se 

Le, N z fili-Zy = 0 
t=] 

Lé., o? 2 0, 


which is always true. Hence the result. 


Example 3:3. Find the mean deviation from the mean and standard deviation 
of A.P. a,a+d,a+ 2d, ...,a + 2nd and verify that the latter is greater than the 
former. [Delhi Univ. B.Sc. (Stat. Hons.), 1990} 

Solution. We know that the mean of a series in A.P. is the mean of its 
first and last term. Hence the mean of the given series is 


v¥=+(a+a+42nd)=a+nd 


a+(n - 2)d 2d * d? 
a+(n—1)d d 17.d? 
a+nd 0 0 
at+(n+1)d d 1? .d? 


at+(n+2)d ad 2d? 
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Meas 


at+(2n~2)d 
at+(2n~-2)d 
a+2nd 


Mean deviation from mean = == |x-x| 
2n+1 
1 : 
Fra est 2t3t... tn) 
_n(ntt)d 
~ (2n +1) 
2. 1 ogg, 1 9 92 424 52, 2 2 
oO aE a x) map e4 (1° + 2°+3°4...4+7°) 
__ 1 54? n(n+1)(QQn+1)_n(n+1)d° 
~ 2n+1 6 7 3 


Hence standard deviation 


n(n+1) 
a ae 


Verification. 
S.D.> M.D. from mean 
if (S.D.)* >(M.D. from mean) 
2 Ree: 
ie, if nee . (Aes) 
or if (2n +1)? > 3n(n+1) 
or if n+n+1>0 
or-if (n+4)+2 > 0 


which is always ture. 
Example 3-4. Show that in a discrete series if deviations are small 


compared with mean M so that (x/M)° and higher powers of (x/M) are 
neglected, we have 


we . D) 4 see : o° 
()M-~G-=o-, and (ui) H= M La | 
M 
where M is the arithemetic mean, G, the grometric mean, H, the harmonic mean 


and G ts the standard deviation of the distribution. 


Solution. Let X;|f;,1=1,2,..,2 be the given frequency distribution. 
Then we are given that x;=X;-M, i.e.,X;=x;+M where M is the mean of 
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the distribution. We have 
x fi xi =X fi (Xi - M) = 0, (1) 
ri t 


being the algebraic sum of the deviations of the given vaJues from their 
mean. Also 


fixe = & fi (X; - My’ = 0° ...(2) 


(i) By definition, we have 
G=(XF XFL. XA , where N = Ef; 


log Get E filog X;= = 2 fi log Qi + M) 
t t 


M(veelll 


log M + tog 1+ 57] 


--M 


ff log | M 


7 


~logM +5 3 = fi sel 


--M 


‘, 1 Xj ? 
= log M += Ef a F543 (7 ce V5 


the expansion of log (1+4 + . | in ascending powers of (x;/M) being valid since 
|x;/ M|<1. Neglecting (x; hd and. higher powers of (x;/M), we get 
log G = log M + —— ni 2 2 fix; - 55 zi E fixi 


2 


= log M - iE {On using (1) and (2)} 
= log( M07 97™ 
2 : 2 
=> G=Me"°’™ =M 1-— 


? 


neglecting higher powers. 


2 
Hence G=M 1-4. =. ...(3) 
re oe 
(it) Squaring both sides in (3), we get 
2 2 
ae Lr: ae 
2 M- | M? |} 
ACEICUNE (o/M)". 
M?-G? (4) 


(iii) By definition, harmonic mean ” H is given bv 
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1 1 1 ; 
Hy TWX) = ay EV + MY 
-1 
1 fit | Xi 
MN 7 [+ (7M MN 7"\1* 


-1 
<1, the expansion of [1 4 in ascending powers of 


¥; 


Since 


(x;/M) is valid. Neglecting (x;/MY° and higher powers of (x;/M), we get 


1 1 Xi xi 
ae sql 
“ual w 2 — Ta Rhxit say thx 
— 1 + O° [On using (1) and (2 
M\| MM? g )I 
=1 
2 2 
Hom [14S M1 : 
M? 2 
higher powers being neglected.. 
? \ 
se 
Hence H=M|1-— fe 
| M? (5) 


Example 3-5. For a group of 200 candidates, the mean and standard 
deviation of scores were found to be 40 and 15 ;espectively. Later on it was 
discovered that the scores 43 and 35 were misread as 34 and 53 respectively. 

Find the corrected mean and standard deviation corresponding to the corrected 
figures. 

Solution. Let x be the given variable. We are given n = 200, x = 40 and o = 15 


Now y= Tx => ZT rxaz=nx=200x 40 = 8000 
t 
Also o 
Lx7=n(o7 +x ) = 200 (225 + 1600) = 365000 
t 
Corrected 2 x; = 8000 - 34 - 53 + 43 + 35 = 7991 


and Corrected = x;7 = 365000 ~ (34)° - (53)° + (43)° + (35)° = 364109 
f 


7991 a 
Hence, Corrected mean = 00 = 39-955 
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364109 
200 
Corrected standard deviation = 14-97 


Corrected o? = ~ (39-955)? = 1820-54 - 1596-4U.= 224-14 


3-7:3. Theorem. (Variance of the combined series). If n,n. are the 
sizes; X; ,X_ the means, and 0, , 02 the standard deviations of two series, then 
the standard deviation o of the combined series of size n, + nz ts given by 


2 1 > ae ey ee: 
Oo = n (Or + dy°)'+.n2 (07 +d | --.(3°9 
ag Lior +d?)4m (or + 2) (9) 
where d,=xX,-x, d,=X2-X : 
ae nXi + nz Xx2 : ‘; 2 
and x =—————.,, is the mean of the combined series. 
ny+n2 
Proof. Let x17; ¢=1,2,..,21 and x2;; J =1,2,...,22, be the two series 
then 
, ny na 
= 2 seve een 
Xie— ZT x1; OP=— £ (1-1) 
1 Nz eee ; Nr ere ) 
2=— Tx; O72 =— 2 (x2; - 2) 
M2 501 2 jel 
The mean x of the combined series is given by 
2 1 le 2 ny x, +2 X2 
Y= 2X + FZ 2X2; = [From (*)] 
ny, + n2 jad j=l ny + N2 
The variance o7 of the combined series is given by 
2. 1 . be 2 . 
O* = 2 (vi-x) + F K2j-xX) .»(3°10) 
EN el pel 
a I= 
Now 
nay ay 
=\2 = - =, 
ZQyj-xX) = fF Qyi-Xy441-X) 
t=] i=l 
ay sly 
om. a _ —_ —_ _ 
=2 (xj;-x1)° +11 (41 -xX y +2(x1-¥) 2 (1; -%1).3-10a) 
t= l i=l 


n 
But 5 (x1; -¥1) = 9, being the algebraic sum of the deviations the values 
of first oe from their mean. Hence from (3-10a), on using (*#*), we get 
n 
z (nj-¥) emo +m (x) -¥) =mortmdr (3°10) 
meer =xX)-2X. 
Similarly, we get 
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nz 
E(aj-x)= 5 (x2j-X2+%2-X)° 


je 1 jel 
n2 = zs = 5 
= 2 (x2j- x2)? +n2(x2-X y = n7 02" + nz dz .--(3°10c) 
jo 5 
where dz =X2- 


Substituting from 6 10b) and (3-10c) in (3-10), we get the required 
fornwla 


a d 
ie | m1 (or + dy’) + nz (a7 + ?)| 
This formula: can be simplified still further. We bave 


an = my xX) +12X2_ _ 2 (41 - x2) 


d, =X, - = 
ny +n ny) +n? 
gees) «ike - hes mxi_tn2Xx2 ny x2-X] 
d>= 2-X=X _ mix + 2x2 _ m1 (42-%1) 
ny +n2 ny+n2 
Hence 
2/;= — \2 2; — \2 
2 1 2 x fmny (x1-242)" mn (42-41) 
Oo =- nO +202" +) ¢ 
ny+nr (m, + 12)" (my + n2)° 
2 D) nh, n2 a 
= ny Oy +207 + (x, - x2) (3°11) 
ny, +n? ny) +n2 


Remark. The formula (3-9) can be easily generalised to the case of more 
than two series. If ;, x;and 0; ,1=1,2,...,k are the sizes, means and standard 
deviations respectively of k-coniponent series then the standard deviation o of 

k 


the combined series of size 2 nj _ is given by 


tel 
2 1 | 7 , ; : , 
= —_—_——— | ni (Or + d)") +2 (02° + do’) +... + mk (GK +. 
nytna+... tM 1 (Or + dy") + 12 (02 + dy’) «(OK + dk) 
, 1..(3°12) 
where dj=xj-x ; i=1,2,..,k 


My Ny + NIXI+ 2. HMEXE 
Ay t Nat 1... Mk | 
Example 3-6. The first of the tvo samples has 100 ttems with mean 15 
and standard deviation 3. If the whole group has 250 items with mean 15-6 
and standard deviation Vv 13-44, find the standard deviation of the second group. 
Solution. Here we are given 
= 100, ¥; =15 and 0) =3 
n=n,+n2=250, x = 15-6, and o =V13-44 


and X= 


We want ox. 
Obviously n2= 250 + 100 = 150. We have 
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ze my X) + 2X2 ay Sees 100 x 15 + 150 x x2 
nytn2 250 
=> 150 x2 =250 x 15-6 ~ 1500 = 2400 
— 2400 
oe 150 _ 150 7*° 


Hence dd) =x,;-x =15~-156=-0°6 
and d2=x2- eee cece 
The variance o° of the combined group is given by the formula : 
(1 + n2)07.= n (o: + d;’) + N2 (a? + d>’) 
=> 250x 13-44 = 100 (9 + 0-36) + 150 (a2? + 0°16) 
150 G2? = 250 x 13-44 - 100 x 9-36 - 150 x 0-16 
= 3360 - 936 - 24 = 2400 


>» 2400 
s Or = 150 * 16 
Hence o2=V16 =4 


3-8. Co-efficient of Dispersion. Whenever we want to compare the 
variabilitv of the iwo scries which differ widely in their averages or which are 
measured in different units, we do not merely calculate the measures of dispersion 
but we calculate the co-efficients of dispersion which are pure numbers inde- 
pendent of the units of measurement. The co-efficients of dispersion (C.D.) 
based on different measures of eres are as follows : 

1. C.D. based upon range =4 ce A~. where A and B are the greatest and 
the smallest items in the series. 


2. Based upon quartile deviation : 
Cp, = 2-1/2 _ 3-0 
(Q3+01)/2 Q3+Qi 


3. Based upon mean deviation : 


- C.D. = 


Mean deviation 
Average from which it is calculated 
4. Based‘upon standard deviation : 
S.D. 6 


Mean x 
3-8-1. Co-efficient of Variation. 100: times the co-efficient of desperison 
based upon standard deviation is called co-cffcient of variation (C.V.), 


C.V. = 100 x= / (3-13) 


C.D. = 


ccording to Professor Karl Pearson who suggested this measure, C.V. is te 

percentage variation in the mean, standard deviation being considered as the total 
variation in the mean. 

For comparing the variability of two series, we calculate the co-ctficient 

of variations for cach series. The series having greater C.V. is said to be 

more variable than ihe other and the scrics having lesser C.V. is said to be 
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more consistent (or homogenous) than the other. 
Example 3-7. An analysis of ‘monthly wages paid to the workers of two 
firms A and B belonging to the same industry gives the following results : 


‘ . Firm A Firm B 
Number of workers 50C 600: 
Average maonilily wage Rs. 186-00 Rs. {25:00 
Variance of distribution of wages 8} 100 


(i) Which firm, A or B, has a larger wage bill ? 
= ( 

Per ow ° oo * & 7 yo «ge . . eo e 
(ii) In which firm, A or B, ts there greater yartability in individual wages? 
(iit) Galculate (a) the average monthly wage, and (b) the variance of the 

distribution of wages, of all the workers in the firms Acand B taken together. 

Solution. 

(i) Firma: 

No. af wage-cafner (sav) = 500 

Average monthly wages (sav).v) = Rs.186 

Total wages paid 

Avcraze monthly wage = 7 

. = No. al workers 

Hence total wages paid to the workers = mt, = 500 x 186 = Rs. 93.090 

Firm B 

No. of wage-carners (sav)nz = 600 

Averaye monthly wages (Sav) ¥:= Rs.175 

Total wages paid to the workers = diz.¥- = 600 x 175 = Rs. 1,05,000 

Thus we see that the firm B has larger wave bill. 

(ii) Variance of distnbution of wages in firm A (Sav) o,” = 81 

Vaniancce of distribution of wages in firm B (say) Gx = 100 


ahs ? : Q x: 
C.V. of distribution of wages for firm A = 100 x = are = 4:84 
4} v 
100 x 10 


nn ie 

Since C.V. for firm B is greater than C.V. for firm A, firm B has greaier 
Variability in individual wages. 

(it) (a) The average monthly wages (sav) x, of all the workers ii the 
wo firms A and B taken together is given by 

=. mx, + mx> — 500 x 186+ 600x175 — 198000 | 


sage fees AS ey O2 
C.V. of distribution of wages for firm B = 100 « = = 


= —~— = =R_ [80 
ny + ns 500 + 600 L100 
(65) The combined varianee o is given by the formulas 
. | 7 “ . ae 
SS [ m1 Cor + ay )+ Wr (OS + de \ 
Nyt: ne 
Wheied, = ¥,- Y and d:=)- ¥ 


Here d; = 186 - 180 = 6 and d>= 175 - 180 =- 5 
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> 500(81+ 36)+600(100+ 25) 133500. 
ee 500 + 600 aig, 
EXERCISE 3 (a) 


1. (a) Explain with suitable examples the term ‘dispersion? State the 
relative and absolute measures of dispersion and describe the merits and demerits 
of standard deviation. 

(b) -Explain the main difference between mean deviation and standard devia- 
tion. Show that standard deviation is independent of change of origin and scale, 
“(c) Distinguish between absolute and retative measurés of dispersion. 

2. (a) Explain the graphical method of obtaining median and quartile 


deviation. (Calicut Univ.B.Sc, .April 1989) 
(6) Compute quartile deviation graphically for the following data : 
Marks : 20-30 30-40 40-50 50-60 60-70 70 &over 

Number of 
students : 5 20 14 10 8 5 


3. (a) Show that for raw data mean deviation is minimum when measured 
from the median. 


(b) Compute a suitable measure of dispersion for the following grouped 
frequency distnbution giving reasons : 


Classes Frequency 
Less than 20 30 
20 - 30 20 
30 - 40 15 
46 - 50 10 
50 — 60 5 
(c) Age distribution of hundred life insurance policyholders is as follows: 
Age as on nearest birthday Number 
17 - 19-5 9 
20 -— 25-5 16 
26 — 35:5 12 
36 — 40-5 26 
41 -— 50-5 14 
531 - 55:5 12 
56 — 60:5 6 
61 — 70:5 5 


Calculate mean deviation from median age. 

Ans. Median = 38-25, M.D.=10-605 

4. Prove that the mean deviation about the mean x of the variate x, the 
frequency of whose ith size x; is f; is given bv 
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lean 


X¥1< & XS £ 


Hint. Mean deviation about mean 


*[e E fi(E- x) + ‘aba 


Xj >X 


“5 (<2 fi(xi- x) + & ern 


Xji< Xj >X 


Since (2fi(xi- x) = 0, 
Lfi(xi- ¥) + 2 fi(xi- x)= =( 


x;> ¥ x;Q X¥ 


MD-= 5 (22 ee Ne x) - ial oe | Oa sil 


5. What is standard deviation ? Explain its superiority over other measures 
of dispersion. 
6. Calculate the mean and standard deviation of the following distribution: 
x: 25—7.5 7.5 —12.5 12.5 —17.5 17.5 —22.5 


Xj<X 


f: 12 28 65 121 

x: 22.5 —27.5 27.5 — 32.5 32.5 — 37.5 37.5 — 42.5 425 — 47.5 
f: 175 198 176 120 66 

x: 47.5 — 52.5 52.5 —57.5 57.5 — 62.5 

f: 27 9 3 


Ans. Mean = 30.005, Standard Deviation = 0.01 


7. Explain clearly the ideas implied in using arbitrary working orgin, and 
scafe for the calculation of the arithmetic mean and standard deviation of a 
frequency distribution. The values of the arithmetic mean and standard deviation 
of the following frequency distribution of a continuous variable derived from 
the analysis in the above manner are40.604 Ib. and 7.92 Ib. respectively. 

x:-3 2 -1 0 1 2 3 4 Total 

f: 3 15 45 57. 0 —50—— ss 6 Oo DHsCD”D 240 

Determine the actual class intervals. 


8. (a). The arithmetic mean and variance of a set of 10 figures are known 
to be 17 and 33 respectively. Of the 10 figures, one figure (i.e.,26) was 
subsequently found inacurate, and was weeded out. What is the resulting (a) - 
anthmetic mean and (b) standard deviation. (M.S. Baroda U. BSc. 1993) 


(6) The mean and standard deviation of 20 items is found to be 10 and 
2 respectively. At-the time of checking it was found that one item 8 was , 
incorrect. Calculate the mean and standard deviation if 
(i) the wrong item is omitted, and 
(W) it is replaced by 12. 
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(c) For a frequency distribution of marks in Statistics of 200 candidates 
(grouped in intervals 0-5, 5~10,..., etc.), the mean‘and standard deviation were 
found to be 40 and 15 respectively. Later it was discovered that the score 43 
was misread as 53 in obtaining the frequency distribution. Find the corrected 
meanand standard deviation corresponding to the corrected frequency distribution, 


Ans. Medan = 39-95, S.D. = 14:974. 


9. (a) Complete a table showifg the frequencies with which words of 
different numbers of letters occur in the extract reproduced below (omitting 
punctuation marks) treating as the variable the number of letters in each word, 
and obtain the mean, median and co-efficient of variation of the distribution : 


"Her eves were blue : blue as autumn distance-blue as the blue we see, 
between the retreating mouldings of bills and woqdy slopes ona sunny September 
morning : a misty and shady blue, that had no beginning or surface, and was 
looked into rather than at. 

Ans. Mean =:435,- Median = 4, 0 =2:23 and C.V. = 51-26 

(6) Treating the number of letters in each word in the following passage 
as the variable x, prepare the: frequency distribution table and obtaif its mean, 
median, mode and variance. 

"The reliability of data must always be exainined before any attempt is 
made to base conclusions upon them. This is true of all data, but particularly 
so of nuinerical data,,.which do not carry their quality written large on them. It 
is a waste of time to apply the refiited theoretical methods of Statistics to data 
which are suspect from the, beginning. " 

Ans. Mean = 4-565, Median = 4, Mode = 3, S.D. = 2-673. 

10: The mean of 5 observations is 4-4 and variance is 8-24. It three of 
the five observation are 1, 2 and 6, find the other two. 

11. (a) Scores of two golfers for 24 rounds were as follows : 


Golfer A : 74, 75, 78,72, 77; 79, 78, 81, 76; 72; 72, 77, 74, 0; 78, 79,'80, 81, 
74, 80, 75, /1,/3. 
Golfer B : 86, 84, 80, 88, 89, 85, 86, 82, 82, 79,.86. 8v, 92, 76, 86, 89; 87, 83, 
80, 88, 86, 81, 81, 87 
Find which golfer may be considered to be a more consistent player ? 
Ans.. Golfer B is more.consistent player. 
(6b). The sum and ‘sum of squares corresponding to length X (in cms.) wae 
weight Y (in gms.) of 50 tapioca tubers are given below : 
D¥= 212, ZX’ = 9028 
LY= 261, Y= 1457-6 
Which is more varying, the length or weight. 
12. (a) Lives of two models of tefrigerators turned in for new nrodels in 
a recent survey are 
' Life 
(No. of years) Model A Model. B. 
0-2 5 2 
2-4 ~ 16 7 
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4-6 13 12 
6-8 7 19 
8 - 10 5 a 
1Q — 12 4 1 


What is the average life of each model of these: refrigerators ? Which model 
shows more uniformity ? 
Ans: C.V. (Model A)=54-9%, C.V. (Model B)=3-62% 


(b) Goals scored’ by two teams A and B in a football season were as 
follows : 


No. of goals scored No. of matches 
ina match A B 
0 27. 17 
| 9 9 
2 8 6 
3 5 5 
4 4 3 


(Sri Venketeswara.U. B.Sc. Sept. 1992) 
Find out which team is more consistent. 
Ans. TeamA: C.V.= 122-0, Team B: G.V.= 108-3. 


(c) An analysis ‘of thonthly wages paid to thé workers in two firms, A 
and B belonging to the same ‘industry, gave the following results : 
Firm A Firm B 


No. of wage-earners , 986 548- 
Average monthly wages, Rs. 52-5 Rs,, 47-5 
Variance of distribution of wages 100 121 


(i) Which firm, A of B, pays out larger amount as monthl¥ wages ? 
(ii) In which firm A or B, is there greater variability in individua] wages? 


(iii) “What are the measures of average monthly wages and the variability 
in individual wages, Of dll tlie workers ‘in the two firms, A and B taken together. 


Ans. (i) Firm B pays a larger amount as monthly wages. 
ii) There is greater variability im individual wages in firm B. 
(iii) Combined arithmetic’ mean = Rs.49-87. 
Combined standard deviation = Rs.10-82. 


14. (a) The following data give the arithmetic averages and standard devia-: 
tions of three,sub-groups. Calculate the .arithmetic average and standard devia- 
tion of the whole group. 


Sub-gruup ‘No.ofmen Average wages (Rs.) Standard 
P deviation (Rs) 
A 50 61-0 8:0 
B 100 70-0 9:0 
C 120 80-5 10:0 


Ans. Combined Mean = 73, Combined $.D.=11-9- 
(6) Find the missing information from the following data : 
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| Group 1 | Group Ht | Group Ut 


Number 50 i 90 200 
Standard Deviation 6 7 9 7:746 
Mean 113 - ? 115 116 


Ans. m= 60, x2 = 120 and 63= 8 


15. A collar manufacturer is considering the production of a new style 
collar to attract young men. The following statistics of neck circumference are 
avail based on the measurement of a typical group of students : 

Mid-value : 12-5130 13:5 140 145 15:0 15:5 160 
in inches 
No. of students : 4 19 30 63 66 £=29 18 l 

Compute the mean and standard deviation and use the criterion x+ obtain 
the largest and smallest size of collar he should make in order to meet needs 
of practically all his customers bearing in mind that the collars are worn on 
average 3/4 inch larger than neck size. (Nagpur Univ. B.Sc., 1992) 

Ans. Mean = 14-232, $.D.=0-72, largest size = 17-14", smallest size = 12-83" 

16. (a) A frequency distribution is divided into two parts: The mean and 
standard deviation of the first part are mi and si and those of the second part 
axe m2 and s2 respectively. Obtain the mean and standazd deviation for the 
combined distribution. {Delhi Univ. B.Sc.(Stat.Hons.), 1986] 

(b) The means of two samples of size 50 and 100 respectively are 54-1 
and 50-3 and the standard deviations are 8 and 7. Obtain the mean and standard 
deviation of the sample of size 150 obtained by combining the two samples. 

Ans. , Combined mean = 51-57. Combined S.D: = 7:5 approx. 

(c) A distribution consists of three components with frequencies 200, 250 
and 300 having means 25, 10 and 15 and standard deviations 3, 4 and 5 respectively. 


Show that the mean of the combined group is 16 and its standard deviation 
7.2 approximately. (Bangalore Univ. B.Sc. 1992) 


17. In a certain test for which the pass marks is 30, the distribution of 


His of passing candidates classified by sex (boys and girls) were as given 
elow : 
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The overall means and standard deviation of marks for bovs including the 
30 failed were 38 and 10. The corresponding figures for girls including the 10 
failed were 35 and 9. 

(i) Find the mean and standard deviation of marks obtained by the 30 
boys who failed in the test. 

(ii) The moderation committee argued that percentage of passes among 
girls is higher because the girls are very studious and if the intention is to pass 
those who are really intelligent, a higher pass marks-should be used for' girls. 
Wiothout quetioning the propriety of this argument, suggest. what the pass mark 
should be which would allow only 70% of the girls to pass. 

(iii) The prize committee decided to award prizes to the best 40 candidates 
(irrespective of sex) judged on the basis of marks obtained in the test. Estimate 
the number of girls who would receive prizes. 

Ans. (i) X = 22:83, 02 = 8-27 (ii) 39 (iii) 15 

18. Find the mean and variance of first n-natural numbers. 

(Agra Univ. B.Sc., 1993) 


—- ne 
Ans. x = 7 2 


19. In a frequency distribution, the n intervals are 0 to 1, 1 to 2, ..., 
(n—1) to n with equal frequencies. Find the mean deviation and variance. 

20. If the mean and standard deviation of a variable x are m and o respec- 
tively, obtain the mean and standard deviation of (ax + b)/c, where a, b and c 
are constants. 


- l1,- a 
Ans. u = ~ (ax+ 5), Ou= | 


o 
21. Ina series of se we obtain m, values of magnitude 
Xi, m2 values of magnitude x2, and so on. If x is the mean value of all the 
measurements, prove that the standard deviation is 


=m, (k - x,)’ _ 8° 
: Sa 


where x = k+ 5and kis any constant. Delhi Univ. B.Sc. (Stat. Hons.), 1992 


22. (a) Show that in a discrete series if deviations are small compared 


with mean M so that (x/M) and higher powers of (x/M) are neglected, prove 
that 


(i) MH=G (I) M- 2G+H=0, 
where G is geometric mean and H is harmonic mean. 

(b) The mean and standard deviation of a variable x are m and o respectively. 
If the deviations are small compared with the value of the mean, show that 


(i) Mean (vx) = Vm (4 - oe 
\ 8m 


2 


1 307 
(ji) Mean (Fe SOY pa t + aa approximately. 
a, (M.S. Baroda U. BSc. 1993) 
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(c) If the deviation X; = x;- M is very small in comparison with mean 4 
and (X;/‘M) and higher powers of (X;/ MY are neglected,, prove that 


V= 2(M ~ G) 
"  oM 
where G is the geometric mean of the valucg x;, x2, ...,X%, and V is the 
éoefficient of dispersion (6/M). (Lucknow Uniy. B.Sc., 1993) 


23. From: a. Sample-of ‘observations the arithmeiic meah and variance. dre 
calculatcd. It. 1s then found that one of the values, .t,, is in error and should be 
replaced by x,’. Show that thé’adjustment to the Variance to“correct this error is 


er eee Arsat2t | 
aC S Rend e (x Pa ae 
where T is the total of the original results. 
(Meerut Univ. . = 1992; Delhi Univ. B.Sc. (Stat. Hons.), 1989, ios} 
Hint. o=1 5 Y xx? 


Lis} 
1/2. 2 2\ T2 
= A xp +42 4+... +n) - 
n i: 
where T= o +Xq +... $Xy. 


Tet oO; be the corrected variance. Then 


2 
2 I1f2. 2 2 T -¥, +4)" 
O1= 7 Xt X2 +... Xf - —— 
| 


Adjustment to the variance to-correct the crror is : 


6p 26%— { x) - xt 3 (7-21 +x/) - 7} 


Nn 


a4 { tea} (x'— a} ord (av * +')] 


24. Show that, if the variable takes the values 0, 2; ya with 
[requencics proportional to thé binomial coefficients "Co io "Co, ight Ca 
respectively’ then the mean-of the distribution 4s (7/2), the mean square deviation 
abaut, x'=0 18 72 (2.4 1)/4 and. the variance.is 1/4 

[Delhi Univ. B.Sc. (Stat. Hons.), 1991] 

Hint. N= f= "Co + "C), +"Cos oe $C = @| + |)" = 2" 

Lx = 0."Co.+ LC, + 2C, +3. "Gat... HC, 


=n | ec ).4- 


= fi @| + Iy-! =n. Qe-) 
et (a - |)- 
Hence mean (x) = 7 n 2x aoe 5 
The mean square adie s?, (say), about the- point x = = 01 is given by 
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52 walt Nae? =>, [12."C, + 22, "C, + 32."C, + ae + n2"C,] 


= Fn at +2(ni- Hae 5 (n= 1) (9— 2) 4+ 0. +2) 


SaCthace I) + naan oe +1] 


+{(n-—l)4t(n- 1) (n—2) 4. ut G1— 4))) 

rrr G 'Co + n-IC, aa seal OF es ee ~ "-IC,_)) 
+ ((n- 1) 7g + 2 7C, +... +" 7C, 9) 
= on [(+ly~! + (n-1)Cl syhepo +) 


4 
= n(n + 1) neon 
we eat ad. 40 


25. (a) Let r be the range and s be the standard deviation of a set of 
observations Xj, 2, ...,%;,7 then prove by gencral reasoning or othcrivise that 
gsr. 


Hint. Since x;-x <r, i=1,2,...,”, we have 
1 N _ bets: 1 ae 
N Li ( fe ) N afi Cr ) 
| n 
=> SPs Pa Sfort = ssi 
i=] 


(b) Let r be the range and 
| ) 
Xj- Xx 
s= (55 J 5 (- x¥ 
be the standard deviation of a set of observations x}, x, ..., X,, then prove that 


° 


| 
z 
S< {—*] [Punjab Univ. B.Sc (Stat. Hons.), 1993] 


3:9. Moriients. The rth moment-of a Variable x- about any point. x = A, 
usually denoted by p, is givenby —- 


=4 LA@i-AN, LGEN v1) 
= Lhidi — " ... G-14a) 


where d; = x; -A. 


The rth moment of a variable about the mean x, usually denoted by U,1s 
given by 


l - ] ' 
Hes DF Gin x)= 2 fiz ... (3-15) 
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where Zi=xX-X. 
In particular 


1 oft see 
Ho = Hy 2 files - ») n th 1 


and 1 = * z fi(xi- x) = 0, being the algebraic sum of deviations from 
the mean. Also 
1 : 
U2 = y LhiGi- x= 0° .-.(3°16) 
These results, viz.. ¥o= 1, pi = 0, and p= o ; are of fundamental im- 
portance and should be committed to memory. 
We know that if d; = x;- A, then 
Fe At OE fidi = A + wy (3-17) 
3-9-1. Relation between ‘moments about mean in terms of moments 


about any point and vice versa. 
We have 


Up = x % fi i x) = + > fi i At+tA- xy 
= yrhildi+a- x), where dj= x;- A 
Using (3°17), we get 
Uy = yEhidi- y1')’ 
“TEA ld-'Cs Ge" Va" Codi? wi? "Cadi? i? +. (1) an] 


.-(3°18) 
= te — Cyptear’ wr’ + "Co pee’ i? —-...+(-1) pr” | On using (3-14a)] 
In particular, on putting r = 2, 3 and 4 in (3-18), we get 


Yo = po’ = pr? 

M3 = ps’ — 3p2’ pi’ + 21” 3.19) 
Ye = pe’ — 4yis'par’ + Gp’ pr? _ 3p” 

Conversely, 


ee ee re eee 
Mr Nr AG A) F rts el A) 


: 5 2 fi Git u1')! 
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where x; - X = and x =A + MY 
, -2 , , 
Thus pt,’ = get (i 4Cyae py + + Cath by’? +... + p)’7) 


=H, + "Cypy Hi’ + C2 My-2 a)? +... +p)". [From (3-15) 
In particular, putting r= 2, 3 and 4 and noting that 1, = 0, we get 
[o’ = be + Hy? 
Ha = Ms + 3p2 by’ + Hy? .++ (3-20) 
Ha’ = Wy + 4ptgyy” + Oy Wy”? +) 


These formulae enable us to find the moments about any point, once the 
mean and the moments about mean are known. 


3.9.2 Effect of Change of Origin and Scale on Moments. 


=a , so thatx=A+hu,x =A+hu and x -—x=h(u- u) 
Thus, : moment of x pn any pointx = is given by 
ni DSi (x; - A)" =, x DSi (hujy = hr a, DSi uj 


And ae moment of x ns mean is 


br= Wy Ei (- FV = 4 Xf [A(u; -— u))" 


xo—- 
Let “= 


pl. PAY 
=h N Xfi (uj) 


Thus the rth moment of the variable x about mean ts h” times the rth moment 
of the variable u about its mean. 


3-9-3. Sheppard’s Corrections for Moments. In case of grouped 
frequency distribution, while calculating moments we assume that the 
frequencies are concentrated at the middle point of the class intervals. If the 
distribution is symmetrical or slightly symmetrical and the class intervals are 
not greater than one-twentieth of the range, this assumption is very nearly true. 
But since the assumption is not in general true, some error, called the ‘grouping 
error’, creeps into the calculation of the moments. W.F. Sheppard proved that if 


(i) the frequency distribution is continuous, and 
(ii) the frequency tapers off to zero in both directions, 


the effect due to grouping at the mid-point of the intervals can be corrected by 
the following formulae, known as Sheppard’s corrections : 


2 
M2 (corrected) = U2 — - ... (3-21) 


3 (corrected) = [, 


M4 (corrected) = py - Ls [> + a h4 
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where /2 is the width of the class interval. 
3-9-4, Charlier’s Checks. The following identities 
Let = Lfe oN; Df + 1% =LDfr+ 2 fee 
Lx + 1R => fed + 3D fe? + 3D fe +N 
Lx + 1) = 2d fet + 4 fx? + OL fe? + 4 fe + N, 


are often used in checking the accuracy in the calculation of first four moments 
and are known as Charlier’s Checks. 


3°10. Pearson’s B and y Coefficients. Karl Pearson defined the following 
four coefficients, based upon the first four moments about mean : 


M3 , 
Bi=—z. ¥.=+VB; and B)="3, 2 =B2 -3 ... (3-22) 
2 2 ; 
It may be pointed out that these coefficients ‘are pure numbers independent of 


units of measurement. The practical utility of these coefficients is discussed in 
§ 3-13 and § 3-14. 


Remark. Sometiines, another coefficient based on moments, viz, Alpha 
(a) coefficient is used. Alpha coefficients are —— as : 
Lt pa VB, = 
Oo, = oa =0, a Te 1, = oo = Vis oy = 4 = Br 


3-11. an Moments. Factorial moment of order r about the origin 
of the frequency distribution x;!f;, (i= 1, 2, ... n), is defined as 


Hy = yy & fix? ... (3-23) 


n 
where x) = x (x- 1) (x-2)... ¥—r+ lI) andN= 2 fi 
i= 
Thus the factorial ‘moment of order r aboiit any point x =a is given by | 
»_ J wate 
Hwy =H af (x;- a)” .+ (3-24) 


where (x — a) = (x —a) (x-a@—-1)... &-aert 4) 
In particular from (3-23), we have 


Har’ = py Eii= Ur" (about origin) =-Mean (#) 


; ] ] 
Hay = yy LAix= N 2% (x; - 1) 


l 1 ete 
Nn ESR N Ui xi = bo - By 
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Hay = x afi x)= . 2 fii (x; 1) @- 2) 
= LANE -3 yp Df? +25, Doi 
= fy’ — 3a" + 2p)’ 
yay = 7 2 fix® = x 2 fei (x;- 1) Gj - 2) Gj - 3) 
= Dain (x; — 6x72 + 11x;- 6) 
ze‘ 4 a5 3 li 2_¢1+ : 
= N eSixi — 6. NU dSixi + We LSi% —6.5) LAX 


= py’ — Og” + V1pts’ = 6p)" 
Conversely, we will get 


Ly” = Woy 
bo’ = Hay + Hay 
{3° = Way + 3Hay + Hay .+. (3-25) 


Ha’ = Hyg) + Opeay’ + THe2)’ + Hay’ 
3°12. Absolute Moments. For the frequency distribution x; /f; i = 1, 2, ... 
n, the sth absolute moment of the variable about the origin is given by 


Le r 
N= fix | N= Ds ... (3+ 26) 
i=l 
where | Xj | represents the absolute or modulus value of Xj. 


The th absolute moment of the variable about the mean x is given by 
| Hn -_ r 
NLR x | ..- (3-26a) 
is 


Example 3-8. The first four moments of a distribution about the value 4 of 
the variable are — 1-5, 17, — 30 and 108. Find the moments about riean, B, and 


p>. 
lind also the moments about (i) the origin, dnd (ii) the point-x = 2. 
Solution. In the usual notations, we are given A’= 4 and 
uy = —1-5, p>’ = 17, pa’ = — 30 and wy’ = 108. 

Moments about mean : i, = 0 
My =p) —py’? = 17 —(-1-5)* = 17 — 2-25 = 14-75 
Hy = Hy’ — 3p)’ py’ + 2,2 - 

=.— 30 —3 x (17) X (-1-5) +,2 (-1-5)% 

=.— 30 + 76-5 — 6-75 = 39-75 
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M4 = Wa’ — 43’ Wy’ + 6py’ py? — 3,4 
= 108 — 4(-30)— 1-5) + 6(17)(-1-5)* ~ 3(-1-5)4 
= 108 — 180 + 229-5 — 15-1875 = 142-3125 


_) _ 39-75 
Hence B, a (14-75)3 > 0-4924 


= Be (1423125) _ 


Also x =A +p)’ =44 (-1-5)=2'5 
Moments about origin. We have 


x =2-5, Hy = 14-75, pt, = 39-75 and wp, = 142-31 (approx). 


We know x =A + {1)’, where 1,’ is the first moment about the point x =A. 
Taking A = 0, we get the first moment about origin as {1,’= mean = 2-5. 
Using (3-20), we get 
Ilo” = My + py’? = 14-75 + (2-5)? = 14-75 + 6-25 = 21 
plz” = fs + Bylo Hy’ + yis3 = 39-75 + 3(14-75) (2-5) + (2-5)5 
= 39-75 + 110-625 + 15-625 = 166 
Ha’ & Hg + Apa’ + Oops? + yy 
= 142-3125 + 4 (39-75) (2-5) +.6(14-75)(2-5)? + (2-54 
= 142-3125 + 397-5 + 553-125 + 39-0625 
= 1132. 


Moments about the point x = 2. We have x =A + y,’. Taking A = 2, the 
first moment about the point x = 2 is 


uy) =x -2=2:5-2=0-5 
Hence 
Mo’ = py + py? = 14-75 + 0-25 = 15 
3” = [lz + Buoy’ + py = 39-75 + 3(14-75)(0-5) + (0-5)3 
= 39:75 + 22-125 + 0-125 = 62 
Hg’ = Wy + 4pegpty’ + Opopys”? + 1,4 
= 142-3125 + 4(39-75)(0-5) + 6(14-75)(0-5)2 + (0-5)4 
= 142-3125 + 79-5 + 22-125 + 0-0625 
= 244 : 
Example 3-9, Calculate the first four moments of the following 
distribution about the mean and hence find 6, and By. 
x: 0 J 2 3 4 5 6 7 8 
f: ] 8 28 = 56 70 «65606 28-88 ] 
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Solution. CALCULATION OF MOMENTS 


ogee ath 
= <2 fd = 0 and uy’ = +E fd‘ = 


Mone about mean are : 
= 0, H2= po! yi” = 2 
ne us’ — 3p2'pr’ + 2p” = 
Wy = <a 4u3'u1' + 6p2'pr’ — 3p. = 11 


0 


11 
B, = 2 = 0 f= aa 7285 
we 4 
Example 3-10 For a catia the mean is 10, variance is 16, y2 is + 
l and Bo is 4. Obtain the first four moments about the orgin, Le., zero. 
Comment upon the nature of distribution. 


Solution. We are given 

Mean = 10, w2= 16, yi = +1, B1= 4 
First four moments about origin (y1', W2', 43’, pa’) 
ty’ = First moment about origin = Mean = 10 


fo = [> - uy” = Uo’ = [2 + uy” => M2’ = 16 + 10° = 116 


we have =_ => ca 1 
[2 
=> us= 2, = (16)? = 4° = 64 


a 
M3 = M3) — 3y2'py' +- . 
=> 3 = 3+ Buoy — 2u1” 
64+ 3x 116x 10- 2x 1000= 3544~- 2000 = 1544 
Now B2 = A => 4 > Ly = 4x 16° = 1024 
Te 


and = ‘ tog oe 4 
Ma = wey — Aus + Ge pa” - py 
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=> wy = 1024+ 4x 1544x 10-6 116 100 + 3 x 10000 
= 92784 - 69600 = 23184. 

Gomments of Nature of distribution: [c.f. § 3:13 and § 3-14] 
Since ¥; = +1, the distribution is moderately positively skewed, i.e, if we draw 
the curve for the given distribution, it will have longer tail towards the right. 
Further since B»= 4> 3, the distribution is leptokurtic, i.e., it will be more 
peaked thai the normal curye. 

Example 3-1'1. /f for a random variable x, the absolute moment of order 
k exists for ordinary k = 1, 2, ..., n-1, then the following inequalities 

‘ K K és Ye m 
di. Be s Bro Best, (i) Bes Bast * 
holds for k=1, 2, ..., n=, wher: Bx is the ah absolute moment about the origin. 
[Dethi Univ. B.Sc. (Stat.Hons.) 1989] 
Solution. If x;|f;, i=1, 2, ..., is the given frequency distrbution: then 


t_ouié 
Bee tEfht| Al) 
uw and v be arbitrary real balla then the ‘expression 


y f [ u tx-Y?] + vi xeon’, is non-negative. 


iz] 


=n y f [ u Lx, 1-2 + vi x; yernray” > 0) 


i=l 
=> p? > fix! + v2 > f; el? + 2uv Xf; xi" 20 
es throughout by N and using relation (1), we get 
UBx- + V" Baar + 2uvBx 2 0, i.e, u'Be-rt+ 2uvB, + V Bas 2 0 »(2) 


We know that the condition for the expression a vo+ Qhixy + by* to be 
non - negative for all values of x and y is that 


[a Al = 0 
lh ob | 
Using this result, we get from (2) 
Be-1 Px 
Bx Wee 
=> Beat» Bx+i - Bx 2 0 --(3) 
Raising both sides of (3) to power k, we get 
as 2 Br-1- Bawa” . (4) 
Putting k=1, 2, ..., K-1, k successivelv in (4), we get ! 
pi- s Bop. 
62s Pips 
BS s Bip 
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2k oak ak 
t SBe-r Bess 
Multiplying these inequalities and noting that.By = 1, we get 


c’' < Bees fork=1, SAS | aoa 
Raising both sides of the inequality to the power kK 7 y? we get 
eS Beg EHD 1. (5) 


Remark. Result (5) shows that B,!“ is an increasing function of k. 


EXERCISE 3 (b) 


1. (a) Define the raw and central moments of a frcquericy distribution. 
Obtain the relation between the central moments of order rin terms of the raw 
snoments. What are Sheppard’s corrections to the central moments ? 

(b) Define moments. Establish the relationship between the moments about 
mean, &e., Central Moments in terms of moments about any arbitrary point 
and vice versa. 

The first three moments of a distribution about the value 2 of the variable 
are 1, 16 and — 40..Show that the mean is 3, the variance is 15 and p13 = —86. 
Also show that the first three moments about x = 0 are 3, 24 and 76. 

(c) For a distribution the mean is 10, variance is 16, y, is + 1 and B, is 4. 
Find the first four moments about the origin. . 

Ans. Lu,’ = 10, Ly’ = 116, Ll; = = 1544 and Tr = 23184. 

(d) (i) Define ‘moment’. What is its use ? Express first four central 
moments in terms of moments about the origin. What is the effect of change of 
origin and scale on pl; ? 

(it) The first three moments of a distribution about the point X = 7 are 3, 
|! and 15 respectively. Obtain mean, variance and B;. 

2. The first four moments of distribution about the value 5 of the variable 
are 2, 20, 40 and 50. Obtain as far as possible, the various characteristics of the 
distribution on the basis of the information given. 

Ans. Mean = 7, pl, =16, Hy = — 64, pW, = 162, B, = | and B, = 0-63. 

3. (a) If the first four moments of a distribution about the value 5 are equal 
to —4, 22, - 117 and 560, determine the corresponding. moments (7) about the 
mean, (ii) about zero. 

(b) What is Shéppard’s correction? Wha. will be the corrections for the first 
lour moments ? 

The first four moments of a distribution about-x = 4 are 1, 4, 10, 45. Show 
that the mean.is 5 and the variance is 3 and, and 14 are O and_26 respectively, 

(c) In certain distribution, the first four moments about the point 4 are 
~1-5, 17, — 13 and 308. Calculate B, and By». 

(/) The first four moments of a frequency distribution-about the point 5 are 
~0.55,4-46, — 0-43 and -68-52. Find B, and B>. 

Ans. ly = 4-1575, py = 6-5962, py = 75-3944, B, = 0-6055, B, = 4-3619., 

4. (a) For the following data, calculate (i) Mean, (ii) Median, (iii) Semi- 
inter-quartile range, (iv) Coefficient of variation, and (v) B, and 8, coefficients. 
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Wages in 170— 180— 190— 200— 210— 220— 230— 249 


Rupees : 180 190 200 210 220 230 240 250 
No. of 52 68 85 92 100 95 70 28 
Persons : 


Ans. Mean = 209 (approx.); Median = 209-8; Q.D. = 15-8; o = 19.7. 
C.V. = 9-4; B, = 0-003; B, = 26-105. 

(b) Find the second, third and fourth central moments of the frequency 
distribution given below. Hence find (i) a measure of skewness (y,) and 
(it) measure of. kurtosis (‘Y>). 


Class Limits Frequency 
100-0 — 114-9 5 
115-0 -— 119-9 15 
120-0 — 124-9 20 
125-0 — 129-9 35 
130-0 — 134-9 10 
135-0 — 139-9 10 
140-0 — 144-9 5 


Also apply Sheppard’s corrections for moments. 
Ans. [, = 2°16, p13 = 0-804, 4 = 12-5232 


y, = VB, =0-25298; y =P, -3 =- 0317. 

(c) The standard deviation of a symmetrical distribution is 5. What must be 
the value of the fourth moment about the mean in order that the distribution be 
(i) leptokurtic, (41) mesokurtic, and (iif) platykurtic ? 

Hint : 1, = |; = 0 (because distribution is symmetrical). 

6=5>07=p =25 


B = Ua = Ha 
2 py 625 
(i) Distribution is leptokurtic if B, > 3, ie. io 3 => p4> 1875 


(ii) Distribution is mesokurtic if B,=3 => if 4 = 1875 
(iii) Distribution is platykurtic if B2<3 = if Wy < 1875 
5. Show that for discrete distribution B, > 1. 
[Allahabad Univ. M.A., 1993; Delhi Univ. B.Sc.. (Stat. Hons), 1992] 
Hint. We have to show that p4/3 > 1, ie, Wa > W2. If x; fp i= 1,2, on 


n, be the given discrete distribution, then we have to prove that 
2 


- in) & ; -¥)} 
N afi (x; x) ? N 2S (x; x) 
Putting (x;— x)? =2;, we have to show that 

2 


+ Efizi > (1 Efiz) 
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4 


Le. wiht (yeh > 0 


— 


i.e., oz > 0, 
which is always true, since variance is always Eoave 
Hence B2 > 1. 

6. (a) The scores in Economics of 250 candidates appearing at an-examina- 
tion bave 


Mean = x = 39-72 
Variance = 0° = 97:80 
Third Ceritralmoment = ps = = 114-18 
Fourth central moment = uy = 28,396-14 


It was later found on scrutiny that the score 61 of a candidate has been wrongly 
recorded as 51.‘Make necessary corrections in the given valucs of the mean and 
the central moment. (Gujarat Univ. M.A, 1993) 

(b) For a distribution of 250 heights, calculations showed that the mean, 
standard deviation, B; and [2 were 54 inches, 3 inches 0 and 3 inches respectively. 

It was, however, discovered on checking.that the two items 64 and Jin the original / 
data were wrongly written in place of correct values 62 and 52 inches respectively. 
Calculate the correct frequency constants. 

Ans. Correct Mean = 54, S.D. = 2-97, wa = — 2°18, py = 218-42, B, = 00070 
and $2 = 2°81 

7. In calculating the moments of a frequency distribution based on 100 
observations, the following results are obtained : 

Mean = 9, Variance = 19, B; = 0-7 (usa + ive), B2 = 
But later on its was found that one observation 12 was read as 21. Otlain the correct 
value of the first four central moments. 


Ans. Corfectéd mean = 8-91, 2 = 17-64,u3 = 57-05, U4 = 1257-15, 
Bi = 0-59 and B2 = 4-04. 

8. (a) Show that if'a range of six times the standard deviation Covers at least 
18 class intervals, Sheppard’s correction will make a difference of less then 0-5 
percent in the corrected ‘yalue of the standard deviation. , 

Hint. If /: is the magnitude of the class interval, then we want : 


os SeS shor 2 ee Sasa se 


9. 9 


(corrected) = A af 20 - ee =o (1 \ 
" pe 9 9x 12% © 


1 i oo 
=> s.d. (corrected) 2 oO [1 = 06 | —?o a a a iw) 
wi 
2 


.. Required adjustement = o — 0 (corrected ) <—— 6 « oan 
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(b) Show that, if the class intervals of a grouped distribution is less than 
one-third of the calculated standard deviation, Sheppard’s adjustment makes 


difference of less than ; % in the estimate of the standard deviation 


9. (a) If 0, is the rth absolute moment about zero, use the mean value of 
[ulxI@-DRgvix tet ey 
to show that 
(5,)°" $ (0, _4)" (0, 41)" 

From this derive the following inequalities - 

(i) (0,)'*' $(6,4.)% Gi) OD) S$ 6,4 yt? 

(b) For a random variable-X moments of al] order exist. Denoting by 4; and 
d;, the jth central moment and jth absolute moment respectively, show that, 


(i) (Hapa 1)? S Hey Moje 2» 


(ti) (0) 4 < jy )YOtY (Karnataka Univ. B.Sc., 1993) 
10. If 8, and B, ‘are the Pearsons’s coefficients of skewness and Kurtosis 
respectively, ‘show that By > B, + 1. (Bangalore Univ. B.Sc., 1993) 


a. 


3°13, Skewness. Literally, skewness means ‘lack of symiietry’. We study 
skewness to have an idea about the shape of the curve which we can draw with 
the help of the given data. A distribution is said to be' skewed if 

(i) Mean, median and mode fall at different points, 

i.e, Mean # Median # Mode, 

(i?) Quartiles are not equidistant from median, and 

(iii) The curve drawn with the help of the given data is not symmetrical 
but stretched more to one side than to the other. 

Measures of Skewness. Various measures of skewness are 

(1) S,=M—-—M, (2) S, =M-Mo. 
where M is the mean, M,, the median and Mo, the mode of the distribution. 

(3) 5, = (Q3 — Ma) — (Mg — Q)). 

These are the absolute measures of skewness. As in dispersion, for 
comparing two series we do not, calculate these absolute measures but we 
calculate the relative measures called the co-efficients of skewness which are 
pure numbers independent of units of measurement. The following are the 
coefficients of Skewness. 

1. Prof. Karl Pearson’s Coefficient of Skewness. 

Goes (M ~ Mo) 
= 
0) 
where. oi is the standard deviation of the distribution. 

If mode is ill-defined, then using the relation, My = 3M, -— 2M, fora 

moderately asymmetrical distribution, we get 
3(M ~~ M1) 
S;, es 

Oo 


se 3°27) 


... (3-274) 
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The limits for Karl Pearson’s coefficient of skewness are + 3. In practice, 
these limits are rarely attained. 

Skewness is positive if M> Mo or M > Mg and negative if M < My or 
M < Mua. 

Il. Prof. Bowley’s Coefficient of Skewness. Based on‘ quartiles, 

(Q3-My)-(Mqg-Q1) 23+ 2, - 2M, 
SK=(Q,;=Ma)+(Ma-91)Q3- Qi oe 

Remarks 1. Bowley’s coefficient of skewness is also known as Quartile 
coefficient of skewness and is especially useful in situations where quartiles and 
median are used, viz., : ‘ 

(i) When the.mode is ill-defined and extreme observations are present in the 
ata. ; ‘ 
: (ii) When the distribution has open end classes of unequal.class intervals. . 

In these situations Pearson’s coefficient of skewness cannot be: used: 

2. From (3-28), we observe that 

S,=0, if Q3-Ma=Mya-Q 
This implies that for a symmetrical distribution (S, = 0), median is equi- 
distant from the upper and lower quartiles. Mofeover skewness is positive if : 
Q,;-Myg>My-Q; = Q3+Q,> 2M, 
and skewness is negative if 
Q3-Ma<Ma-Q, = Q3+01< 2M, 

3. Limits. for Bowley’s Coefficient of Skewness. We know that for two real 
positive numbers a and b-{i.e., a > 0 and b > 0), the moduls value of the 
difference (a — b) is always less than or equal to the modules: value of the sum 
(a+b), Le., 

1a —b 
la-bisla+bl =» [Po | <1 we) 

We also know that (Q3 = M,) and (My — Q)) are both non-negative. Thus, 

taking a= Q,-M, and b=M,- Q, in (*), we get | 


(Q3 — Mg) - (Mag~ Q1)) © 
(Q3;-My)+(Mag-Q\)' - 
=> ls, (Bowley) | <1 
=> —1<Sk (Bowley) < 1. 
Thus, Bowley’s coefficient of skewness ranges from — | to I. 
Further, we note from (3-28) that : os 
S,=+ lsif Mg- Q) =0, Le., if My = Q) 
S, =-1,if Q3 -My = 0, ise, if Os = Mo. 
4. It should bé clearly understood that the’ values of the coefficients of 
skewness obtained by Bowley’s formula and Pearson’s formula are not 
comparable, although in each case, S, = 0, implies the absence of skewness, i.e., 


the distribution is symmetrical. It may even happen that one of them gives 
Positive skewness while the other gives negative skewnéss. 
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5. In Bowley’s coefficient of skewness the disturbing factor of ‘variation is 
eliminated by dividing the absolute measure of skewness, viz., (Q3 — Md) ~ 
(Md -—(Q, ) by the measurevof dispersion (Q3 — Q} ), i-e., quaftile range. 

6. The only and perhaps quite serious limitations of this coefficient is that it 
is based only on the centra) 50% of the.data and ignores the remaining 50% of the 
dala towards the extremes. 

Ill.. Based upon moments, co-efficient of skewness is 


5, =< —vbi (B2 + 3) 
"2 (582 - 6B: - 9) (3°29 ) 


where symbols have their usual meaning. Thus S, = 0 if either Bi = 0 or B2 = — 3. 
But since B2 = p4/ u3, cannot be negative, 5, = 0 if and only if 8, = 0. Thus 
for a symmetrical distribution B; = 0. In this respect Bi is taken to be a measure 
Of SkeWnéss. The co-efficient, in ( 3-29 ) is to be regarded as without sign. 

We observe in (3-27 ) and (3-28) that skewness can be positive as well as 
negative. The skewness is positive if the larger tail of the distribution lies towards 


x (Mean) = Mo = Ma 
(Symnietrical Distribution) 


the higher values of the variate (the right), i.c., if the curve drawn with the help of 
the given data is tretched more to the right than to the left and is negative 


Jp An 


M, Mg Mean Mean Mg Mg 


(Positively Skewed Distribution) (Negatively Skewed Distribution) 


wr 
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in the Contracy case. 

3-14. Kurtosis. If we know the measures of central tendency, dispersion 
and skewness, we still cannot form a complete idea about the distribution as will 
be clear from the following figure in which all the three curves A, B and C are 
symmetrical about the mean ‘m’ and have the same range. 

In addition to these measures We should know one more measure which Prof. 
Karl Pearson calls as the ‘Convexity of curve’ or Kurtosis. Kurtosis enables us 
to have:an idea about the flatness -or peakedness of the curve. It is measured bv the 
co-efficient B> or its derivation r2 Ae by: 


B2 = ws/u3, = Br - 3 


Curve of the type ‘A’ which is neither flat nor peaked is called the normal 
curve or. ‘mesokurtic curve and for such a curve B: = 3, 1e., y2 = 0. Curve of 
the type ‘B’ which is flatter than the normal curve is known as platykurtic ‘and for 
such a curve B: < 3, Le., y2 < 0. Curve of the tvpe ‘C’ which is more peaked 
than the normal curve is called /eptokurtic and for such a curve, B2 > 3, i.e., 
y2 > 0. 


EXERCISE 3 (c) 


t. What do you understand by skewness ? How is it measured ? Distinguish 
clearly, by giving figures, between positive and negative.skewness. 


2. Explain the methods of measuring skewness and kurtosis of a frequency 
distribution. 


3. Show that for any frequency distribution : 
(i) Kurtosis is greater than unity. 
(ii) Co-efficient of skewness is /ess than 1 numerically. 


4. Why do we calculate in general, onlv the first four moments about mean 
of a distribution and not the higher moments ? 


5. (a) Obtain Karl Pearsons’s measure of skewness for the following data: 
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Values Frequency Values Frequency 
5 — 10 6 25 + 30 15 

10 -— 15 8 30 — 35 11 

15 -— 20 17 35 — 40 2 

20 -— 25 21 


(b) Assume that a firm has selected a random samplé of 100 from its 
production line and has obtain the-data.shown in thé table below : 


Class interval Frequency Class interval Frequency 
130 -— 134 3 150 -— 154 19 
135 — 139 12 155 - 159 12 
140 ~ 144 21 160 - 164 5 
145 ~ 149 98 


. Compute the following : 
(a) The arithmetic niean, (b) the standard deviation, 
(c) Karl Pearson’s coefficient of skewness. 
Ans. (a) 147-2, (b) 7:2083 , (c) 0-0711 
6. \(a) For the frequency distribution given below, calculate the coefficient of 
skewness based on quartiles. 


Annual Sales No. of Firms Annual Sales No. of firms 
(Rs. ’000 ) (Rs. ’000 ) 
Less than 20- 30 Less than 70 644 
Less than 30 225 Less than 80 650 
Less than 40 465 Less than 90 665 
Less than 50 580 Less than 100 680 
Less than 60 634 


(b) (i) Karl Pearsons’s.coefficient of skewness of a distribution is 0-32, its 
s.d. is 6°5 and mean is 29-6. Find the mode of the distribution. 

(ii) Ifthe mode of the above distnbution is 24-8, what will be the s.d. ? 

7. (a) Ina frequency distribution, the co-efficient of skewness based upon 
the quartiles lis 0-6. If the sum oif the upper and lower quartiles is 100 and median 
is 38, find the value of the upper and-lower quartiles. 

Hint. We are given’ 

. Q031+Qi - 2 Md’. 0-6 

Q3- Q1 vel®) 
AlsoQ3 + Qi = 100 and Median = 38 

Subsituting (*), we get 


Sk 


100 - 2,x 38 _ 
Q3 + Qi 

=> Q3 —- Q: = 40 

Simplifying we get | O1 = 30, G3 = 70 
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Meas¥ 
(b) A frequency distnbution gives the following results : 
(i) CV.=5 (ii) Karl Pearsons’s co-efficient of skewness = 0-5 
(iii) O = 2. 


Find the mean and mode of the distribution. 

(c) find the C.V. of.a frequency distribution given that its mean 1s 120, mode 
‘s 123 and Karl Pearons’s co-efficient of skewness is - 0-3. 

Ans. C.V.=8-33 

(d) The first three moments of distribution about the value 2 are 1, 16 and 

40 respectively. Examine the skewness of the distribution, 

8. The first three moments about the origin 51 Kg, calculated from the 

data on the weights of 25 college students are 
uy = + 0-4 kg, Var = 1-2 kg. and (us )’" = — 0:25kg. 
Determine the mean, the standard deviation and coefficient oif skewness. 
9. The first three moments about the origin are given by 


i A »_ (n +1) (2n + 1) n9n+1) 
Mi = 7°? W2 = 6 and U3 4 
Exantine the skewness of the data. 
10. Find out the kurtosis of the data given below : 
Class interval 0 — 10 10-20 20-30 30 -— 40 
Frequency 1 3 4 2. 


I1. Data were obtained for distribution of passengers, entering Bombay local 
trains over time at intervals of 15 minutes for moming and evening rush hours 
separately, and the following results were obtained. 


Morning hours Evening hours 
Arithmetic mean (Peak Hours) 8 his. 38 min. > brs. 40 min. 
Standard devidtion 38-5 min. 34-9 min.. 
Coefficient of skewness 
(in 15 min. unit) - 0-32 + 0-17 
Kurtosis measure / 2:0 2°2 


Interpret the result and discuss giving reasons, whether vou approve of the 
measure of ‘peak hour’. 

12. (a) The standard deviation of a symmertrical distribution ig 5., What 
must be the value of the fourth moment about the mean in order that the distribution 
be (i) Leptokurtic, (ii) mesokurtic, and (iii) platvkurtic. 

Hint. ww) = pa = 0 (Because distribution is symmictrical), g=5=> 07 
M2 = 25 ‘ 

se 
B2 = ue 625 
(i) Distt. is leptokurtic if.62 > 3 i.e., if = > 3 => py > 1875 


(ii) Distt. is mesokurtic if B2 = 3 => if Ro 1875 
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(iii) Distt. is platykurtic if B, <3 = if 4 < 1875. 


(b) Find the second, third and fourth central moments of the frequency 
distribution given below., Hence, find (/) a measure of skewness, and (ii) a 
measure of kurtosis (>). 


Class limits - Frequency 
110-0 — 114-9 

115-0— 119-9 15 
120-0— 124-9 20 
125-0— 129-9 35° 
130-0— 134-9 10 
135-0— 139-9 10 
140-:0— 144.9 5 


Ans. [> = 2-16, 3 = 0-804, Mtg = 12-5232. 


v1 = VB, = 0:25298 ; % =B,-3 =— 0-317. 

13. (a) Define Pearsonian coefficients B, and B, and discuss their utility in 
Statistics. {Delhi Univ. B.Sc. (Hons.), 1993] 

(b) What do you mean by skewness and kurtosis of a distribution ? Show 
that the Pearson’s Beta coefficients satisfy the inéquality B, — 8, - 1 = 0. Alsc 
deduce that B, 2 1. (Delhi Univ. B.Sc. (Stat. Hons.), 1991 

(c) Define the Pearson's coefficients ‘y, and y2 and discuss their utility in 
Statistics. 

OBJECTIVE TYPE QUESTIONS 
1. Match the correct parts to make a valid.statement. 


(a) Range @) (Q3- Qi)/2 

(b) Quartile Deviation (it) \ x z fi (xj- x) 
52s renee o2 

(c) Mean Deviation (iti) Mean 100 

(d) Standard Deviation (iv) x A 

(e) Coefficient of Variation (Vv) Xmax — Xmin 


II. Which value of ‘a’ gives the minimum ? 

(1) Mean square deviation from ‘a’ 

(ii) Mean deviation from ‘a’ 

III. Mcan of 100 observations is 50 and S.D. is 10. What will be the nev 
mean and S.D., if 

(i) 5 is added to each observation, 

(ii) each observation is multiplied by 3, 

(iti) 5 is subtracted from each observation and then it is divided by 4? 

IV. Fill in the blanks : 

(i) (a) Absolute sum of deviation is minimum from........... 
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(ii) The sum of squares of deviations is least when measured from 
ssiegessectneee 
) (iii) | The sum of 10 items is 12 and the sum of their squares is 16:9. 

(iv) In any distribution, the standard deviation is always ................. the 
mean deviation, ; 

(v) The relationship between root mean square deviation and standard 
deviatiONn O iS .....cereereee 

(vi) If 25% of the items are less than 10 and 25% are more than 40, the 
coefficient of quartile deviation is ................. 

(vii) The median and standard deviation of a distribution are 20 and 4 
‘respectively. If each item is increased by 2, the median will be ................. and the 
new standard deviation will be ................. 

(viii) Ina symmetric distribution, the mean and the mode are ................. 

(xi) In symmetric distribution, the upper and the lower quartiles are equi- 
distant from. ..............00 

(x) If the mean, mode and standard deviation of a frequency distribution 
are 41, 45 and 8 respectively, then its pearson’s coefficient of skewness is 


00090 OOF OO OHO SO OO 


(xi) Fora symmuictrical distribution By = .............6 

(xii) If: > 3 the distribution is said to be ................. 

(xiii) Fora symmetric distribution 2 = ...........0006 
2m + 1 FS cpecrcccccscceees 


(xiv) If the mean and the mode of a given distribution are equal then its 
coefficient of SKEWNESS IS ...........c0000 


(xv) Ifthe kurtosis of a distribution is 3, it is called ............... distribution. 
(xvi) Ina perfectly symmetrical distribution 50% of items are above 60 and 
75% items are below 75. Therefore, the coefficient of quartile deviation is 
segltessiesnseaets and coefficient of skewness is ...............5. 
(xvii) Relation between By and‘ Po is given by ................. 
V. For the following questions given correct answers : 
(i) Sum of absolute deviations about median is 
(a) Least, (b) greatest, (c) zero, (d) equal. 
(ii) The sum of squares of deviations is least When measured from 
(a) Median, (b), (c) Mean, (d) Mode, (e) none‘of them: 
(iii) In anvdiscrete series (when all the values are not samc) the relationship 
between M.D. about mean and S.D. is 
(a) M.D. =S.D., (b) M.D-2 S.D., (c) M.D. <S.D., 
(d) M.D.s S.D. 
(e) None of these. 
(iv) Ifeach ofa set of observations of a variable is multiplied by a constant 
(non-zero) value, the variance of the resultant variable. 
(a) is unaltered, (b) increases (c) decreases, (d) is unknown. 
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(v) | The appropriate measure whenever the extreme items, are to be dis- 
regarded and when the distribution contains indefinite classes at the end is 
(a) Median, (b) Mode, (c) Quartile deviation, 
(d) Standard Deviation 
(vi) A.M.,G.M. and H.M. in any series are equal when 
(a) the distribution is symmetric, (b) all the values are same, 
(c) the distribution is positively skewed, 
(d) the distribution is unimodal. 
(vii) The limits for-quartile coefficient of skewness are 
(a) + 3, (b) Oand3, (c) +1, (d) + & 
(viii) ‘The statement that the variance is equal to the second central moment’ 
, (a) always true, (b) sometimes true, (c) never true, 
(d) ambiguous. 
(ix) The standard deviation of a diStribution is 5. The value of the fourth 
central moment ( 44), in order that the distribution be mesokurtic should be 
(a) equalto3, (b) greater than 1,875, (c) equal to 1,875, 
(d) less than 1,875. 
(x) Ina frequency curve of scores the mode was found to be higher than 
the mean. This shows that the distribution is 
(a) Symmetric, (6) negatively skewed, (Cc) positvely skewed, 
(d) normal. ) 
(xi) For any frequency distribution, the kurtosis is 
(a) greaterthan1, (0) less than1, (c) equal to 1. 
(xii) “The measure of kurtosis is 
(a) Bz = 0, (b) Bo = 3, (c) Br = 4, 
(xiii) For the distribution 
(a) py = 0, (b) Median = 0, 
(c) The distribution of x is symmetrical. 
xX: -4 -3 -2 -1 0 fs 2 3 4 Total 
fs Zz 4 5 7 ~©10 7 5 ‘4 2 46 
(xiv) Fora symmetric distribution 
@) w2=0, )H2>0; (©) ps >0 
VI. State which of the following: statements are Ture and which False. In 
each of false. statements given the correct statement. 
(i) Mean, standard deviation and varaince have‘the same unit. 
(ii) | Standard deviation of every distribution is unique and always exists. 
(iii) | Median is the value of the-variance which divides the total frequency 
it (wo equal parts. 
(iv) |Mean— Mode = 3 (mean — median) is often approximately satisified. 


(v) Mean deviation = = (standard deviation) is always satisfied. * 


(vi)  B2 2.1 is always satisfied 
(vii) Bi = O isa conclusive test for a distribution to be symmetrical. 


CHAPTER FOUR 
Theory of Probability 


pe 


41. Introduction. If an experiment is repeated under essentially 
homogeneous and similar conditions we generally come, across two. types of 
situations: 

(i) The result or what is usually known as-the‘eutcome’ is unique or certain. 
(ii) The result is not unique but may be one of the several possible outcomes. 

The phenomena covered by. (i) are known as ‘deterministic’ or ‘predictable’ 
phenomena. By a deterministic phenomenon we mean one in which the result, can 
be predicted with certainty. For example : 

(a) For a perfect gas, 

V «x ‘ i.e., PV = constant, | = 
provided the temperature remains the same. 
(b) The velocity ‘v’ of a'particle after time ‘t’ is given by 
v=u+tat 
where u is the initial velocity and a is the acceleration. This equation uniquely 
determines v_ if the right-hand quantities,are known. 


(c) Ohm’s Law, viz., C= : 


where C is the flow of current, E the potential difference between the two ends of 
the conductor and R the résistance, uniquely determines the value C as soon as E 
and R are-given. § 

A deterministic model is defined as a model which stipulates that the condi- 
lions under which an experiment is performed determine the outcome of the 
experiment. For a number of situations the deterministic model'suffices. However, 
there are phenomena [ds covered by (ii) above] which do not lend themselves to 
deterministic approach and are known as ‘unpredictable’ Or ‘probabilistic’ 
phenomena. For example : | 

(i) In tossing of 4 coin one is not sire if a head or tail will-be obtained. 
(ii) If a light tube has-lasted for ¢ hours, nothing can be said about ‘its further 
life. It may fail to function any moment. 

In such cases we talk of chancé or probability which is'taken to be a quantitative 
measure of certainty. 

_ 42, Short-History. Galileo (1564-1642), an Italian mathematician, was the 
first to attempt at a quantitative ‘measure of probability while dealing with some 
Problems related to the theory of dice in gambling. But the first foundation of the 
mathematical theory if probability was laid in the mid-seventeenth century by two 
French mathematicians, B. Pascal (1623-1662) and P. Fermat (1601-1665), while. 
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solving a number of problems posed by French gambler and noble man Chevalier- 
De-Mere to Pascal. The famous ‘problem of points’ posed by De-Mere to Pascal 
is : "Two persons play a game of chance. The person who first gains a certain 
numiber of points wins the stake. They stop playing before the game is completed, 
How isthe stake to be decided on the.basis of the number of points each has won?" 
The two mathematicians after a lengthy correspondence between themselves 
ultimately solved this problem and this correspondence laid the first foundation of 
the science of probability. Next stalwart in this field was J. Bernoulli (1654-1705) 
whose “Treatise on Probability’ was published posthumously by his nephew N, 
Bernoulli in 1713. De-Moivre (1667-1754) also did considerable work in this field 
and published his famous ‘Doctrine of Chances’ in 1718. Other:main contributors 
are : T. Bayes (Inverse probability), P.S. Laplace (1749-1827) who after extensive 
research over a number of -years finally published “Theoric analytique des prob- 
abilities’ in 1812. In addition to these, other otitstanding contributors are Levy, 
Mises and R.A. Fisher. 

Russian mathematicians also have made very valuable contributions to the 
modem theory of probability. Chief contributors, to mention only a few of them 
are;; Chebyshev (1821-94) who founded the Russian School of Statisticians; 
A. Markoff (1856-1922); Liapounoff (Central Limit Theorem); A. Khintchine 
(Law of Large Numbers): and A. Kolmogorov, who axiomised the ‘calculus of 
probability. | 

4-3. Definitions of Various Terms, In this section we will'define and explain 
the various terms which are used in the-definition of probability. 

Trial and Event. Consider an experiment which, though repeated under 
essentially identical conditions, does not give unique results but may result in any 
one of the several possible outcomes.The experiment is known as a trial and the 
outcomes are known as events or cases. For example : 

(i) Throwing of a die is a trial and getting 1(or 2 or 3, ...or 6) is an event. 
(ii) Tossing of acoin is a trial and getting head (H ) or tail (T ) is an event. 

(iii) Drawing two cards from a pack of well-shuffled: cards is a trial.and 
getting a. king and a queen are events. 

Exhaustiye Events, The total number of possible outcomes in any trial is 
known as exhaustive events or exhaustive cases. For example : 

(i) Intossing of acoin there are two exhaustive cases, viz., head and. tail, 
(the possibility of the coin standing on an edge being ignored). 

(ii) In throwing of a die, there are six exhaustive cases since any one of the 
6 faces 1, 2, ...,.6 may Come uppermost. 

(iii) In drawing two cards from a pack of cards the exhaustive number of 
cases is C,, since 2 cards can be drawn out of 52 cards in °C, ways. 

(iv) In throwing of two dice, the exhaustive number of cases is 6° = =%6, 
since any of the 6 numbers 1.to 6 on the first die can be associated with any of the 
six numbers on the other die. 
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In general in throwing of x dice the exhaustive number of cases is 6". 

Favourable Events or Cases. Thé number of cases favourable to an event in 
q trial is the number of outcomes which entail the happening of the event. For 
example, 

(i) In drawing acard from a pack of cards ‘the number of cases favourable 
to drawing of an ace is 4, for drawing a spade is 13 and for drawing a red card is 
26. 

(ii) In throwing of two dice, the number of cases favourable to getting the 
sum 5 is : (1,4) (4,1) (2,3) (3,2), i.e., 4. 

Mutually exclusive events. Events are said to be mutually exclusive or 
incompatible if the happening of any one of them precludes the happening of all 
the others ;i.e., if no two or moré of them can happen simultaneously in the same. 
trial- For example : 

(i) In throwing a die all the 6 faces numbered 1 to 6 are mutually exclu- 
sive since if any one of these faces comes, the possibility of others, in the same 
trial, is ruled out. 

(ii) Similarly in tossing a ¢éin thé events head and tail are riutually exclu- 
sive. 

Equally likely events. Outcomes of a trial are set to be equally likely if taking 
into consideration all the relevant evidences, there is no reason lo expect one in 
preference-to the others. For example 

(i) In tossing an-unbiased or uniform oe héad or tail are equally likely 
events. 

(ii) In throwing an unbiased die, all the six faces are equally likely to:come. 

Independent events. Several events are said ‘to be independént if the 
happening (or non-happening) of an event is not affected by the supplementary 
knowledge conceming the occurrence of any number of the remaining events . For 
example 

_ (i) In tossing .an unbiased coin the event of getting a head in the first toss 
is independent of getting a head in the second, third and subsequent throws. 

(ii) If we draw a card froma pack of well-shuffled cards and replace it 
before drawing. the second card, the result of the second draw is independent of 
the first draw. But, however, if the first card drawn is not replaced then the second 
draw is dependent on the first draw. 

«3-1. Mathematical or Classical or ‘a. priori’ Probabality 

Definition. If a trial results in n exhaustive, mutually exclusive and equally 
likely cases and m of them are favourable to the happening of an event £, ‘then the 
Probability ‘p’ of happening of E is given by 


P(E)= Favourable number of cases _m 
ne Exhaustive number of cases on we(41) 
Sometimes we express (4-1) by saying that ‘the odds in favour of E arem <= 


(n— m) or the odds against E are(n— m):n.’ 
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Since the number of cases favourable to the ‘non-happening’ of the event E 

are,(n— m), the probability ‘g’ that E will not happen is given by 
n-m m_. 
=| a . p => p+ q=Kil (4-Va) 

Obviously p as well as q are non-negative and cannot exceed unity, i.e, 
0< p< 1,0 gs l. 

Remarks. 1. Probability ‘p’ of the happening of an event is also known as 
the probability of success‘and the probability ‘g’ of the non- happening ofthe event 
as the probability of failure. 

2. 1f P (E) = 1 , E is called a certain event and if P (E) = 0, E is called an 
impossible event. 

3. Limitations of Classical Definition. This definition of Classical Prob- 
ability breaks down in the following cases : 

(i) If the various outcomes. of the trial are not equally likely: or equally 
probable. For example, the probability that a candidate will pass in a certain test is 
not 50% since the two possible outcomes, viz., sucess and failure (excluding the 
possibility of a compartment) are not equally likely. 

(ii) If the exhaustive number of cases in a trial 1s infinite. 

4-3-2, Statistical or Empirical Probability 

Definition (Von Mises). If a trial is’ repeated.a number of times under 
essentially homogeneous and identical conditions, then the limiting value of the 
ratio of the number of times the event. happens to the number of trials, as the 
number of trials become indefinitely large, is called the probability of happening 
of the.event. (It is assumed that the limit is finite and unique). 

Symbolically, if ina trials an event E happens m times, then the probability 
‘p’ of the happening of E is given by 


= P(E)=limit — 
: as no Mt seoe(4°2) 

Example 4:1. What is the chance that a leap year selected at random will 
contain 53 Sundays? 

Solution. In a leap year (which consists of 366 days) there are 52 complete 
Weeks and 2 daysover. The following are the possible combinations for these two 
‘over’ days: 

(i) Sunday and Monday, (ii) Monday and Tuesday, (iti) Tuesday and Wed- 
nesday, (iv) Wednesday and Thursday, (v) Thursday and Friday, (vi) Friday and 
Saturday, and (vii) Saturday and Sunday. 

In order that a leap year selected at random should contain 53 Sundays, one of 
the two ‘over” days must be Sunday. Since out of the above 7 possibilities, 2 viz., 
(i) and (vii), are favourable to this event, 


“IN 


Required probability = 
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Example 4-2. A bag contains 3 red, 6 white and 7 blue balls. What is the’ 

robability that two balls drawn are white and blue? 

Solution. Total number of balls = 3+ 6+ 7= 16. 

Now, out of 16 balls, 2 can be drawn in "°C. ways. 

Exhaustive number of cases = '*C,= io 5 ID = 120. 

Out of 6 white balls ! ball can be drawn in °C, ways and out of 7 blue balls 1 
pall can be drawn in ’C, ways. Since each of the former cases can be associz ‘ed 
with each of the latter cases, total number of favourable cases is: °C: x7C, 
= 6X7= 42. 

7 
Required probability = ae = 50° 

Example 4-3. (a) Two cards are drawn at random from a well-shuffled pack 
of 52 cards. Show that ihe chance of drawing two aces is 1/221. 

(b) From a pack of 52 cards, three are drawn at random. Find the chance 
that they are a king, a queen and a knave. 

(c) Four cards are d:awn from a pack of cards. Find the probability that 

(i) all are diamond, (ii) there is one card of each suit, and (iii) there are 
two spades and two hearts. 

Solution. (a) From a pack of 52 cards 2 cards can be drawnin C2 ways, 
all being equally likely. 

Exhaustive number of cases =. "C2 

In a pack there are 4 aces‘and therefore 2 aces can be drawn in ‘C2 ways. 

G._ 4x32 
wo 2 52x51 221 

(b) Exhaustive number of cases = °C; 

A pack of cards contains 4 Kings, 4 queens and 4 knaves, A king, a queen and 
a knave can each be drawn in ‘C, ways and since each way of drawing a king can 
be associated with each of the ways of drawing a queen and 4 knave, the total 
number of favourable cases = “C, x ‘C, x “C, 

“C1x*Grx °C, 4x4x4x6_ 16° 


Required probability = 


Required probability = an 52x 51x50 5525 
(c) Exhaustive number of cases = "C, 
(t) 
13 13 13 
(ti) Required probability = Sweex = “i aia 
4 
13 13 
(iii) Required probability= —C2% © 
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Example 4-4. What is the probability of getting 9 cards of the same suit in 
one hand at a game of bridge? 
Solution. One hand in a game of bridge consists of 13 cards. 
Exhaustive number of cases = “C3 
Number of ways in which, in one hand, a particular player gets 9 cards of one 
suitare Cy and the number of ways in which the remaining 4 cards are of some 
other suit are C,. Since there are 4 suits in a pack of cards, total number of 
favourable cases = 4x °Cyx Cy. 
4x "Cs x OC, 
a OF 
Example 4-5. (a) Among the digits 1,2, 3,4, 5, at first one is chosen and then 
a second selection is made among the remaining four digits. Assuming that all 
twenty possible outcomes have equal probabilities, find the probability that an odd 
digit will be selected (i) the first time, (ii) the second time, and (iii) both times. 
(b) From 25 tickets, marked with the first 25. numerals, one is drawn at 
random. Find the chance that 
(i) itis a multiple of 5 or 7, 
(it) itis a multiple of 3 or7. . 
Solution. (a) Total number of cases = 5 x 4= 20 
(i) Now there are 12 cases in which the first digit drawn ts-odd, viz., (1, 2), 
(1, 3), (1,4), (1, 5), (3, 1), (3, 2), 3, 4), G, 5), S, 1), G, 2), G, 3) and (5, 4). 
“. The probability that the first digit drawn is odd 
_12_3 
~ 20° 5 
(ii) Also there are 12 cases in which the second digit drawn is odd, viz., 
(2, 1),@, 1), 4, 1), 65, 1), (1, 3), (2, 3), 4, 3), (S, 3), (1, 5), (2, 5), G, 5) and (4, 5). 
The probability that the second digit drawn is odd 
_12_3 
~ 207 5 
(iii) There are six cases in which both the digits drawn are odd, viz., (1, 3), 
(1, 5), (3, 1), (3, 5), (5, 1) and (5, 3). 
‘. The probability that both the digits drawn are odd 
6 3 
~ 20 10 
(b) (i) Numbers (out of the first 25 numerals) which are multiptcs of 5 are 5, 
10, 15, 20 and 25, i.e., 5 in all and the numbers which are multiples of 7 are 7, 14 
and 21, i.e.,3 in all. Hence required'number of favourable cases are 5+3=8. 


Required probability = 


Required probability = a 


(ii) Numbers (among the first 25 numerals) which are multiples of 3 are 3, 6, 
9, 12, 15, 18, 21, 24, é.e., 8 in all, and the numbers which are multiples of 7 are 7, 
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14,21, i.e., 3 in all. Since the number 21 is common in both the cases, the required 
number of distinct favourable cases is 8+ 3—1= 10. 
10 2 
Required probability = 5" 5 

Example 4-6. A committee of 4 people is to be appointed from 3 officers of 
the production department, 4 officers of the purchase department, two officers of 
the sales department and 1 chartered accountant. Find the probability of forming ° 
the committee in the following manner: 

(i) There must be one from each category. 

(it) 1t should have at least one from the purchase department. 

(iii) The chartered accountant must be in the committee. 

Solution. There are 3+4+2+1=10 persons in al and a committee of 4 people 
can be formed out of them in "°C, ways. Hence exhaustive number of cases is 

1G, = xen 210 

(i) Favourable number of cases for the committee to consist of 4 members, one 

from each category is : 
“C1x°C1x7C,x%1 = 4x3x2 = 24 

4 8 
210 70 

(ii) P (Committee has at Icast one purchase officer] 

= 1- P {Committee has no purchase officer] 

In order that the committee has no purchase officer, all the 4 members are to 
be selected from amongst officers of production department, sales department and 
chartered accountant, i.¢., out of 3+2+1=6 members and this can be done in 


Required probability = 


s~ . 6X5 _ 
C= Or as 15 ways. Hence 
1 
P [ Committee has no purchase officer ) = aes 14 
1 
.. P {Committee has at least one purchase officer ] = 1- + = 3 


(iii) Favourable number of, cases that the committee consists of a chartered 
accountant as a member and three others are : 
9x8x7 
1x°C; = ——~--- 
G 1x2x3 
since a chartered accountant can be selected out of one chartered accountant in on'y 
1 way and the remaining 3 men:vers can be selected out of the oe 


10- 1= $ persons in °C; ways. Hence the required probability = = 5 2 ; 
Example 4-7, (a) If the letters of the word ‘REGULATIONS’ be arranged at 
randcm, what is the chance that there will be exactly 4 letters between R and E? 


= 84 ways, 
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(b) What is the probability that four S’s come consecutively in the word 
‘MISSISSIPPI ? 

Solution. (a) The word ‘REGULATIONS’ consists of 11 letters. The two 
letters R and E can occupy "'P2, i.e. ,11 x 10 = 110 -positions. 

The number of ways in which there will be exactly 4 letters between R and E 
are enumerated below: 

(i) R is in the Ist place and E is in the 6th place. 

(ii) R is in the 2nd place and E is in the 7th place. 


eco eee eee 
eee eee eee 


eee eee eo0e 


(vi) R is in the 6th place and E is in the 1 1th place. 
Since R and E can interchange their positiofs, the required number of 
favourable cases is 2X 6=> 12 
The required probability = ~ = = 


(5) Total number of permutations of the 11 letters of the word ‘MISSISSIPPI’, 
in which 4 are of one kind (viz., 5), 4 ofothe: kind (viz.,/), 2 of third kind 
(viz.,P) and 1 of fourth kind (viz., M) are 

1} 
4!:4!:2! 1! 
Following are the 8 possible combinations of 4 S’s coming consecutively: 


(i) §5 S $ § 

fi) —- S&S & S$ S 

(ji) - — § § S § 

We ee ye ee es CS 


Since in each of the above cases, the total number of arrangements of the 
remair:ing 7 letters, viz., MIIIPPI of which 4 are of one kind, 2 of other kind 


' 

and cne of usird kind are TEXETI , the required number of favourable cases 
_ 8x7! 
— 4'2ed! 

- 8 x 7! 11! 
Remus Pov Anil Arata 
~8x7!x4! 4 
" 7 11! ~ 165 


Exzmple 4-8. Each coefficient in the equation ax’ +bx+c = 0 is deter- 
: ined by throwing an ordinary die. Find the probability that the- equation will 
nave real roots. (Madras Univ. B. Sc. (Stat. Main), 1992] 
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Solution. The roots of the equation ax’+bx+c= 0 .(*) 
will be real if its discriminant is non-negative, 1.e., if 
b’-4ac > 0 = b? > dac 
Since each co-efficient in equation (*) is determined by throwing an ordinary 
die, each of the co-efficients a, b and c can take the values fiom 1 to 6. 
-, Total number of possible outcomes (all being equally likely) 


= 6x6x6 = 216 
The number of favourable cases can be enumerated as follows: 
ac a Cc 4ac b No. of cases 
(so that b* > 4ac ) 

] l l 4 2, 3, 4, 5 1x 5=5 

2@ ji 2 = 
(ii) ; 8 3,4,5,6 2x 4=8 

3 (@) jl 3 = 
(ii) 13 1 12 4,5,6 2x 3=6 

4 (f) }1 4 
(ii) 44 ] 16 4,5,6 3 x 3=9 
(ii) (2 2 

5 @ Jl 5 _ 
(ii) i 1 20 5,6 2x2=4 

6 (i) fl 6 
(ii) J6 1 = 
Gi) 3 7 24 5,6 4x2=8 
(iv) (2 3 

7 (ac=7 - not a ) 

8 (ij 42 _ 
_o 4 6 2x* T= 2 

3 36 6 1 


Total = 43 


Since b’ > 4ac and since the maximum value of b’ is 36, ac = 10, 11, 12, .. 
etc. is not possible. 
Hence total number of favourable cases = 43. 
; ss 43 
Required probability = 716 
Example 4-9. The sum of two non-negative quantities is equal to 2n. Find the 


chance that their product is not less than i times their greatest product. 
Solution. Let x > 0 and y> 0 be the given quantities so that x+ y= 2n. 


We know that the product of two positive quantities whose sum is constant is 


greatest when the quantities are equal. Thus the product of x and y is maximum 
when X= y=n. 
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a Maximum product= n.n = n° 
Now P| xy § n= P| xy> 3 nt | = Pl Qn~sx2 5 | 


4 
=P [(4x°-8nx+ 3n’)s 0) 
=P [(2x-3n)(2x- yi < 0) 


=P E lies between — 9 = and 3a] | 
Favourable range = ah 5 =n 
Total range =- 2n 
Required probability = = . 


Example 4-10. Oust of (2n+1) tickets consecutively numbered three are drawn 
at random. Find the chance that the numbers on them arein AP. 
[Calicut Univ. B.Sc., 1991; Delhi Univ. B.Sc.(Stat. Hons.), 1992] 
Solution. Since out of (2n + 1) tickets, 3 tickets can be drawn in antic, 
ways, 


Exhaustive number of cases = 2"+ !C, = 2n +1) 2n - 1) 


_ n(4n?-1) 
7 3 
To find the favourable number of cases we are to enumerate all the cases in 
which the numbers on the drawn tickets are in A.P with common difference, (say 
d= 1,2,3,...,n-1,n). 


If d= 1, the possible cases are as follows: 


2; 3 
2, 3, 4 
ce 2 , be.,(2n—-1) cases in all 
n-1, n, 2n+1 
If d= 2, the possible cases are as follows : 
l, 3 § 
2, 4, 6 


» i.é.,(2n — 3) cases in ail 
2n - 3, 2n-1, on+1 
and so on. 
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If d= n-—I, the possible cases are as follows: 
1, n, 2n—1 
2,n+/, 2n , £e., 3 cases in all 
3,a+2, 2n+ 1 
If d =n, there is only one case, viz., (l,a+1,2a+ 1). 
Hence total number of favourable cases 
= (2n ~ 1) + (Qn — 3)) +..454+34+ 1 
=1434+5+....4 (2n — 1), 
which is a series in A.P. with d =2 and vn terms. 


. Number of favourable cases a5 {1 +(2n -1)] =n? 


n2 —3n 
n ae 1)/3 (4n2-1) 


EXCERCISE 4 (a) 
1. (a) Give the classical and statistical definitions of probability. What are 
the objections raised in these definitions? 
[Delhi Univ. B.Sc. (Stat. Hons.), 1988, 1985} 
(b) When are a number of cases said to be equally likely? Give an example 
each of the following : 
(i) the equally likely cases, 
(it) four cases which are not equally likely, and 
(iii) five cases in which one case is more likely than the other four. 
(c) What is meant by mutually exclusive events? Give an example of 
(t) three mutually exclusive events, 
(ii) three events which are not mutually cxclusive. 
[Meerut Univ. B.Sc. (Stat.), 1987] 


.. Required probability = 


(d) Can 
(i) events be mutually exclusive and exhaustive? 
(if) events be exhaustive and indepenent? 
(iii) events be mutually exclusive and independent? 
(iv) events be mutually exhaustive, exclusive and independent? 

2. (a) Prove that the probability of obtaining a total of 9 in a:single throw 
with two dice is one by nine. 

(b) Prove that if a single throw with a pair of dice the probability of getting 
the sum of 7 is equal to 1/6 and the probability of getting the sum of 10 is equal 
lo 1/12. 

(c) Show that in.a single throw with two dice, the chance of throwing more 
than seven is equal to that of throwing less than seven. 

Ans, 5/12 {Delhi Univ. B.Sc., 1987, 1985] 


(d) In a single throw with two dice, what is the number whose probability 
is minimum? 
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(e) Two persons A and 6 throw three dice (six faced). If A throws 14, find B’s 
chance of throwing a higher number. [Meerut Univ. B.Sc.(Stat.), 1987] 

3. (a) A bag contains 7 white, 6 red and 5 black balls. Two balls are drawn at 
random. Find the probability that they will both be white. 

Ans, 21/153 

(b) Abag contains 10 white, 6 red, 4 black and 7 blue balls. 5 balls are drawn 
at random, What is the probability that 2 of them are red and one black? 

Ans. °C, ‘C,/ Cs 

4. (a) From aset of raffle tickets numbered 1 to 100, three are drawn at random, 
What is the probability that all the tickets are odd-numbered? 

Ans. ed OF TC, 

(b) A number is chosen from each of the two sets : 

(1,2, 3,4, 5, 6, 7,8,9); (4,5, 6, 7, 8, 9) 

If pi-is the probability that the sum of the two numbers be 10 and p> the 
probability that their sum be 8, find p: + p2- 

(c) Two different digits are chosen at random from the set 1,2,3,...,8. Show 
that the probability that the sum of the digits will be equal to 5 is the same as the 
probability that their sum will exceed 13, each being 1/14. Also show that the 
chance of both digits exceeding 5 is 3/28. (Nagpur Univ. B.Sc., 1992] 

5. What is the chance that (i) a leap year selected at random will coutain 53 
Sundays? (ii) a non-leap year selected at random would contain 53 Sundays. 

Ans. (i) 2/7, (ii) 1/7 

6. (a) What is the probability of having a knave and a queen when two cards 
are drawn from a pack of 52 cards? 

Ans. 8/663 

(b) Seven cards are drawn at random from a pack of 52 cards. What is the 
probability that 4 will be red and 3 black? 

Ans. “C,x™C; /"C; 

(c) Acard is drawn from an ordinary pack and a gambler bets that it is a spade 
or an ace. What are the odds against his winning the bet? 

Ans. 9:4 

(d) Two cards are drawn from a pack of 52 cards. What is the chance that 

(i) they belong to the same suit? 
(ii) they belong to different suits and different denominations. 
. (Bombay Univ. B.Sc., 1986] 

7. (a) If the letters of the word RANDOM be arranged at random, what is the 
chance that there are exactly two letters between A and O. 

(b) Find the probability that in a random arrangement of the leters of the word 
‘UNIVERSITY’, the two I’s do not come together. 
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(c) Inrandom arrangements of the letters of the word ‘ENGINEERING’, what 

is the probability that vowels always occur together? 
(Kurushetra Univ. B.E., 1991] 

(d) Letters are drawn one at a time from a box containing the letters A, H, M, 
O, S, T. What is the probability that the letters in the order drawn spell the word 
‘Thomas’? , 

8. A letter is taken out at random out of ‘ASSISTANT’ and a letter out of 
‘STATISTIC’. What is the chance that they are the same letters? 

9. (a) Twelve-persons amongst whom are x and y sit down at random at a 
round table. What is the probability that there are two persons between x and y? 

(b) A and B stand in a line at random with 10 other people. What is the 
probability that there will be 3 persons between A and B? | 

10. (a) If from a lot of 30 tickets marked 1, 2, 3, ..., 30 four tickets are drawn, 
what is the chance that those marked 1 and 2 are among them? 

Ans. 2/145 

(b) A bag contains 50 tickets numbered 1, 2, 3, ..., 50 of which five are 
drawn at random and arranged in ascending order of the magnitude (x; < x2 < xs 
< X4< Xs). What is the probability that x3 = 30? 

Hint. (a) Exhaustive number of cases= *C, 

If, of the four tickets drawn, two tickets bear the numbers 1 and 2, the remaining 
2 must have come out of 28 tickets numbered from 3 to 30 and this can be done in 
™C2 ways. 

“ Favourable number of cases = *C; 

(b) Exhaustive number of cases = “Cs 

If x3 = 30, then the two tickets with numbers x, and x, must have come out of 
29 tickets numbered from 1 ‘to 29 and this can be done in “C2 ways, and the other 
two tickets with numbers x, and x; must have been drawn out of 20 tickets numbered 
from 31 to 50 and this can be done in “C2 ways. 

: No. of favourable cases = "Cx ”C:. 

11. Four persons are chosen ar random from a group containing 3 men, 2 
women and 4 children. Show that the chance that exactly two of them will be 


children is 10/21 . (Delhi Univ. B.A.1988] 
4 $ 
C.x*C, 10 


12. From a group of 3 Indians, 4 Pakistanis and 5 Americans a sub-com- 
mittee of four people is selected by lots. Find the probability that the sub-committee 
will consist of 

(i) 2 Indians and 2 Pakistanis ; 
(ii) 1 Indian,1 Pakistani and 2 Americans 
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(iii) 4 Americans [Madras Univ. B.Sc.(Main os - 1987} 
3 3 4 
Ans. (i) an . (ii) i lS (iii) 
"Cy Cs C4 


13. In a box there are 4 granite stones, 5 sand stones and,6 bricks of identica} 
size and shape. Out of them 3 are chosen at random. Find the chance that : 
(i) They all belong to different varieties. 
(ii) They all belong to the‘Same variety. 
(iii) They are all granite stones. (Madras Univ. B.Sc., Oct. 1992) 

14. Ifn people are seated at a round table, what is the chance that two named 
individuals will be next to each other? 

Ans. 2/(n-1) 

15. Four tickets marked 00, 01, 10 and 11 respectively are placed in a bag. A 
ticket is drawn at random five times, being replaced each time. Find the probability 
that the sum of the numbers on tickets thus drawn is 23. 

[Delhi Univ. B.Sc.(Subs.), 1988] 

16. From a group of 25 persons, what is the probability that all 25 will have 
different birthdays? Assume a 365 day year and that all days are equally likely. 

[Delhi Univ. B.Sc.(Maths Hons.), 1987] 

Hint. (365 x 364 x... x 341) + (365)” 

4-4. Mathematical Tools : Preliminary Notions of Sets. The set theory was 
developed by the German mathematician, (+. Cantor (1845-1918). 

4-4-1. Sets and Elements of Sets. / set is a well defined collection or 
aggregate of all possible objects having give 1 properties and specified according 
to a well defined rule, The objects comprising a set are called elements, members 
or points of the set. Sets are often dénoted by capital letters, viz., A,B,C, etc. Ifx 
is aa element of the set A, we write symbolically xe A (x belongs.to A). If x is not 
a member of the set A, we write x € A(x does not delong to A ). Sets are often 
described by describing the properties possessed by their members. Thus the set 
A of all non-negative rational numbers with square less than 2 will be written as 
A={x :xrational,x > 0,x’ < 2}. 

If every element of the set A belongs to the set B, i.e., if xe A =x B, then 
we say that A is a subset of B and write symbolically A ¢ B (A is contained in B) 
or B >A (Bcontains A ). Two setsA and B are said to be equal or identical if 
Ac B andBcA and wewriteA =BorB=A. 

A null or an empty set is one which does not contain any element at all and is 
denoted by 9. 

Remarks. 1. Every set is a subset of itself. 

2. An empty set is subset of every Set. 

3. A set containing only one element is zonceptually distinct from the element 
itself, but will be represented by the sam« symbol for the sake of convenience. 

4. As will be the case in all our applications of set theory, especially to 
probability theory, we shall have a fixed set S (say) given in advance, and we shall 


be concemed only with subsets of this given set. The underlying set § may vary 
from one application to another, and it will be referred to as universal set of each 
partic ular discourse. 

4-4-2, Operation on Sets 

The union of two given sets A and B, denoted by A UB, is defined as a set 
consisting of all those points which belong to either A or B or both. Thus 
symbolically, 

AUBz={x:xeAorxe B). 
Similarly 


Rn 
U A; ={x :xe A; foratleastonei=1,2,...,2} 
i=] 
The intersection of two sets A and B, denoted by A OB, is defined as a set 
consisting of all those elements which belong to both A and B. Thus 
ANnBz={x:xe Aandxe B}. 
Similarly 
Rn 


,Ai =({(x:xe A; foralli=1, 2, ..., n} 
i=1 : 
For example, if A = {1,2, 5, 8, 10) and B = {2, 4, 8, 12), then 
AUB = (1,2,4,5, 8, 10,12} andAM B= {2,8}. 
If A and B have no common point, i.e., A B = >, then the sets A and B are 
said to be disjoint, mutually exclusive or non-overlapping. 
The relative difference of aset A from another set B, denoted by A-38 is defined 
as a set consisting of those elements of A which do not belong to B. Symbolically, 
A~B={x:xeAandx¢€ B}. 
The complement or negative of any set A, denoted by 4 is a set containing all 
elements of the universal set S, (say), that are not elements of A, i.e..4 = S—A. 
4-4-3, Algebra of Sets 
Now we state certain important properties concerning operations on sets. If A, 
B and C are the subsets of a universal set S, then the following laws held: 


CommutativeLaw : AUB=BUA,ANB=BQONOA 
Associative Law >: AUB)UCH=AU(BUC) 
(ANB)NC=AN(BNC) 
Distributive Law : AN(BUQC=(ANB)U(ANC) 
AU(BNQC)=AUB)N(AUC) 
Complementary Law : AUA=S,ANA=$6 


AUS=S,(°°A=S),ANS=A 
AUOG=A,ANO=6 
Difference Law : A-B=ANB 
A-B=A-(ANB)=(AUB)-B 
A-(8-C)=(A-B)V(A-O). 
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(AUB)-C=(A-Q)U(B-C) 
A-(BUC)=(A-B)M(A-C) 
(ANB)U(A-B)=A,(ANB)O(A-B)=6 
De-Morgan’ s Law — Dualization Law. 
(AUB)=AQB and (ANB)=AUB 
More generally - 


Rn % r 
( UA) = AA; and ( NA) = 


i=] i=] i=1 i 


"Ce 


Ai 
l 
InvolutionLaw  : (A) =A 
Idempotency Law : AUA=A, AQA=A 


4-4-4, Limit of Sequence of Sets 
Let (A,} be a sequence of sets in S. The limit supremum or limit superior of 
ithe sequence , usually written as lim sup A,, is the set of all those elements which 
belong to A, for infinitely many n. Thus 
lim sup A,={x:x € A, for infinitely manyn} + 
n—yo ...(4-3) 
The set of all those elements which belong to A, for all but a finite number of 
nis Called limit infinimum. or limit inferior of the sequence and is denoted by lim 
inf A,. Thus 
lim inf A. = (x:x & A, forall buta finite number of n } 
~A—poo (43 a) 
The sequence {A,) is said to have a limit if and only if lim sup A, 
= lim inf A, and this common value gives the limit of the sequence. 


Theorem 4-1. lim sup An= TA ( U Aan) 

m=l1 a=m 
m=1 n=m 

Def. {A.} is a monotone (infinite) sequence of sets if either 

(ij) AxCAnsi Von or (ii) AnsDAnsi Vi av. 

In the former case the sequence {A,) is said to be non-decreasing sequence 
and is usually expressed as A, 1 and in the latter case it is said to be non-increasing 
sequence and is expressed as Aa . 

For a monotone sequence (non-increasing or non-decreasing), the limit always 
exists and we have, 

U A, in case (i), ie., Aal 
lim A,={ "7! 
A—poo 


CA, in case (ii), i.e., Ant 
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4-4-5, Classes of Sets. A group of sets will be termed asa class (of sets). Below 
we shall define some useful types of classes. 
A ring R is a non-empty class of sets which is closed under the formation of 
‘finite unions’ and ‘difference’, 
ie, fWAE R, Be R, then AUB Ee Rand A-BeR. 
Obviously 6 is a member of every ring. 
A field F (or an algebra) is a non-empty class of sets which is closed under 
the formation of finite unions and under complementation. Thus 
(i) Ae F, Be F> Au Be F and 
(ii) Ae F>Ae F. 
A o-ring C isa non-empty class of sets which is closed under the formation 
of ‘countable unions’ and ‘difference’. Thus 


(i) Ae C,i=1,2,... 2% UA EC 
i= 

(iijj)AEC,BeEC => A-Be C. 

More precisesly o-ring is a ring which is closed under the formation of 
countable unions . 

A o field (or o-algebra) B is a non-empty class of sets that is closed under 
the formation of ‘countable unions’ and onpre ene 

ié., 


(i) A € Bi=1,2,.. —- UWA € B. 
i=l 
ijj/AEB => AEB. 

0-field may also be defined as a field which is closed under the formation of 
countable unions. 

4-5. Axiomatic Approach to Probability.. The axiomatic approach to prob- 
ability, which closely relates the theory of probability with the modern métric 
theory of functions and also set theory, was proposed by A.N. Kolmogorov, a 
Russian mathematician, in 1933. The axiomatic definition of probability includes 
‘both’ the classical and the statistical definitions as particular cases and overcomes 
the deficiencies of each of them. On this basis, it is possible to construct a logically 
perfect structure of the modern theory of probability and at the same time to satisfy 
the enchanced requirements of modern natural science. The axiomatic develop- 
ment of mathematical theory of probability relies entirely upon the logic of 
deduction. 

The diverse theorems of probability, as were available prior to 1933, were 
finally brought together into a unified axiomised system in. 1933. It is important to 
remark that probability theory, in common with all-axiomatic mathematical sys- 
tems, is concerned solely with relations among undefined things. 

The axioms thus provide a set-of rules which define relationships betweer. 
abstract entities. These rules can be used to deduce theorems, and the theorems can 
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be brought together to deduce more complex theorems. These theorems have no 
empirical meaning although they can be given an interpretation in terms of 
empirical phenomenon. The important point, however, is that the mathematical 
development of probability theory is, inno way, conditional upon the interpretation 
given to the theory. 

IMore precisely, under axiomatic approach, the probability can be deduced 
from mathematical concepts. To start with some concepts are laid down. Then some 
statements are made in respect of the properties possessed by these concepts. These 
properties, often termed as “axioms” of the theory, are used to frame some 
theorems. These theorems are framed without any reference to the real world and 
are deductions froth the axioms of the theory. 

4-5-1. Random Experiment, Sample Space. The theory of probability 
provides mathematical models for “real-world phenomenon” involving games of 
chance such as the tossing of coins and dice. We feel intuitively that statements 
such as 

(i) "The probability of getting a “head” in one toss of an unbiased coin is 1/2" 

(it) “The probability of getting a "four" in a single toss of an unbiased dic is 
1/6", 
should hold. We also feel that the probability of obtaining either a "5" or a "6" in 
a single throw of a die, should be the sum of the probabilities of a "5" and a "6", 
viz., 1/6+1/6=1/3. That is, probabilities should have some kind of additive property. 
Finally, we feel that in a large number of repetitions of, for example, a coin tossing 
experiment, the proportion of heads should be approximately 1/2. ‘hat is, the 
probability should have a frequency interpretation. 

To deal with these properties sensibly, we need a mathematical description or 
model for the probabilistic situation we have. Any such probabilistic situation is 
referred to as arandom experimezu, denoted by E. E may be acoin or die throwing 
experiment, drawing of cards from a well-shuffled pack of cards, an agricultural 
experiment to determine the effects of fertilizers on yield of a commodity, and so 
on. 

Each performance in a random experiment is called a ¢rial. That is, all the trials 
conducted under the same set of conditions form a random experiment. The result 
of a trial in a random expenment is called an outcome, an elementary. event or a 
cmple point. The totality of all possible outcomes (i.e., sample points) of a random 
ex seriment constitutes the sample space. 

Suppose 1, 2, ..., én are the possible outcomes of a random experiment E such 
that no two or more of them can occur Simultaneously and exactly one of the 
‘outcomes é), €2, ..-, &, Must occur. More specifically, with an experiment E, we 
associated a set S = (€,€2,..,é€,) Of possible outcomes with the following 
properties: 

(i) each element of S denotes a possible outcome of the experiement, 
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(ti) any trial results in an outcome that corresponds to one and only one element 
of the set S. 

The set S associated with an experiment E, real or conceptual, satisfying the 
above two properties is called the sample space of the experiment. 

Remarks, 1. The sample space serves as universal set for all questions 
concemed with the experiment. 

2. Asample space S is said to be finite (infinite) samplé sapce if the number 
of elements in S is finite (infinite). For example, the sample space associated with 
the experiment of throwing the coin until a head appears, is infinite, with possible 
sample points 

{W1, 2, Ws, Ws, .,..) 
where @ = H, @2= TH, @)=TTH, @4= TTTH, and soon ,H denoting ahead and 
T a tail. 

3. A sample space is called discrete if it contains only finitely or infinitely 
many points which can be arranged into a simple sequence @;, @, ..., while a 
sample space containing non- denumerable number of points is called a continuous 
sample space. In this book, we shall restrict ourselves to discrete sample spaces 
only. 

4-5-2, Event. Every non-empty subset A of S, which is a disjoint union of 
single element subsets of the sample space S of a randota experiment E is called 
an event. The notion of an event may also be defined as follows: 

"Of all the possible outcomes in the sample space of an experiment,some 
outcomes satisfy a specified description,which we call an event.” 

Remarks. 1. As the empty set $ is a subset of S , is also an event, known as 
impossible event. 

2. An event A, in particular, can be a single element subset of S, in which case 
it is known as elementary event. 

4-5-3. Some Illustrations — Examples. We discuss below some examples 
to illustrate the concepts of sample space ard event. 

1.Consider tossing of a coin singly. The possible outcomes for this experiment 
are (writing H for a "head" and T for a "tail") : H and T. Thus the sample space § 
consists of two points {@, @2), corresponding to each possible outcome or 
elementary event listed. 

i.e, S= (Qh, W2} = (H,T) and a(S) =2, 
where n(S) is the total number of sample points in S. 

If we consider two tosses of a coin, the possible outcomes ave HH, HT, TH, 
TT. Thus, in this case the sample space S consists of four points (G1, ®2, @s, Ws}, 
corresponding to each possible outcome listed and n(S)= 4. Combinations of these 
outcomes form what we call events. For example, the event of getting at lea t one 
head is the set of the outcomes {HH,HT,TH} = {@,, 2, 3}. Thus, mathematically, 
the events are subsets of S. 
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2. Let us consider a single toss of a die. Since there are six possible outcomes, 
our sample space S is now a space of six points (@1, We, ..., 6} where 0; cor- 
responds to the appearance of number i. Thus S={@,, @2, ..., ®6}={1,2,....6} and 
n(S}=6. The event that the outcome is even is represented by the Set of points 
(G2, Gs, We}. | 

3. Acoin and a die are tossed together. For this experiment, our sample space 
consists of twelve points {@), Gy, .....@i2}) where 0; (i = 1, 2, ..., 6) represents a 
head on coin together with appearance of ith number on the die and q; 
(i= 7, 8, ..., 12) represents a tail on coin together with the appearance of ith number 
on die. Thus 

S = {@1, Ms, ..., O2 }={(H,T)xX (1,2,....,6)} andn(S)= 12 

Remark. If the coin and die are unbiased, we can see intuitively that in each 
of the above examples, the outcomes (samplé points) are equally likely to occur. 

4. Consider an experiment in which two balls are drawn one by one from an 
urn containing 2 white and 4 blue balls such that when the second ball is drawn, 
the first is not replaced. 

Let us number thé six balls as 1, 2, 3,4, 5 and 6, numbers 1 and 2 representing 
a white ball and numbers 3, 4, 5, and 6 representing a blue ball. Suppose in a draw 
we pick up balls numbered 2 and 6. Then (2,6) is called an outcome of the 
experiment. It should be noted that the outcome (2,6) is different from the outcome 
(6,2) because in the former case ball No. 2 is drawn first and ball No.6 is drawn 
next while in the latter case, 6th ball is-drawn first and the second ball is drawn 
next. 

The sample space consists of thirty points as listed below: 

@; =(1,2) @2 =(1,3) @ =(1,4) @. =(1,5) Ws =(1,6) 

6 =(2,1) @ =(2,3) gp =(2,4) Wy =(2,5) M10. =(2,6) 
@1=3,1) 22,2) o3=(34) a =(3,5) Wis =(3,6) 
wis=(4,1) — @i7 =(4,2) re =(43) —@y =(4,5) my =(4,6). 

Wa =(5,1) 22 =(5,2) 23 =(5,3) Wx =(5,4) W2s =(5,6) 

as =(6,1) Wn =(6,2) ge =(6,3) = My =(6,4) —- 0 =(6,5) 

Thus 

S = {0)1, We, Ws, ..., Wo} and n(S) = 30 
=> § = {1, 2, 3,4, 5, 6}x {1, 2, 3,4, 5, 6) 
~ ((1, 1), (2, 2), (3, 3), (4, 4), (S, 5), 6, 6)} 

The event 

(i) the first ball drawn is white 

(ii) the second ball drawn is white 

(iii) both the balls drawn are white 

(iv) both the balls drawn are black 

are represented respectively by the following sets of points: 

{@1, Wz, Ws, Ws, Ws, Ws, 7, Ge, Wy, Dio}, 
(G1, We, O11, Di2, Wis, W17, Or, Wr, Wr, O27}, 
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(Gi, 6}, and 
{ @13, Mia, Mrs, Wis, W19, Dro, G23, W24, Wes, W28, W29, 30}. 
s, Consider an experiment in which two dice are tossed. The sample space § 
for this sala is' given by 
§ = (1,2, 3, 4, 5, 6}. x {1, 2, 3, 4,5, 6) 
and =n(S)=6X6= 36. 
Let E, be the event that ‘thie sum of the spots on the dice is greater than 12’, 
E£2 be the event that ‘the sum of spots on the dice is divisible by 3’, and E3 be the 
event chat: “the sum is greater than or equal to two and is less than or equal to 12’, 
Then these — are represented by the following subsets of § : 
=(6}, Es =S and a 
. = ((1, 2), (1, 5), (2, 1), (2, 4), (3, 3), 3, 6), (4, 2), 
(4, 5), (5, 1), G, 4), ©, 3), ©, 6)} 
Thus an (E;)=0, n (E2)=12, and n (E3)=36 
Here E is an ‘impossible event’ and E; a ‘certain event’. 
6. Let E denote the experiment of tossing a coin three times in succession or 
tossing three coins at.a time. Then the sample space S is given by 


S = {H, T} x (H, T} x (H, T} 
= { H,T } x (HH, HT, TH, TT} 
= (HHH, HHT, HTH, HTT, THH, THT, TTH, TIT} 
= {@), M2, Ws, ..., Os}, Say. 


If E; is the event that ‘the number of heads exceeds the number of tails’, £2, 
the event of ‘getting two heads’ and E;, the event of getting ‘head in.the first trial’ 
then these are represented by the following Sets of points : 

_ Ey = (0), @2, Os, Ws), 
E2= (Q2, @s, Ws} 

and Es = (Oh, @2, Ws, Wa). 

7. In the foregoing examples the sample sapce is finite. To construct an 
experiment in which the sample sapce is countably infinite, we toss a coin 
repeatedly until head or tail appears twice in succession. The sample space of all 
the possible outcomes may be represented as : 

S = (HH, TT, THH, HTT, HTHH, THTT, THTHH, HTHTT, ...}. 

4-5-4, Algebra of Events. For events A, B, C 

(i) AUB= {foe S$: oeEeAorwe B} 
(ii) AN B= {mae §: me€e A andwe B} 
(iii) A(A complement) = {we S:@o€A } 
(iv) A-B= [oe S:@eE ae B} 


(v) Similar generalisations for re Ai, a Ai, 4 A; etc. 


(vi) A cB = for every @ € A. we B. 
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(ii) BD ADACEB. 
(viii) A = B if and only if A and B have the same elements, i.e., if ALC’ B 
and Bc A. 
(ix) ‘A and B disjoint ( mutually exclusive ) => A © B= (null set). 
(x) AUB can be denoted by A + B if A and B are disjoint. 


(xi) A AB denotes those w belonging to exactly one of A andB, i.e., 
AAB=ABUAB 
Remark. Since the events are subsets of 5S, all the laws of set theory viz., 
commutative laws, associative laws, distributive laws, De-Morgan’s law, etc., hold 
for algebra of events. . 
Table - Glossary of Probability Terms 


. Statement Meaning in terms 
of set theory 
1. At least one of the events A 
or B occurs. @e AUB 
2. Both the events A and B occur. -@e ANB 
3. Neither A nor B occurs oe ANB 
4. EventA occurs and B does not 
occur ae ANB 
5. Exactly one of the events A or B 
occurs. we AAB 
6. Not more than one of the events . 
A or B occurs. oe ANB)UANB VLAN B) 
7. If event A oceurs, so does B AcB 
8. Events A and B are mutually ex- 
clusive. An B=o 
9. Complementary event of A. A 
10. Sample space universal set S 


Example 4-11. A, B and C.are three orbitrary events. Firid expressions for the 
events noted below, in the context of A, B and C. 
(i) only A occurs, 
(ii) Both A and B, but not C, occur, 
(iti) All three events occur, 
(iv) At least one occurs, 
(v) At least two occur, 
(vi) One and no more occurs, 
(vii) Two and no more occur, 
(viii) None occurs. 
Solution. 
(i) ANBNCE, (i ANBNC, (ii) AN BOC, 
(iv) AUBUC, ” 
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(vy) (ANBNC) UV (ANBNOG) vu (ANBOC) U (ANBNC) 
(vi) (ANBNC)U(A ABANC)U(ANBAC) 
(vii) (ANBOC)U(ANBANC)U(ANBNC 
(viii) ANBOC or AUBUC 
EXERCISE 4(b) 
1. (i) If A,B and C are any three events, write down the theoretical expressions 
for the following events: 
(a) Only A occurs, (b) A and B occur but C does not, 
(c) A,B,andC all the three occur, (d) at least one occurs 
(e) Atleast two occur, (f) one does not occur, 
(2) Two do not occurs, and (hk) None occurs. 
(ii) A, B and C are three events. Express the following events in appropriate 
symbols: 
(a) Simultaneous occurrence of A,B and C. 
(b) C currence of at least one of them. 
(c) A, B and C are mutually exclusive events. 
(d) Every point of A is contained in B. a 
(e) The event B but not A occurs. {[Gauhati Univ. B.Sc., Oct.1990] 
2. A sample space § contains four points x;, x2,x3 and x4 and the values of a 
set function P(A) are known for the following sets : 
A, =(%,%3) and P(A;) =< : Az=(Xs,%4) and P(A;)= 6 ; 


Az = (X1, X2, Xs) and P(A3)= Ag = (X2, Xs, 4) and’ P(As) = a 


Show that : ; 

(i) the total number of sets (including the “null” set of number points) of points 
of x is 16. 

(ii) Although the set containing no sample point has zero probability, the 
converse is not always true, i.e., a set may have zero probability and yet it may be 
the set of a number of points. 

3. Describe explicitly the sample spaces for each of the followin g experiments: . 

(i) The tossing of four coins. 

(ii) The throwing of three dice. 

(iti) The tossing of ten coins with the aim of observing the numbers of tails 
coming up. 
(iv) Twocards are selected from a'standard deck of cards. 

(v) Four successive draws (a) with replacement, and (6b) without replace- 
ment, from a bag containing fifty coloured:balls out of which ten are white, twenty 
blue and twenty red. 

(vi) A survey:of families with two children is conduced and the sex of the 
children (the older child first) is recorded. 

(vii) A survey of families with three children is made and the sex of the 
children (in order of age, oldest child first) are recorded. 


it 
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(viii) Three distinguishable objects are distributed in three numbered cells. 
(ix) A poker hand (five cards) is dealt from an ordinary deck of cards. 
(x) Selecting r screws from the lot produced by a machine, a screw can be 
defective or non-defective. 

4, In an experiment a coin is thrown five times. Write down the'sample space. 
How many points are there in the sample space? 

5. Describe sample space appropriate in each of the following cases : 

(i) n-tossés of a coin with head or tails as outcome in each toss. 
(ii) Successive tossés of a coin until a head tums up. 
(iii) A survey of families with two children is conducted and the sex of the 
children (the older child first) is recorded. 
(iv) Two successive draws, (a) with replacement (6) without replacement, 
from a bag containing 4 coloured toys out of which one is white, one black and 2 
red toys. | [M.S.Baroda Univ. B.Sc., 1991] 

6. (a) An experiment consists of tossing an unbiased coin until the same result 
appears twice on succession for the first time. To every possible outcome requiring 
n tosses attribute probability 1/2". Describe the sample space. 

(b) A coin is tossed until there are either two consecutive heads or two 
consecutive tails or the number of tosses becomes five. Describe the sample space 
along with the probability associated with each sample point, if every sequence of 
n tosses has probabilty 2. [Civil Services (main), 1983] 

7. Urn 1 contains two white, one red and 3 black balls. Urn 2 contains one 
white, 3 red and 2 black balls. An experiment consists of first selecting an urn and 
then drawing a ball frém this um. Define a suitable sample space for this 
experiment. 

8. Suppose an experinient has n outcomes Aj, A2, ..., A, and that it is repeated 
rtimes. Let x1, 2, ...,X,record the number of occurrences of Ai, A2, ..., An . Describe 
the sample space. Show that the number of sample points is * 

n+r—-1 
r~1 

9, A manufacturer buys parts from four different vendors numbered 1,2, 3 and 
4. Referring to orders placed on two successive days, (1,4) denotes the event that 
on the first day, the order was given to vendor 1 and on the second day it was given 
to vendor 4. Letting A represent the event that vendor 1 gets at least one of these 
two orders, B the event that the same vendor gets both orders and C the event that 
vendors 1 end 3 do not get either order. List the elements of : 

(a) entire sample space, (b)A, (c)B, (d)C, (e)A, (/)B, 

(QBUC, (RNANB, (iI) ANC, (j)AUB, and (k)A-B 

(Hint. (a) The elements of entire sample space are ° 

(1,1); (1, 2); (1, 3); (1, 4)3 (2, 0); (2, 2); (2,3); (2,4); 
(3, 1); (3, 2); (3, 3); (3, 4); 4, Ds (4, 2); (4, 3); 4, 4). 


« 
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(bj) The‘elements of A are 
— 031, 2), EC, 4) @1 1); (3, 1); (4, 1). 
(c) The elements of B are: (1, 1); (2, 2);-@, 3) and (4, 4). 
(d) The elements of C are (2, 2); (2, 4); (4, 2); (4, 4): 
(e} The element of A are: 
(2, 2); (2, 3); (2, 4); (3, 2); (3, 3); (3, 4); (4, 2)s (4, 3); (4, 4). 
(f) The elements of B are: (1, 2); (1, 3); (1, 4); (2, 1); (2, 3); (2, 4); 
(3; 1); (3, 2); 3,4); 4, 1D; , 2); (4, 3). 
(2) The elements.of B U C are (1, 1);.(2, 2); (3, 3); (4, 4); (2, 4); (4, 2). 
(h) The elements of A 4 B are (1, 1). 
(i) ANC=6 
(j) Since AO B=A CB. The elements of 4.0 Bare (2, 3); (2,4); (3, 2); 
(3, 4); (4, 2); (4, 3). 
(k) The elements of A —B are (1, 2); (1, 3); (1, 4); (2, 1); (3, 1); (4, 1). 


4-6. Probability — Mathematical Notion. We are now set to give the 
mathematical notion of the occurrence of a random phenomenon and the. mathe- 
matical notion of probability. Suppose in a large number of trials the sample space 
S contains N sample points. The event A is defined by a description which is 
satisfied by V4 of the occurrences. The frequency interpretation of the probability 
P(A) of the event A, tells us that P(A)=N,/N. 

A purely mathematical definition of probability cannot give us the actual yalue 
of P(A) and this must be considered_as a function defined on all events. With this 
in view, a mathematical definition of probability is enunciated as follows: 

"Given a sample description space, probability is a function which assigns a 
non-negative real number to every event A, denoted by P(A) and és called the 
probability of the event A." 


4-6-1. Probability Function. P(A) is the probability function defined ona 
0-field B of events if the following properties or axioms_hold : 

1. For each A € B, P(A) is defined, is real and P(A) >°0 

2. P(S) = 1 

3. If (A,)} is any finite or mer sequence of disjoint events.in B,-then 


P¢ U A)= P(A;) (4-4) 
i=} 
The above three axioms are termed as the axiom of positiveness, certainty and 
union (additivity), réspectively. 
Remarks. 1. The set function P defined on o-field B, taking its values in the 
real line and satisfying the above three axioms is called’the probability measure. 
2. The same definition of probability applies to uncountable sample space 
except that special restrictions must be placed on S and its subsets. It is important 
torealise that for a complete description of a probability measure, three things must 
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be specified, viz., the sample space S, the o-field (o-algebra) B formed from certain 

-Subset of 5 and set function P. The triplet (S; B, P,) is often called the probability 
space. in most elementary applications, S is finite and the o-algebra B is taken to 
be the collection of all subsets of 5. 

3. It is interesting to see that there are some formal statements of the properties 
of events derived from the frequency approach. Since P(AJ=N,/N, it is easy to see 
that P(A) 2 0 , as in Axiom 1. Next since Ns =N, P(S)=1, as in Axiom 2. In case 
of two mutually exclusive (or disjoint) events A and B defined by sample points 
Na and Ng, the sample points belonging to AU B are Na + Ng. Therefore, 


P(AUB ) = NA te Ma Ne P (A) +P (B), asin axiom 3. 


NNN 
Extended Axiom of Addition. If an event A can materialise in the occurrence 
of any one of the pairwise disjoint events A;, A2, ... So that 


A=U A;; A;N Aj= (i #/) 
i= 


then 


P(A)=P(u Ad= Y, P(A) A1) 
oT i=1 
Axiom of Continuity. If B,, By, ...., Bs, ... be acountable sequences of events 
such that_ 


(i) B35 Bis, G=1, 2, 3, ...) 
and 
(ii) A B,=6 
n=l 


i.e., if each succeeding event implies the preceeding event and if their simul- 
taneous occurrence is an impossible event then 
‘lim P(B,)= 0 w»-(2) 
R~joo 
We shall now prove that these two axioms, viz., the extended axiom of addition 
and axiom of continuity are equivalent, i.e., each implies the other, i.e., (1) <= (2). 


Theorem 4-1, Axiom of continuity follows from the extended axiom of addition 
and vice versa. 
Proof. (a) (1) =>(2). Let {B,} be a countable sequence of events such that 
B, 2B,283D easy >B,>D Basi Dd. 
and let for any n = 1, 
ry B=d (*) 
kon 
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Bast 
Ba+2 


Bas (” Bins? 


B, ry Bnet 


Then it is obvious from the diagram thai 
B, = B, Bas U Bast Bns2 LO rene & | ( a) B) 


k2n 
= B,=( VY BeBisi UC 4 Bt). 
=R 
where the events By B's; (k=n, n+l, . Jk are epanwise disjoint and each is disjoint 
with © Bx. 
k2n 


Thus B, has been expressed as the countable union of pairwise disjoint events 
and hence. by the extended axiom of addition, we get 


P(B,) = x P(By B’e41) + PC hf By) 
2n 


= ¥ PBB), (**) 
k=n 
since, from (*) 
P( A B)= P(o)= 0 
k2n 
Further, from (**), since 


>) P (Br B’es1) = PCB) $ 1, 
k=1 
the right hand sum in (**), being the remainder after n terms of a convergent series 
tends to zero as n—2. 
Hence 


lim P (B,)= lim y P (Bz B’s41)= 0 


R—)co k=n 
Thus the cui axiom of addition implies the axiom of continuity. 


(b) Conversely (2) => (1), i.e., the extended axiom of addition follows from 
the axiom of continuity. 
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Let (A,) be a countable sequence of pairwise disjoint events and let 


do 
A= U A; 


i=l 


= ( OOK S Ai) (3) 
i=1 t=n+1 


Let us define a countable ss {B,) of events by 


B,= U A; (4) 
i=n ie 
Obviously B, is a decreasing sequence of events, i.e., 
B, > Bz >... D BaD Bas, D.... (5) 
Also we have 
A= ( U Ay U Basi ...(6) 


ie 1 

Since A;’s are pairwise disjoint, we get 
. A:T Basi = >, (= 1, 2, 0 2) (6) 
From: (4) we see that if the event B, has occurred it implies the occurrence of 
any one of the events A,+1, Ai+2,... Without loss of generality let us assume that 
this event is A; @=n+1, n+2,...). Further since A;’s are pairwise disjoint, the 
occurrence of A; implies that events A;,1,Ai+2,... do not occur leading to the 

conclusion that B;,,, Bi. 2, ... will not occur. 


=> a B;= o 7) 


From (5) and (7), we observe that both the conditions of axiom of continuity 
are satisfied and hence we get 


lim P(B,)= 0 (8) 
R—>pco 
From (6), we get 
P (A)= P{( u Ai) U Basi] 
i=1 
= >) P(A) + P(Ba+1) 
i=l 
(By axiom of Additivity) 
= P( on Ai)= Jim y P(A) + lim (B.+1): 


i=l AHO 5 R—peo 


Theory of Probability - 4-29 


= >) P(A), [From (8)] 
\ i=1 
which is the extended axiom of addition. 


THEOREMS. ON PROBABILITIES OF EVENTS 


Theorem 4-2. Probability of the impossible event.is zero, i.e.,P (%) = 9: 
Proof. Impossible event contains no sample point and lience the certain event 
S and the impossible event $-aré‘mutually exclusive. 


Hence SUd=S 

a P(SUo)= P(S) 

= P(S)+ P(6)= : (S) [By Axiom 3 
> P(o) = 


Remark. It may be noted P(A a does not imply that A is seceushin: an 
empty set. In practice, probability ‘0’ is assigned to the events which are SO rare 
that they happen only once in a lifetime. For example, if a person Who does not 
know typing is asked to type the manuscript of a book, the probability of theevent 
that he will iype itcorrectly without any mistake is 0. 

As another illustration, let us consider the random tossing of acoiin. The event 
that the coin will-stand erect on its edge, is assigned the probability 0. 

"The study of continuous random variable provides another illustration to the 
fact that P(A)=0, does not imply A=), because in case ‘of continous random 
variable X, the proability at a point is always zero, i.e., P(X=c)=0 [See'Chapter 5]. 


Theorem 4-3. Probability of the complementary event A of A is given by 


P(A) = 1 ~- P(A) 
Proof. A and A are disjoint events. 
Moreover , AUA=5§ 


From axioms 2 and 3 of probability, we have . 
P(AUA)= P(A)+ P(A)= P(S)= 1 


=> P (A)= 1-P(A) 
Cor.1. Wehave P(A)= 1-P(A) 
=> P(A) <1 (-.: P(A) 20) 


Cor.2. P()= 0,sinceo= 5 
and P(o)= P(S)= 1-P(S)= 1-1= 0. 
Theorem 4-4, For any two events A and B, 
P AQB)= P(B)-P(ANB) (Mysore Univ. B.Sc., 1992] 
Proof. 
ANB and AQB are disjoint events and 
(AN B)UAQB)=B 
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S 
Hence by axiom 3, we get 
A P (B)= P(AQNB)+ P(AQNB) 
(4B) = P (AQB)= P-(B)- P(ANB) 
Remark. Similarly, we shall get 
B P(AQNB)= P(A)- P(ANB) 


Theorem 4-5. Probability of the union of any two events A and B is given by 
P(AUB)= P(A)+ P(B)-— P(AMB) 

Proof. AWB can be written as the union of the two mutually disjoint events, 
Aand BOA. 
: P (AUB)= P[AU(B.QA)]= P(A)+ P(BNA) 

= P(A)+P (B)-P (ANB) (c.f. Theorem 4-4) 


Theorem,4-6. if B CA, then 
(i) P(AQB)= P(A)— P(B), 
(ti) P (6) <P (A) 
Proof. (i) When B c A, Band AB are 
mutually exclusive events and their union is A 
Therefore 
P(A)= P({BU(ANB)) 
=P(B)+ P(ANB) [By axiom 3] 
=> P(AQB)= P (A)—- P(B) P 
(ii) Using axiom 1, 

P(ANB)20 => P(A)- P()20 
Hence P(B) < P(A) a 
Cor. Since (ANB)cCA and (ANB)cB, 

P(AQB) < P(A) and P(ANB) < P@) 
4-6-2. Law of Addition of Probabilities 


Statement. /fA and B are any two events [subsets of sample space S] and are 
not disjoint, then 


P(AUB)= P (A4)+ P(B)- P(ANB) (4-5) 
Proof. 
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We have 
AUB= AU(ANB) 
Since A at:.i (A 7 B) are disjoint , 
P(AUB)= P(A)+ P(AQB) 
= P(A)+ [P(A NB)+ P(ANB))+ P (A.B) 
= P(A)+ P[G@QAB)U(ANB)]- P(ANB) 
[° (A QB) and (ANB) are disjoint ] 

=> P(AUB)= P(A)+ P(8)- P(ANB) 
Remark. An alternative proof is provided by Theorems 4-4 and 4-5. 


4-6-3. Extention of General Law of Addition of Probabilities. For n events 
Ai, A2, ..., Aan, we have 


n an 
P(U A) = ZEP(A)- LE P(ANA)+ LEE PIAA AL) 
iz] i=] 


lsi<jsa Isi<j<ks 
~ i+ (-1)) P(ALNALN... AAs) (4:6) 
Proof. For two events A; and A>, we have 
P(A, U A2) = ? (Ai) + P (A2) - P(A \ A2) .(*) 


Hence (4-6) is true forn = 2. 
Let us now suppose that (4-6) is true for n =r, (say). Then 


P( We z (Pia) — LIP (AINA) +... +t- P(A NA2N...04,) 
i=] i=l lsi<jsr wo (#* 
Now 


r+1 r 
P(UA)= PICU Ar) U Aral 
i= 1 i=1 
= P( U A;).+ P (Ar+1)- PLC U A;) A Ar+1) J. ... {Using (*%] 
iz 7 | 


P ( U Aj)+ P(Ar+1)-P[ U (AiNAr+1)] (Distributive Law) 


i=1 t=1 


r 
XL P(A)- LL P(A; A) + ee 
i=l lsi<jsr 


wt (—1)7) P(A; NAN... AA,) + P (Aro1) 


- P[ U (Ai A Arey] ...[From (**)] 


{= 
r+1 
= EP(A)- LE P(A;NA)+ ... 
i=l lsi<jsr 
+(-1)7' P(1NA2N... AA,) 
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-[E P(A A Arsi)— EE P (AIA A;A Ass) 
i=] Isi<jSr 


+. — _ P(A, VN A220... AAPA Ara) J ...[From (**)] 


=> PCUA)=" EP (A) - [ LE P(A A) + E Plas OAr+1)] 
i=] lsi<jsr i= 
+ (-1)P[@ nin eV Arer ) | 
r+l 
= £ P(A)- ZED P (AINA) 
i=] ee a 
s+(— 1) P(A NA2 2... A Abst) 
Hence if (4-6) is true for n=r, it is ae true for n = (r+ 1). But we have proved 
in (*) that (4-6) is true for n=2. Hence by the principle of mathematical induction, 
it follows that‘(4-6) is true for all positive integral values of n. r 


Remarks. 1. If we wriie 
P (Ai) = pi , P (Ai V Aj) = py , P (Ai Aj 0 As) = Diz 
and soon and 


i=l i= 

§2= ZL | LL P (A;NA)) 
1Si<jSa lsi<jSa 

§3;= XZ Dix and so on, 
lsi<j<ksSa 

then 
n 
P(U A)= Si- Sat S~ + IS, (4-64) 


i= 


2. If all the events A;, (i= 1, 2, ..., 2) are mutually disjoint then ( 4-6 ) gives 


P, o Aj) = E P (Aj) 
i=] i=] 

3. From practical point of view the theorem can be restated in a slightly 
different form. Let us suppose that an event A can materialise in several mutually 
exclusive forms, viz., Ai, A2, ..., Ag Which may be regarded as that many mutually 
exclusive events. If A happens then any one of the events Aj, (= 1, 2, ..., m) must 
happen and conversely if any one of the events A;, (i = 1, 2, ..., n) happens, then A 
happens. Hence the probability of happening of A is the same as the probability of 
happening of any one of its (unspecified) mutually exclusive forms. From this point 
of view, the total probability theorem can be restated as follows: 

The probability of happening of an event A is the sum of the. probabilities of 
happening of its mutually exclusive forms A\, Ao, ..., Aa. Symbolically, 
'P (A)=P (A;)+ P (Az) + ... « P (An) (4-6b) 
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The probabilities P(A,),P(A2), .... P(A.) of the mutually exclusive forms of 
Aare known as the partial probabilities. Since P(A) is. their sum, it may be called 
the total probability.of A. Hence the name of the theorem. 

Theorem 4-7. (Boole’s inequality). For n events A, A2, ..., An, We have 


(a) P(A) EP(A)-(n-1) AAT) 
i=] [ 


n na 
(b) P( UA)S fT P(A) | ; ™; ...(4-Ta) 
i=1 i=] 
[Delhi Univ. B.Sc. (Stat Hons.), 1992, 1989] 
Proof. (a) P(Ai1UA2)= P(A1)+ P(A2)— P'(Ai N A2) <1 
= P (A, A2) 2 P (Ai) + P (A2)—- 1 (*) 
Hence (4-7) is true for n= 2. 
Let us now suppose that (4-7) is true for n=r (say), such that 


r r 
P(N A)2 ZZ P(A)~-C-) (**) 
i=1 i=1 
Then 
r+l r " 
P ( a) Aj)= P ( A; A, +1) 
i=] i=l]: : 
r ‘ 8 
>P( 0 Aj)+ P(Ars1)— 1 [From (*)] 
i=] 
r 
=> 2 P(A)-(r-1)+ P(Ar41)-—1 [From (**)] 
j=1 
r+i r+ 
= P(O A)2 & P(Aj)~r ) 
i=] i=] 
=> (4-7) is true for n =r + 1 also. : 


The result now follows by the principle of mathematical induction. 
(b) Applying the inequality (4-7) to the events Ai, Az, ..., As, we get 
PALANAN ...0-AUj 2 [P (Ai) +P (A) +... + P Ay) - (n- 1) 
=(1—P(A1)] + (1 — P(A2)] + ... + [1 — P(An)] —(n- 1) 
=1-P(A:)—P (A2)—...—P (Ay) 
= P(Ai)+P(A)+ ..+ P(A) 2=1-—P (1) NA22N...'A An). 
=1-P @U0A,U... UA,). 
=P(A,\UALU...UAa)  — 
=> P(A; VA2U... UA,) SP (A)) + P (Az) +... + P (An) 
as desired. 
Aliter for (b) i.e., (47a). We have 
P (Ay U A3)'= P (Ai) + P (A2) — P (A417 A2) 
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<P (A;) + P (A2) [°. P (Ai WN A2) 29] .».(##8) 
Hence (4-7a) is true for n = 2. 
Let us now suppose that (4-7a) is true for n=r, (say), so that 
P( UA)S 
i=] 


i= 


E P(A) (#448) 
=) 


8 
Now 


r+l r 
P( U A)= PC U A;U Ara) 


i= 1 i=] 
<P ( vu A) + P (Art) [Using (+#+)] 
S$ Bs P (Aj) + P (Ar+1) [Using (##**)) 
i=l , 


r+1 r+1 
= P ( U Aj) < L P (A;) 
i=] i=l 
Hence if (4-7a) is true for n=r, then it is also true for rn=r+1. But we have 
proved in (***) that (4-7a) is true for n=2. Hence by mathematical induction we 
conclude that (4-7a) is true for all positive integral values of n. 
Theorem 4-8. For n events Ai, Ao, ..., Aa, 


a a . 
P[ vu A; ] 22 P (A) - »» ‘P(A; Aj) 
i=] i=1 1Si<jsa 
[Dethi Univ. B.Sc. (Stat Hons.), 1986] 
Proof. We shall prove this theorem by the method of induction. 
We know that 
P (A; U A2 U As) = P (A) + P (A2) + P (As) 
=~ [P(A: -Y A2) + P(A2 - As) + P(A; Cr Ai)] + P(A; ANAND As) 


3 3 
=> P( UA)2 ZP(A)- LE P(AiNA) 
i=1 i=l lsi<js3 
Thus the result is true for n=3. Let us now suppose that the result is true for 
n=r (say), So that 


r 
P( VA)2 © P(A)- LT PANNA) ..(*) 
i=1 i=1 lsi<jsr 
Now 


r+i vr 
P( U AJ=H=P( U A;UAS41) 


i=l i=] 


r roe ° 
=P( UV A)+P (Ars1)—P[( U A) OAs 1] 
. j 7 | 


t= t= 


| 
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= P ( U Ai) + P (Ar+1) - P*[ U (AiN A,+1)] 


i=] i=] 


t 

r 

>| EP(A)- EE P(AinA) 
i=1 lsSi<j<r 


+P (Ars) -P [ UA Ar a(#*) 


i= 
Ss ; : [From (*)} 
From Boole’s en (c.f. Theorem 4-7 page 4-33), we get 


nS vu (AiN Ar+1)] $ z oe OA, 41) 


=> eh v (A; A Ayo 1))-2 = z pe eens) 
pat 
.. From (* \ we get 


r+l1 r+1 r 
P(U A)2 e aa Ly P (AiN Aj) - z P (Ai N Ars) 
i=l i ae as t=1 


= P ( Ss 'S P(A)- LL P (A; ON A,) 
i=1 i=] lsi<js r+l 
Hence, if the theorem is true for n = 7, it is also true for n = r +1. But we have 
seen that the result is true for n = 3. Hence.by mathematical induction, the result 
is true for all positive integral values of n. 


4-7, Multiplication Law of Probability and Conditional Probability 
Theorem 4-8, For two events A and B 
P(AQB)= P(A). P(B | A),, P(A) >0 , (4-8) 
= P(B).P(A |B), P(B)>0 
where P(B | A) represents the conditional probability of occurrence of B when the 
event A has already happened and P(A|B) is the conditional probability of 
happening of A, given that B has already happened. 
Proof. 
_ 2A) . pap _ 2G). _ 2B) 
P (A) = n(S) ° P (B) = n(S) and P (AQ B)= nS) (*) 
For the conditional event A | B, the favourable outcomes must be one of the 
sample points of B, i.e., for the event A |B, the sample space is B and out.of the 
n(B) sample points, n(ACB) pertain to the occurrence of the event A. Hence. 


= n(AaB) 
AIDS 1 ®) 
Rewriting (*), we get 


n(By) n(Ansb)_ 
P(AQNB)= 7S) oo” P(B) . P(Al|B) 
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Similarly we can prove : 
_ m(A) n(AnB) _ 
P(ANB)= °° ea P(A). P(B\ A) 
_ P(AnB) P (AN B) 
Remarks. 1. P (B|A)= ——— P(A) and P (A\|B)=: PB) 


Thus the conditional probabilities P (B | A) and P (A | 8) are defined if and 
only if P(A}* 0 and P(B)* 0, respectively. 
2. (i) For P (8)>0, P(A] B) <P (A) 
(ii) The conditional probability P (A | B) is not defined if P (B) = 0. 
(iii) P (B | B)=1. 
3. Multiplication Law of Probability for Independent Events. If A = B 
are independent then 
P(A|B)=P(A) and P(B|A)=P (8) 
Hence (4-8) gives : 
P(AQB)=P (A) P (B) (48a) 
provided A and B are independent. 


4-7-1. Extension of Multiplication Law of Probability. For n events 
A, A2, soaG Ap, we have 
P(Ai NA1M... AAs) = P (Ai) P (A2 | Ai)P (A; | Ai NA2).. 

x P(A, | Ar QV A2... A Aan-1) (4-85) 
where P(A; | A;AAN...0 A:) represents the conditional probability of the 
event A; ; given that the evenis Aj, At, ...; Ar have already happened. 

‘Proof. We hive for three events Ai,A2, and A; 
P(A, NANA; )= PLA, (A2 OAs )} ” 
= P(A,) P(A. As | Ar) 
= P (Ay)'P*(A2 | Ai) P (As | A, 0 A;) 
Thus we find that (4-85) is true for i=2 and n=3. Let us suppose that (4-85) is 
true for n=k, so that 
P (A, VA2N... A Ag) =P (Ai) P (Az | At) P (As | At AA2) 
... P (Ai LAAAA wee CV Ag-1) 
Now 
P [(Ai A Az... Ag) OV Asai) = P (Ar A A2 0... OAs) 
xP (Asst | AiN Arn... VA‘) 
= P(A:) P(A | Ai)... P (As | AL. MQ A220... A Ag-1) 
. x P (Aner | Ar NAp A... OAD) 
Thus (4-8b) is true for n=k+1 also. Since (4-85) is true for n=2 and ni=3;by 
the principle of mathematical induction, it follows that (4-85) is tree for all positive 
integral values of n. 
Remark. If A;, Ao, ..., A, are independent events then 
PlAn | A)= nee P (A3 | Ay VA2)= P (As) 
.. P(A, LA; AN... OV Ag-1) = P (A)) 
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Hence (4-8b) gives : 
P (A, ON A200... A An) = P(A1) P (Az)... OP (An), ..(4-8c) 
provided Aj, Aa, ...,A, arc independent. 
Remark. Mutually Exclusive (Disjoint) Events and Independent Events. 
Let A and B be mutually exclusive (disjoint) events with positive probabilities 
(P (A) > 0, P (B) > 0), i.e., both A and B are possible events such that 


AQNB= 6 > P(AQB)= P()= 0 Ai) 
Further, by compound probability theorem we have 

P(AQB)= P(A).P(B\A)= P(B) P(A|B) ..(ii) 
Since P (A) #0; P(B)# 0, from (i) and (ii) we get 

P(A|B) = 0-#P(A), P(B|A)= 0# P(B) ..iii) 


= AandB arc dependent events. 

Hence two possible mutually disjoint events are always dependent (not inde- 
pendent) events. 

However, if A and B are independent events with P (A)>0 and P(B)>0, 
then ) 

P(AQB)= P(A) P(B) # 0 

= AandB cannot be mutually exclusive. 

Hence two independent events (both of which are possible events), cannot be 
mutually disjoint. 

4-7-2. Given n independent events Ai, (i =1,2,...,.n) with respective prob- 
abilities of occurrence p;,_ to find the probability of occurrence of at least one 
of them. 

We have 

P(A)= pi = P(A)=1-p;; i= 1,2,..,0 


[.. (Ay VA2U...UA,) = (AL OA22....A An) (De-Morgan’s Law)} 
Hence the probability of happening of at least one‘of the events is given by 


P(A; AzU vee Ag) = 1-P (A, (DAU :.. U A,) -(*) 
= 1-P (Ai 1X A20 ...0 An) 
= 1-—P (A) P (A2) -.. P (A,) ooo (#4) 


[c.f. Theorem 4-14 page 4-41" 
1—[(1 — fr) (1 — pa) ... 1 — pa) 


n R n 
Lpi- LL (pipj)+ LE (pipj pr) 
i=1 ijel i,j,k=1 
i<j t<j<k 
we t(-lY” (Pi Pr... p.)| 
Remark. The results in (*) and (**) are very important and are used quite often 
in numerical problems. Result (*) stated in words gives: 
P (happening of at least one of thie events Aj, Ao, ..., An | 
=1 - P (none of the events A), Aa, ..., A, happens) 


lt 
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or equivalently, 
P {none of the given events happens} 
=1 — P {at least one of them happens}. 
Theorem 4-9. For any three events A, B and C 
P(AUB|C)=P(AIOCO+P(BIQO-P(ANB|C 
Proof. We have 
P(AUB)= P(A)+ P(B)- P(ANB) 
=>PI(ANC)UBAC)I=P(ANC)+ P(BAC)-P(ANBNC) 
Dividing both sides by P (C), we get 


aaa”; nee cmemmaamenae aan ABAC) 
PC) = »P(C)>0 


PIANC) | PBAO PAANBAC) 
~ PC) P(C)” P(C) 


= a = P(A|C)+ P(B|C)-P(AQBIC) 


= P[(AUB){IC] = P(A(CJ+P(B(C)-P(ANBI{C) 
Theorem 4-10. For any three events A, B and C 
PAQNB| C+ P(ANB{| C=P(AI CO) 
Proof. P(ANB| C+ P(ANB|C) 
_~PAQBOC) | PANBAC) 


P(C) P(C) 
_ PANBA OC) + PANBAGC 
P(C) 
= (ie P(A(C) 
Theorem 4-11. For a fixed B with P (B)>0,P (A|8) is a probability 
function. [Dethi Univ. B.Sc. (Stat. Hons.), 1991; (Maths Hons.), 1992] 
Proof. 
P(AQB) 
(i) P(A| B= P(B) 20 
5 P(SAB) _ P(B) _ 
(ii) P(S| B)= P(B) ~ P(B) l 


(iii) If {A,) is any finite or infinite sequences of disjoint events, then 
Pi(y A.) OB | P((v A,.B)] 
rn 7) i ce 


_ oan _ P (A,B) 
“FP @) >| er |: 2, PAI) 


Hence the theorem. 
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Remark. For given B satisfying P(B) > 0, -the conditional probability 
P[-|B] also enjoys the same properties as the unconditional probability. 

For example, inthe usual notations, we have: 

(i) P [>| Bl =0 

(ii) P[A}B}= 1 - P{A | By 


(iii) PY A; |B) = » P[A; |B], 


where 4,, Ao, ..., A, are mutually disjoint events. 

(iv) P(A, V A, | B) = PA, | B) + P (A, [ 8B) - PG, A, | B) 

(vy) IfE CF, then P(E | B) < P(F [| B) 
and so on. 

The proofs of results (iv) and (v) are given in theorems 4-9 and 4°13 
respectively. Others are left as exercises to the reader. 

Theorem 4-12. For any three events, A, B and C defined on the sample 
space S such that B < Cand P(A) > 0, 

P(B | A) s P(C | A) 
P(CNA) 
Proof. P(C|A) = P(A) (By definition) 


— PIBACAA)U(BACN A) 
7 P(A) 


PIBANCN A), P(BACOA) 
P(A) ae P(A) A) (Using axiom 3) 


t 


Hf] 


PI(BAG|4)+(BACn A)] 
Now BcC => BaAC=B 
P(C| A) = P(B| A) + P(BAC|A) 
~ P(C| A) > P(B| A) 
4°7°3. Independent Events. An event B is said to be independent (or 
statistically independent) of event A, if the conditional probability of B given 
A i.e, P (B | A) is equal to the unconditional probability of B, i.e., if 
P (B|.4) =P (B) 
Since 
P(A NB) =P (B\A) P(A) =P (A\B) P (B) 
and since P (B | A) = P (B) when B is independent of 4, we must have 
P (4 { B) = P (A) or it follows that 4 ig also independent of B. Hence the 
events 4 and B are independent if and only if = 
P (4 4B) = P (A) P (B) (49) 
4-7-4, Pairwise Independent Events 
Definition. 4 set of events A,, A,, ..., 4, are said to be pair-wise independent 
if P(A; 4) =P (A) P(A) v 1s (4°10) 
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4-7-5. Conditions for Mutual Independence of n Events. Let S denote the 
sample. space for a number of events. The events in S are said to be mutually 
indepdendent if the probability of the simultaneous occurrence of (any) finite 
number of them is equal to the product of their separate probabilities. 

If Ai, Aa, ..., A, are n events, then for their mutual independence, we should 
have 

(i) P(A; NA;) = P(A;)P(A;), (i #f 52, f= 1,2,..,7) 
(ii) P(AiN A; As) =P (Ai) P (A) P (Ad), (64 f #455555 k=1,2,...,0) 


P (A, NA2M... OAn) =P (Ai) P (A2') ... P(An ) 
It is interesting to note that the above equations give respectively 
"Co, "C3; ..., "C, conditions to be satisfied by Aj, Ao, ..., An. 
Hence the total number Of conditions for the mutual independence of 
Ai, Aa, .., An 1S "C2 +"°C3+...+ "Cy. 
Since "Co+"°C, +"C2+...+ "Ca = 2", we get the required number of conditions 
as (2"- 1-72). 
In particular for three events Aj, A2 and A;, (n= 3), we have the following 
2? = 1~+3=4, conditions for their mutual independence. 
P (A, N A2) = P (Ai) P (A2) 
P (A, As) = P (A2) P (As) 
P(A, VAs) = P (Ai) P (As) 
. P(A; NA2NAs) = P (Ai) P (A2) P (As) (4-11) 
Remarks, 1. It may be observed that pairwise or mutual independence of 
events Aj, Ao, ..., Ag, iS defined only when P (A;)# 0, for i =1, 2, ..., n. 
2. If the events A and B are such that P (Ai) #0, P(B)#0 andA is 
independent of B, then B is independent of A. 


Proof. We are givén that 
P (A|B)=P (A) 
P(AMB)_ 
= PB) =P (A) 
=> P(ANB) = P(A) PB) 
PBOA) _ pa 
P(A) ~ [- P(A) #0 and ANB=BQOA] 
=> P (B|A)=P (8B), 


which by definition of independent events, means that B is independerit of A. 
3, It may be noted that pairwise independence of events does not imply their 
mutual independence. For illustrations, see Examples 4-50 and 4-51. 


Theorem 4-13. /f A and B are independent events then A and B are also 
independent events. 
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Proof. By theorem 4-4, we have 
P(AQB)= P(A) -— P(AQB) 
= P(A) —-—P(A)P(B) [°. AandB are independent | 
= P(A)(1-P(8)] 
= P(A) P(B) 

=> A andB are independent events . 

Aliter. P(A 1B) =P (A) P (8) =P (A) P(B| A) =P (8) P (AIB) 

Le., P(8|A) = P(B) = Bis independent of A. 

also P(A|B) = P(A) = Ais independent of B. 

Also P(B{A)+P(B\A)=1 => P(B)+ P(B|A)=1 

or P (B| A)=1-P (8)=P 6) 

B is independent of A and by symmetry we say that A is independent of 
B. Thus A and B are independent events. 

Remark. Similarly, we can prove that ifA and B aré independent events then 
A and B are also independent events. 

Theorem 4-14. /f A and B are independent events then A and B are also 
independent events. 

Proof. We are given P(A B)=P (A) P (B) 

Now P(AQB) = P(AUB)= 1-P (AUB) 
1-[(P(A)+P(B)-P(ANB)] 
1-[P(A) + P(B)'- P(A)P(B)] 

1- P(A)- P(B) + P(A)P(B) 

(1 - P(B)] - P(A) [1-P(B)] 
=[1- P(A)][(1- P(B)] =P(A)P(B) 

.. A-and B are independent events . 


Aliter. We know 

P(A|B) + P(A|B)= 1 
= P(AIB) +P(A)= 1 _ (c.f. Theorem 4-13) 
=> P(A\B)=1- P(A) = P(A) 


A and B are independent events. 
Theorem 4-15. /f A,B, C are mutually independent events then AUB andC 
are also independent. 
Proof. We-are required to prove: 
P((AUB)NC] = P(AUB)P(C) 
LHS. = P[(ANC)U(BNC)] [Distributive Law] 
=P(ANC) + P(BNC) - P(ANBNC) 
= P(A)P(C) + P(B)P(C) —- P(A)PB)PC) 
[‘. A, B and C are mutually independent ] 
=P(C) [P(A) + P(B) — P(ANB)] 
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= P(C) P(AUB)= RHS. 
Hence (AUB ) and C are independent. 
Theorem 4-16. /fA,B and C are random events ina sample space and if A, 
B and C are pairwise independent and A is independent of (B U C ), then A, B and 
C are mutually indepéndent. 
Proof. We are given 
P(AQB)= P(A)P(B) 
P(BOAC)= P(B)P(C) (*) 
P(AQNC) = P(A)P(C) “ 
P[AQN(BUC)] = P(A)P(BUC) 
Nc wv P[LAN(BUC)] =P[(ANB)uU(ANC)] 
=P(ANB)+ P(ANC)—- P{ANB)A(ANC)] 
= P(A).P(B)+P(A).P(C) -—- P(ANBOC)) ...(**) 
and P(A)P (BUC) = P(A) [P(B)+P(C)-P (BNO) 
=P (A).P(B)+P(A)P(C)-P(A)P(BNC) _ ...(***) 
From (**) and (***), on using (*), we get 
P(ANBQC)=P (A) P(BOC)=P (A) P(B) P(C) 
Hence A, B, C are mutually independent. 
Theorem 4-17. For any two events A and B, 
P(AQNB)<P(A)SP(AUB)<P(A)+P(B) 
[Patna Univ. B.A.(Stat. Hons.), 1992; Delhi Univ. B.Sc.(Stat. Hons.), 1989] 
Proof. We have 
A=(ANB)U(ANB) 
Using axiom 3, we have 
P(A)=P[(AQNB)U(AQNB)j)=P(AQNB)+P(ANB) 


Now P[(ANB)2 0 (From axiom 1) 
* P(A)2 P(ANB) .(*) 
Similarly P(B)2>P(ANB) 

=> P(B)-P(ANB)20 

Now P(AUB)=P(A)+[P(B)-P(ANB )] (**) 
2 P(AUB)2P(A) = P(A)S P(AUB) (#**) 
Also P(AUB)< P(A)+ P(B) [From (**)] 


Hence from (*), (#*) and (***), we get 
P(ANB)SP(A) S P(AUB)S P(A)+ P(B) 
Aliter. Since A 1 B < A, by Theorem 4-6 (ii) page 4-30, we get 
P(AQB)SP(A). 
Also AC(AUB) = P(A) S$ P(AUB) 
P(AUB)=P(A)+P(B)-P(ANB) 
Combining the above results, we get 
P(AQNB)SP(A) < P(AUB)S P(A)+ P(B) 
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Example 4-12. Two dice, one green and the other red, are thrown. Let A be 
the event that the sum of the points on the faces shown is odd, and B be the event 
of at least one ace (number 1’). 

(a) Describe the (i) complete sample space, (ii) events A, B, B, ACB, 
AUB, and ACB and find their probabilities assuming that all the 36 sample 
points have equal probabilities. 

(b) Find the probabilities of the events : 

(i) (A UB) (ii) ACB) (iii) (ANB) (iv) AKB) (v) ARB) (vi) AVB) 
(vii) (A UB) (vii) AN (AUB) (ix) AU(ANB)(x)(A1 B)and(B | A), and 
(xi)(A |B) and(B1A).. 

Solution.,(a) The sample space consists of the 36 elementary events . 

(1,1) 3 (1,2); (1,3) 5 .4)3 (1,5) 3 (1,6) 
(2,1) 5 (2,2) 3 (2,3) 3 (2,4) 5 (2,5) 5 (2,6) \ 
(3,1) 5 (3,2)3 (3,3) 3 (3,4) 3 (3,5); (3,6) 
(4,1) 5 (4,2) 3 (4,3) 3 (4.4) 3 (4,5) 5 (4,6) 
(5,1) ; (5,2); (5,3); (5,4); €5,5 ); (5,6) 
(6,1) ; (6,2); (6,3) ; (6,4); (6,5 ); (6,6) 
where, for example, the ‘ordered pair (4, 5) refers to the elementary event that the 
green die shows 4 and and the red die shows 5. 
A = The event that the sum of the numbers shown by the two dice is odd. 
=((1,2);(2,1); 0,4); (2,3)3 (3,2); (4,1); (1,6); (2,5) 
(3,4); (4,3)365,2);5(6,1)5(3,6); (4,5); (5,4); (6,3) 
(5,6);(6,5)} ak Sia 
n(A)_ 
Por o 
B = The event that at least one face is 1, 
=((1,1) 5 (1,2); (1,3) 5 0,4); (1,5); 0,6) 
(2,1) ; (3,1); (4,1) 3; (5,1) ; (6, 1) ) and therefore 
_nB)_ il 
nae n(S) 36 
B = The event that each of the face obtained is not an ace. 
= {(2,2); (2,3); (2,4); (2,5); (2,6); (3,2); (3,3); 
(3,4); G5); 3,6); (4,2) 5 (4,3); (4,4) 5(4,5); 
(4,6); (5,2) 5 (5,3); (5,4) 3 .(5,5)5 (5,6); 6,2) ; 
(6,3); (6,4) ; (6,5); (6,6) } and therefore 
Ry - 2B) _ 25 
PO)= 1) 36 
Ac B = The event that sum is odd and at least one face is an ace. 


={(1,2)5(2,1)501,4)5¢4, - (1,6);(6,1)) 
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AUB ={(1,2);(2,1);(1,4);(2,3); (3,2) 3.04, 1); (1,6); (2,5) 
(3,4);3(4,3);(5,2);(,1); (3,6); (4,5); (5,4)3(6,3) 
(5,6);(6,5);(1,1);(1,3);(1,5); (3,1) 3€5,1)} 

_ A(AVB) . 23 
PAOD) (8). 36 
ANB =((2,3); (3,2); (2,5); (3,4); (3,6) 3(4,3)3 (4,5); (5, 2) 
(5,4);(5,6); ss (6, a 
| n(AQB)_12_ 1 
POON ES =367 3 
(b) (i) P(AUB)= P(AB)= 1- P(AnB)= 1- 4= 3 
(ii) PG AB)= P@OB)= 1- Pus) 1- 2 ‘3 


36 

(iii) P(AMB)= P(A)- PANB)=4- 2= Bat 
(iv) P (Ac\B)= P(B)- P(ANB)=U- 2-2 
(v) P AAB)= 1- P(ANB)= 1- 2=2 
(vi) P (A UB)= P(A)+ P(B)- P(ANB) 

—{,;-48|), U_ 5.2 

36 36 % = 3 

(vii) P(@UB)= 1- P(AuB)= 1- 2=2 
(viii) PIAN AUB) =P(ANA)UANB) 

=P@qB)= = [“ANA=06] 


(ix)P [AU (ANB) ]= P(A)+ P@NB)- ae 
= P(A)+ P(ANB)=4+ 2= 2 


36 % 
(x) Pal By= SS = ee $ 
PB A eT st ae 3 
(xi) PalB)= “Sae- mee 8 | 
P@\A)= a ws - 13 


Example 4-13. [ftwo dice are thrown, what is the probability that the sum is 
(a) greater than 8, and (b) neither 7 nor 11? 
Solution. (a) If S denotes the sum on the two dice, then we want P(S > 8). 
The required event can happen in the following mutually exclusive ways: 
(i) S=9 (@)S=10 (i#)S=11 (iv) S=12. 
Hence by addition theorem of probability 
P(S > 8) = P(S = 9) + P(S = 10) + P(S = 11) + P(S = 12) 
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in a throw of two dice, the sample space contains 6” =36 points. 
The number of favourable cases can be enumerated as follows: 
S=9 : (3,6), (6, 3), 4,5), (5,4), i.e., 4 sample points. 


P(S=9)= = 
S=10: (4,6) , (6,4), (5,5), : .e., 3 sample points. 

P (S= 10)= x 

S=11: (5,6), (6,5), i.e., 2 sample points. 
=41\a = 
P(S=1l)= 3 

S=12: (6,6), i.e., 1 sample point. 

P (S=12)= — 


ae eee eee 
P(S>8)= 6+ 36+ 36+ 367 367 18 


(b) Let A denote the event of getting the sum of 7 and B denote the event of 
getting the sum of 11 with a pair of dice. 
S=7 :(1,6), (6, 1), (2, 5), (5, 2), (3, 4), (4, 3), ie., 6 distinct sample 
points. 
P(A)= P(S=1)=2= 4 
36 «6 
24 
§ =11: (5,6), (6, 5), P (B) =P (S=11) “367 18 


‘. Required probability = P (AJ B)= 1-— P(AUB) 
= 1- [P(A)+ P(B)] 


(°. A and B are disjoint events ) 


6 18 9 . 

Example 4:14. An urn containe 4 tickets numbered I, 2, 3, 4 and another 
contains 6 tickets numbered 2, 4, 6, 7, 8, 9. If one of the two urns is chosen at 
random and a ticket is drawn at random from the chosen urn, find the probabilities 
that the ticket drawn bears the number (i) 2 or 4, (ii) 3, (iit) 1 or 9 

[Calicut Univ. B.Sc.,1992] 
Solution. (i) Required event can happen in the following mutually exclusive 
ways: 
(1) First um is chosen and then a ticket is drawn. 
(11) Second urn is chosen and then a ticket is drawn. 
__ Since the probability of choosing any um is ¥2, the required probability ‘p’ is 
8iven by 
p= P(1)+ P(Il) 
1,2 1.2 


ae re ee ee ee 
= 9% 479% 6 12 
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ss re ens ic: Mest | 
(ii) Requiréd probability = 7 x 4 +5 xXxO= 8 
(*: - the 2nd um there is no ticket with number 3) 


pee eee) eee 

(iii) Required probability = 9 ae 4 += > x= ae 

Example 4-15. A card is drawn from a well-shuffled pack of playing cards. 
What is the probability that it is either a spade or an ace ? 

Solution. The equiprobable sample space S of drawing a card from a well- 
shuffled pack of playing cards consists of 52 sample points. 

If A and B denote the events of drawing a ‘spade card’ and ‘an ace’ 
respectively then A consists of 13 sample points and B consists of 4 sample points 
so that, 


P (A) = 3 and P(B)= — 


The compound event A 1 B consists of only one sample point, viz., ace of 
spade so that, 


P(ANB)= = 


The probability that the card drawn is either a spade or an ace is given by 
P(A UB)= P(A)+ a is OB) 
13, 4 


52 - 52 a7 + 
Example 4-16. A box contains 6 red, 4 white and 5 black balls. A person draws 
4 balls from the box.at random. Find the probability that among the balls drawn 
there is at least one ball of each colour. (Nagpur Univ. B.Sc., 1992) 
Solution. The required event E that ‘in a draw of 4 balls from the box at 
random there is at least one ball of each colour’, can materialise in the 
following mutually disjoint ways : : 
(i) 1 Red, 1 White, 2 Black balls 
(ii) 2 Red, 1 White, 1 Black balls 
(iii) 1 Red, 2 White, 1 Black balls. 
Hence by the addition theorem of probability, the required probability is given 
by 
P(E)= P(i)+ P (ii) + P (ii) 
_ ax “Cx "Ca, °C; x “C, x "C) ‘ C. ¥ "Cs » fF 


=; =O _ "C, 
= ic st [6x4x 104 15x4x5+ 6xEx5} 
4 
——— 
= 15x14 x13 x12, [240+ 300+ 180 | 
24 x 720 


= T5xMxBxnD oP 
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Example 4-17. Why does it pay to bét consistently on seeing 6 at least once in 
4 throws of a die, but not on seeing a-double six at least once in 24 throws with two 
dice? (de Mere’s Problem). 
Solution. The probability of getting a ‘6.’ in a throw of die =1/6. 
‘, The probability of not getting a ‘6° in a throw of die 
=1-1/6 =S/6. 
By compound probability theorem, the probability that in.4 throws of a die no 
‘6’ is obtained = (5/6)* 
Hence the probability of obtaining ‘6’ at least once in 4 throws of a die 
= 1-(5/6) =0-516 
Now, if a trial consists of throwing two dice at a time, then the probability of 
getting a ‘double’ of ‘6’ in atrial = 1/36. 
Thus the probability of not getting a ‘double of 6 ’ in a trial = 35/36. 
The probability that in 24 throws, with two dice each, no ‘double of 6’ is 
obtained = (35/36) 
Hence the probability of getting a ‘double of -6’ at least once in 24 throws 
= 1-(35/36)” = 0-491. 
Since the probability in the first case 1s greater than the probability in the 
second case, the result follows. 
Example 4-18. A problem in Statistics is given to the three students AB and 
C whose chances of solving it are1/2,3/4, and 1/4 respectively. 
What is the probability that the proolem will be solved if all of them try 
independently? (Madurai Kamraj Univ, B.Sc.,1986; Delhi Univ. B.A.,1991] 
Solution. Let A, B, C denote the events that the problem i is solved by the 
students A, B,C ee mae 
l 
P(A)= ; ; P(B)= > 4 and P(C\= 4 
The problem will be solved if at least one of them solves the problem. Thus 
we have to calculate the probability of occurrence of at least one of the three events 
A,B,C,ie,,P(AUBUC). 
P(AUBUC)=P(A)+P(B)+P(C)- P(A NB)~P(ANC) 
-P(BONC)+P(ANBCC) 
= P(A)+P(B)+P(C)-P (A) P(B)-P(A) PC) 
—- P(B)P(C)+ P(A)P(B)P(C) 


(:..A,B,C are independent events. ) 


>|— pw 
Ni— plu 
> |— 
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Aliter. P(AUBUC)=1- P(AUBUC) 
=1-P(ANBNC) 
= 1—- P(A)P(B)P(C) 


aes 


32 
Example 4-19. If ANB = 4, then show that «..(*) 
P(A) < P(@) 
[Delhi Univ. B.Sc. (Maths Hons.) 1987} 
Solution. We have 
: A= (ANB) VU (ANB) 
=o U (ANB) [Using ¢ ] 
= ANB 
= AcB 
=" P(A) < P(B) 
as desired. . 


Aliter. Since A (B=, we have A CB, which implies that P (A) < P (@). 
Example 4-20. Let A and B be two events such that 


_3 _2 
| P (A)= 4 and P (B)= 8 
show that 
(a) P(AUB) 2 = 


00 | in 


(b) =< P(A AB)< 
[Delhi Univ. B.Sc. Stat (Hons.) 1986,1988] 


Solution. (i) We have 


A c (AUB) 
=> P(A) < P(AUB) 
=> 2< P (AUB) 
=> P(A UB) 2 3 
(ii) ANB CB 
= P(AQB) < P(B)= 2 ...(i) 
Also P(AWB)= P(A)+ P(B)— P(AQNB)S§ 1 


=> a4 a < P(ANB) 
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=> — 3 < P(ANB) 
2 3 <P (ANB) ii) 
From (i) and (ti) we get 

2< <P (ANB) < 


Example 4-21. (Chebychev’ s Problem). ni is the chance that two numbers, 
chosen at random, will be prime to each other ? 

Solution. If any number ‘a’ is divided by a prime number ‘7’ , then the possible 
remainders are 0, 1, 2, ...r-1. Hence the chance that ‘a’ is-divisible by 7 is 1/r 
(because the only case favourable to this is remainder being 0). Similarly, the 
probability that any number ‘b’ chosen at random is divisible by 7 is 1/r. Since 
the numbers a and 6 are chosen at random, the probability that none of them is 
divisible by ‘r’ is given (by compound probability theorem) by : 


{1-4} x (1-+}= (1-4); F=2.355, 1.2: 
4 | 4 rv 


Hence the required probability that the two numbers chosen at random are 
prime to each other is given by 


P= TI {1- Ly, where r is a prime number. 
rv 


= = (From trigonometry) 

Example 4-22. A bag contains 10 gold and 8 silver coins. Two successive 
drawings of 4 coins are made such that : (i) coins are replaced before the second 
trial, (ii) the coins are not replaced before the second trial. Find the probability 
that the first drawing will give 4 gold and the second 4 silver coins. 

{Allahabad Univ. B.Sc., 1987] 

Solution. Let A denote the event of drawing 4 gold coins in the first draw and 
B denote the event of drawing 4 silver coins in the second draw. Then we have to 
find the probability of P(A CB ). 

(i) Draws with replacement. If the coins drawn in re first draw are replaced 
back in the bag before the second draw then the events A and B are indépendent 
and the required probability is given (using the multiplication rule of probability) 
by the expression 

P(A B)=P (A). P(B) (*) 

Ist draw. Four coins can be drawn out of 10+8=18 coins in ‘°C, ways, which 
gives the exhaustive number of cases. In order that all these coins are of gold, they 
Must be drawn out of the 10 gold coins and this can be done in °C, ways. Hence 


P(A)= °C, / ¥Cy 
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2nd draw. When the coins drawn in the first draw are replaced before the 2nd 
draw, the bag contains 18 coins. The probability of drawing 4 silver coins in the 
2nd draw is given by P (B)= °C, / *C,. 

Substituting in (*), we have 


"C, Bi 
P(ANB)= TG, x Tie. 

(ii) Draws without replacement. If the coins drawn are not eplaced back before 
the second draw, then the events A and B are not independent and the required 
probability is given by , 

P (AQB)=P (A). P(B |.A) wa(*4) 

As discussed in parti), P(A)= °C, / Cs. 

Now, if the 4 gold coins which were drawn in the first draw are not replaced 
back, there are 18 — 4=14 coins left in the bag and P (B | A) is the probability of 
drawing 4 silver coins from the bag containing 14 coins out of which 6 are gold 
coins and 8 are silver coins. 

Hence P(B|A)= °C, / “C, 

Substituting in (**) we get 

‘ a OF ‘ °C, 
(ANB)=.- te, * TG, 

Example 4-23. A consignment of 15 record players contains 4 defectives. The 
record players are selected at random, one by one, and examined. Those examined 
are not put back. What is the probability that the 9th one examined is the last 
defective? 

Solution, Let A be the event of getting exactly 3 defectives in-examination 
of 8 record players and let B the event that the 9th piece examined is a defective 
one. 

Since it is a problem of sampling without replacement and since there are 4 
defectives out of 15 record players, we have 


3) U5 
PA)= a 
8 


P (B | A) = Probability that the 9th examined record player is defective given that 
there were 3 defectives in the first 8 pieces examined. 
=1/7, 
since there is only one defective piece left among the remaining 15 - 8 = 7 record 
players. 
Hence the required probability is 


_P(ANB)=P(A).P(B| A) 
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1 8 
15 7 195 
(3 
Example 4-24. p is the probability that a man aged x years will die ina year. 
Find the probability that out of nmen A,, A2,...,A, each aged x , A, will die ina 
year and will be the first to die. [Delhi Univ. B.Sc., 1985] 
Solution. Let E;, (= 1,2, ...,) denote the event that A; dies in a year. Then 
P (E) =p, G=1,2,...,n) and P(E;)=1—-p. 
The probability that none of mn men Aj, A2, ...,A, dies in a year 
=P (Ei AEN nae OE.) =P (Ei) P (E>) ak (E,) 
(By compound probability theorem) 
=(1-p)’ 
‘. The probability that at least one of Ai, Az, ..., A,, dies in a year 
= 1-P(LAINE1N...N Ed) = 1-(1l-py 
The probability that among n men, A, is the first to die is 1/n and since this 
event is independent of the event that at least one man dies in a year, required 
probability is 


a [I~ - oF] 


Example 4-25.The odds against ManagerX settling the wage dispute with the 
workers are 8:6 and odds in favour of manager Y settling the same dispute are 
14:16. 

(i) What is the chance that neither settles the dispute, if they both try, 
independently of each other? 

(ii) What is the probability that the dispute will be settled? 

Solution. Let A be the event that the manager X will settle the dispute and B 
be the event that the pone Y will settle the dispute. Then clearly 


heat os 4 > P(A)=1- P@)=2=2 


7 7 
es a oe _._ 16 _ 8 
P(B)= tap > P(B)=1- P(B)= Wea 


The required probability that neither ee the dispute is given by : 


P(ANB)= P(A) x P(B)= 5 x SoH 


[Since A and B are independent > A and B are also independent] 
(ii) The dispute will be settled if at least one of the managers X and Y settles 
the dispute. Hence the required probability is given by’ 
P (AUB) = Prob. [ At least one of X and Y settles the dispute] 
=1 — Prob. [ None settles oS dispute] 


- 73 
=1- P(ANB)=1- = a 
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Example 4-26. The odds that person X speaks the truth are'3:2 and the odds 
that person Y speaks the truth are 5:3. In what percentage of cases are they likely 
to contradict each other on an-identical point. 

Solution. Let us define the events: 

A :.X speaks the truth, B : Y speaks.the truth 

Then A and B represent the complementary events that X and ¥ tell a lie 

respectively. We are given: 


3__ 3 3_2 

PAH s,975 7 PA) I-g=5 

. __»__3 _~;-2-3 

and P (8)= 47g = P(B)=1 37 8 
The event E that X and Y contradict each other on an identical point can 


happen in the following mutually exclusive ways: 
(i) X speaks the truth and ¥ tells a lie, i.e., the event AB happens, 
(ii) X tells a lie and Y speaks the truth, i.e., the event A 7 B happens. 
Hence by addition theorem of probability the required probability is given by: 


P (E)= P ()+ P(ii)= P(AQB)+ P(ANB) 


= P(A). P(B)+ P(A) P(B), 
{Since A and B are independent } 
DUO ck ag ke 
er x ets Xoo 407 = 0-475 
Hence A and B are likely to contradict each other on an identical point in 
A7-5% of the cases. 
Example 4-27. A special dice is prepared such that the probabilities of 
throwing 1, 2,3,4,5 and 6 points are : 


L-k 142k 1-k, 1+k 1-2 1+k 


6° 6 6? 6 6° 6 
respectively. If two such dice are thrown, find the probability of getting a sum 
equal to 9. (Delhi Univ. B.Sc. (Stat. Hons.), 1988] 


Solution. Let (x, y) denote the numbers obtained in a thrown of two dice, x 
denoting the number on the first dice and_y denoting the number on the second dice. 
The sum S$ = x+y =9, can be obtained in the following mutually disjoint ways: 

(i) (3,6), (ii) (6, 3), (itt) (4, 5), (iv) G, 4) 

Hence by addition theorem of probability: 

P(S =9) = P(3, 6) + P (6, 3) + P(4, 5) + P(S, 4) 
=P (x=3) P(y=6)+ P (x=6)P (y=3) +P (x=4) Ply=5) 
+ P (x=5)P (y=4), 
since the number on one dice is independent of the number on the other dice. 
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= piesoys C8 = (+) , +h (1-8 , (t+ (1-26 


6 6 °&6 6 ° °&#&«»6 
r (1—2k) (1+) 
6 °&»&#8&26 


L+k 
= 2 (te | [(1-—k) + (1-24) | 


ath) (2 — 34) 


{| 


Example 4:28. (a) A and B alternately cut a pack of cards and ‘the pack is 
shuffled after each cut. If A starts and the game is continued until one cits a 
diamond, what are the respective chances of A and B first cutting a diamond? 

(b) One shot is fired from each of the three guns. E\, E2, E; .denote the events 
that the target is hit by the first, second and third gun respectively. If 
P(E, = 05, P(Ex)= 06 and P(E3)= 08 and, E\, E2, Ey are independent 
events, find the probability that (a) exactly one hit is registered, (6) at least two 
hits are registered. 

Solution. (a) Let £, and E2, denote the events of A and 8 cuttifig a diamond 
respectively. Then 

P(E)= P()= 2=+ = PE)= PE)=3 

If A starts the game, he can first cut the diamond in the following mutually 
exclusive ways: : 

(i) E, happens, (ii) E, OE, E, happens, (iii) E)] AE, NE, OE, OE; 
happens, and so on. dence by addition theorem of probability, the probability ‘p’ 
that A first wins is given by 

p= P(i)+ P(ii)+ P (@ii)+...... 

= P(E\)+ P(E, NE,NE,)+ P(E, NE2N BE, NE, NE}) +... 
P (E\) + P(E,) P (E,) P(E.) + P (E\)P (E2) P (E;) P (Ez) P (E,) + ..- 
(By Compound Probability Theorem; 


1 3 3 1.3 3 3 3 ~=41 
A 4° a° a 44° a? aa sieneee 
I 
ee Sees, 
tee 7 
16 
The probability that B first cuts a diamond 
apne Se 
7 #7 
(b) We are given 


P(E,)= 0-5, P(E) = 0-4 and P(E3)= 0-2 
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(a) Exactly one hitcan be registered in the following mutually exclusive ways: 
(i) Ey 0 Ey O Ey happens, (ii) E, A E,O Ey happens, (iii) E|] AE, Es; happens. 
Hence by addition probability theorem, the required probability ‘p’ is given 
by: ne ees an 
: P=P(E:NE,NE;)+ P(E, N E21 E;) +P (Ey OE, NE) 
= P(E,) P(E.) P(E;) + P(E,) P(E2) P(E;) + P(E,) P(E,) P(Es) 
(Since.£), £2 and E; are independent) 
= 0-5 x0-4x0:2+0-5 x0-6x0-2+0-5 x0-4«0-8= 0-26. 
(6) Atleast two hits can be registered in the following mutually exclusive ways; 
(i) Ey Q Ex VE; happens (ii) E,) A Ey E; happens, (iii) Ey CE, Ey hap.- 
pens. (iv) E, (E20 £E; happens. 
Required probability 
= P(E, OV E20 Ey) + P(E, O E, 0 E3) + P(E; 0 E20 Es) + P(E, E20 E3) 
= 0-5 x 0-6 x 0-2+0-5 x 0-4 x 0-8 + 0-5 x 0-6 x 0:8 + 0-5 x 0-6 x 0-8 
= 0-06 + 0:16 + 0-24 + 0-24 = 0-70 


we 


Example 4:29. Three groups of children contain respectively 3 girls and 1 
boy, 2 girls and 2 boys, and I girl and 3 boys. One child is selected at random from 
each group. Show that the chance that the three selected consist of 1 girl and 2 
boys is 13/32. [Madurai Univ. B.Sc.,1988; Nagpur Univ. B.Sc.,1991| 


Solution. The required event of getting 1 girl and 2’boys among the three 
selected children can materialise in the following three mutually disjoint cases: 


Group No. > I II Ill 
(i) Girl Boy Boy 
(ii) Boy Girl Boy 
(iti) Boy Boy Girl 
Hence by addition theorem of probability, 
Required probability = P (i) + P (ii) + P (iii) ...(*) 


Since the probability of selecting a girl from the first group is 3/4, of selecting 
a boy from the second is 2/4, and of selecting a boy from the third group is 3/4, and 
since these three events of selecting children from three groups are independent of 
each other, by compound probability theorem, we have 


Sr ee eee 

Sg eig Og = ap 
Similarly, we have 

ee ee at ee 

eo a toa ace amy) 

a ee ee ee 0 

and Pili) = 4 X4*% | = 39 


Theory of Probability 4:55 
Substituting in ( *), we get 
3 l 13 


Required probability = s + 39 + 39 = 30 


EXERCISE 4 (b) 
1. (a) Which function defines a probability space on S = (1, é2, é3) 


() Pled=F Pead= 5, P= 5 
(ii) P (ei)= ‘ P (é2) = 7 P (e3) = : 


(iv) P(a)=0, P(e)= 5 , P(e3)= 


Ans. (i) No, (ii) No, (iii) a and (iv) - 
(b) LetS= (e:, é2, €3, ea) , Bs iki ama functionon § . 


(i) Find P(e,), if P(e) =3 P(e) =<, Pe) = 
(ii) Find P(e) and P(e) if resent and P(e;) = 2P(e;), and 


(iii) ‘Find a if Pl, €3)) = _ €4)] = + and Ped=5. 


Onl— 


Ans. () Pley= , (ii) P (a)= 5 ee i () =< — and (iit) P (e,)= 


2. (a) With ieial cade prove a 
P(AUB)= P(A)+ P(B)-P(AQB). 

Deduce a similar result for P (A U B UC), where C is one more event. 

(b) For any evert : E;, r (E;) = pi, (i = l, 2, 3) ; P (E; OY E}) = Pi, (i,j = l, Z. 3) 
and P (E, ( E20 Es) = pia, find the probability that of the three events, (i) at least 
one, and (ii) exactly one happens. 

(c) Discuss briefly the axiomatic approach to probability, illustrating by 
examples how it meets the deficiencies of the classical approach. 

(d) If A and B are any two events, state the results giving 

(i) P(A UB) and (ii)P (A OB). 


Find 


A and B are mutually exclusive events and P (A)=— =P (B)=3 : 


P (AUS) and P(A QB). | 


2 ‘n 
3. Let S= fr, 4 2° uty oink be a classical event space and A, B 


be events given-by 
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k 
A=f1, 5| ; = {(3) | k is an even positive imeger 


Find P (ANB) [Calcutta Univ. B.Sc. ( Stat Hons. ), 1986] 
4, What is a ‘probability space’? State (i) the ‘law of total probability’ and 
(ii) Boole’s inequality for events not necessarily mutually exclusive. 
5. (a) Explain the following with examples: 
(i) random experiment, (ii) an event, (iii) an event space. State the axioms of 
t Obability and explain their frequency interpretations. 
A man forgets the last digit of a telephone number, and dials the last digit at 
-¢ndom. What is the probability of calling no more than three wrong numbers? 
(b) Define conditional probability and give its frequency interpretation. Show 
that conditional probabilities satisfy the axioms of probability. 
6. Prove the following laws, in each case assuming the conditional prob- 
abilities being defined. 
(a) P(E\E)=1, (6) P(o\F)=0 
(c) If Ey CE, then E(E; | F)<P(£.|F) 
(4) P(E|F)=1- P(EI|F) 
(ec) P(E\UE.| F)=P(E,|F)+ P(E,|F) - P(P(,NE2| F) 
‘f) If P(F)=1 thenP(E | F)= P(E) 
(g) P(E- F)= P(E)- P(ENF) 
(h) If P(F)>0,andE and F are mutully exclusive then P (E | F)=0 
(i) If P(E | F)=P(E), then P(E | F)=P(E) andP(E| F)=P (E) 
7. (a) If P (A)= a, P (B)= 5b, then prove that P(ANB) > 1-a-—b. 
(b) If P(A)=a, P (B)=B, then prove that P(A|B) = (a+B-1)/B. 
Hint. In each case use P (A UB) <1 
8. Prove or disprove: 
(a) (i) If P (A| B) = P (A), then P (B| A) =P (B) 
(ii) lf P (A) = P (8), then A =B. 
[Delhi Univ. B.Sc. (Maths Hons.), 1988] 
(b) If P (A)=0, thenA =6 
[Delhi Univ. B.Sc. (Maths Hons.), 1990] 
Ans. Wrong. 
(c) For possibie events A, B, C, 
(i) If P(A)>P(B), thenP (A|C)>P(B|C) 
(ii) IfP(AJ|C)2>P(B|C) and P(A|C)2P (BIC), 
then P (A) > P (B). {Delhi Univ. B.Sc.(Maths Hons), 1989] 
(d) \f P (A)=0, then P(ANB)=0. 
[Delhi Univ. B.Sc. (Maths Hons.), 1986] 
(e) (i) lf P(A) =P (B)=p, then P(ANB)Sp’ 
(ii) If P (B |A) =P (B|A), then A and B are independent. 
[Delhi Univ. B.Sc. (Maths Hons.), 1990] 
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(f) If P (A) >0, P (B)>0 and P (A|B)=f (BA), 
then P (A)=P (B). 
9, (a) LetA and B be two events, neither of which has probability zero. Then 
if A and B arc disjoint, A and B are independent. 
[Delhi Univ. B.Sc.(Stat. Hons.), 1986] 
(b) Under what conditions does the following equality hold? 
P (A)=P (A|B)+P(A[B) 
[Punjab Uniy. B.Sc. (Maths Hons.), 1992] 
Ans. B=S or B =S 
10. (a) If A and B are two events and the. probability P (B) # 1, prove that 
_ [P(ARP(ANB)] 
P(A | B)= [1 -P(B)] 
where B denotes the event complementary to B and hence deduce that 
P(AQB)2 P(A)+ P(8)- 1 
{Delhi Univ. B.Sc. (Stat. Hons.), 1989] 
Also show that P (A)> or < P(A | B) according as 
P(A|B)> or < P(A). 
[ Sri Venkat. Univ. B.Sc. 1992 ; Karnatak Univ. B.Sc.1991] 


Hint. (i) 
ds P(AQB) = [P(A)-P (ANB) ] 

P(AIB)= —p Ry [1— P(B)] 
(ii) Since P(A | B)<1, P(ARP(ANB)S1- P(B) 
=~ P(A)+P(B)-1 < P(ATB) 
iy P(AIB)_ P(BIA)_ 1-P@1A) 

P (A) P(B) 1—- P(B) 

Now  P(A|B)>P(A) if {1- P(B| A)}> {1- P(e)} 
i.e., if P(B|A)<P(B) 
ae P(B|A) 
i.e., if Pp (By B) <1 
i.e., if PATE) i.e, if P(A)>P{A | B) 


(6) If A and B are two mutually exclusive events show that 

P(A | B)=P(A)/(1- P@)) 

[Delhi Univ. B.Sc. ( Stat. Hons.), 1987] 
(c) If A and B are two mutually exclusive events and P (A U B) #0, then 
P(A 

P(A| AUB)= P (A)+ P(B) {Guahati Univ. B.Sc. 1991] 
(d) If A and B are two independent events show that 

P(AUB)= 1-P(A)P(B) 


458 Fundamentals of Mathematical Statistics 


(e) If A denotes the non—occurrence of A, tnen prove that 
P (A, U Az U As) = 1 - P(A,) P (Az | A) P (A3 | Ay A A2) 
(Agra Univ. B.Sc., 1987] 
11. If A, B and C are three arbitrary events and 
=P(A)+P(B)+P(C) 
S2=P(AQNB)+P(BNC)+P(CANA) 
S3=P(ANBNC). 
Prove that the probability that exactly one of the three events occurs is given 
by §,-—2 $82+3 Ss. 
12. (a) For the hs Ai, Aa, ..., Ax aSSuUMINg 
n 
P ( = Ai)s Z ee) prove that 
=] = 


iz 


(i) P( A A) 21- : P (A;) and that 


i=1 i=l 


(ii) PC A A ADE E PCA) (a1) 
[Sardar Patel Univ. B.Sc. Nov.1992] 
(b) Let A, B and C denote events. If P(A | C)>P(B | C) and 
P(A |G)2>P (BIC), then show that P (A) > P (B). 
[Calcutta Univ. B.Sc. (Maths Hons.), 1992} 
13. (a) If A and B are independent events defined on a given probability 
space (Q,A_ ,P (.)), then prove that A and B are independent, A and B are 
. independent. [Delhi Univ: B.Sc. (Maths Hons.), 1988] 
(b) A, B and C aré three events such that A and B are independent, P (C) = 0. 
Show that A, B and C are independent. 
(c) An event A is known to be independent of the events B,B UC and 
BC. Show that it is also independent of C. [Nagpur Univ. B.Sc.1992] 
(d) Show that if an event C is independent of two mutually exclusive events 
A andB, then C is also independent of A U B. 
(e) The outcome of an experiment is equally likely to be one of the four points 
in three—dimensional space with rectangular coordinates (1, 0, 0), (0, 1,0), 
(0,0, 1) and (1, 1,1). Let E, F andG be the events : x-coordinate=1, y-coor- 
dinaté=1 and z—coordinate=1; , respectively. Check if the events E, F and G are 
independent. (Calcutta Univ. B.Sc., 1988) 
14. Explaifi what is meant by “Probability Space”. You fire at a target with 
each of the three guns; A, B and C denote respectively the event — hit the target 
vith the first, second and third gun. Assuming that the events are independent and 
have probabilities P (A) =a, P(B)=6 and P (C)=c, express in terms of A, B 
and C the following events: 
(i) You will not hit the target at all. 
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(ii) You will hit the target at least twice. Find also the probabilitics of these 
events. [Sardar Patel Univ. B.Sc., 1990) 

15. (a) Suppose A and B are any two events and that P (A) = p;, P (B) = pp 
and P {AQ B) =p 3. Show that the formula of each of the following 
probabilities in terms of p;, Pz and p3 can be expressed as follows : 


(i) P(A U B)=1-py (ii) P(A A B)=1-p,-p2 +p, 
(iii) P(AN B)=pi-p3 (iv) P(A A B)=p2—p3 
(vy) P(A QNB)=1-p, (vi) P(A UB)=1-p, +p; 


(vii) P(A U B)=1-p,—p)+p3 (viii) P[A A(AUB)] =po-p, 
(ix) PIAU(A OB) =p; +p2.-py 
(x) P(AIB) = and P(BIA)=@ 

P2 Pi 


-—Pi-P2r2+t+ Ps 


eee Faaae, ] _ _ rs 
(xt) P(A |B)= ] _' ~ Pi P2t P3 
— P2 


— l 
a = 
[Allahabad Univ. B.Sc. (Stat.), 1991] 
(b) If P (A)= 1/3, P (B) = 3/4 and P (A U B) = 11/12, find 
P(A\B) and P (BIA). 
(c) Let P (A) =p, P (A! B) =g, P(B| A) =r. Find the relation between the 


numbers p, q and r such that A and B are mutually exclusive. 
[Delhi Univ. B.Sc. (Maths Hons.), 1985] 
Hint. P (AB)=P(A)P(BlIA)=P(B).P(AIB) 
= P (AB) =pr=P(B).q => P(B)=prlq 
If A and B are mutually disjoint, then P (A 0B) =0. 
=> 1-P (AUB)=0=>1-[p + (pr/q) - pr) =0 
16. (a) In terms of probabilities, p, = P (A), pp = P (B) and p, = P (AN B); 


Express (i) P (A U B), (it) P (A |B), (iii) P (A. Cc) B) under the condition that 
(‘) A and B are mutually exclusive, (ii) A and B are mutually independent. 

(b) Let A and B be the possible outcomes of an experiment and suppose 

P (A) =0-4, P(AU B) =0-7 and P (B) =p 
(i) For what choice of p are A and B mutually exclusive ? 
(ii) For what choice of p are A and B independent ? 
[Aligarh Univ. B.Sc., 1988 ; Guwahati Univ. B.Sc., 1991] 

Ans. (i) Q-3, (ii) 0-5 - 

(c) Let Aj, Az, A3, Aq be four independent events for which P (A,) =p, 
P (A) = q, P (A3) = rand P (Aq) =s. Find the probability that 

(7) at least one of the events occurs, (ti) exactly two of the events occur, and 
(ii?) at most three of the events occur. (Civil: Services (Main), 1985] 

17. (a) Two six-faced unbiased dice are thrown. Find the probability that the 
sum of the numbers shown ts 7 or their product is 12. 

Ans, 2/9 
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(b) Defects are classifed as A, B or C, and the following probabilities have been 
determined from available production data : 

P (A) = 0-20, P (B) = 0-16, P (C) = 0-14, P (A 7 B) = 0-08, P (ANC) =0.05, 
P(BNC)=0-04, and P(ANBNC)=0.02. 

What is the probability that a randomly selected item of product will exhibit 
at- least one type of defect ? What is the probability that it exhibits both A and B 
defects but is free from type C defect ? [Bombay Univ. B.Sc., 1991] 

(c) A language class has only three students A, B, C and they independently 
attend the class. The probabilities of attendance of A, B and C on any given day are 
1/2, 2/3 and 3/4 respectively. Find the probability that the total number of 
attendances in two consecutive days is exactly three. 

{Lucknow Univ. B.Sc. 1990; Calcutta Univ. B.Sc.(Maths Hons.), 1986] 

18. (a) Cards are drawn one by one from a full deck. What is the probability 
that exactly 10 cards will precede the first ace. [Delhi Univ. B.Sc.,1988) 
4847, 46 39) 4 164 
ny a) Ge | I 42 4165 

(b) Lach of two persons tosses three fair coins. What is the probability that 
they obtain the same number of heads. 


1y (3). (3Y. (1yY. 5 
Ans. | + 5 | + | + | = 16° 
19, (a) Given that A, B and C are mutually exclusive events, explain why each 
of the following is not a permissible assignment of probabilities. 
(i) P(A)=0-24, P(B)=04 and P(AUC)=0-2, 
(ii) P(A)=0-7, P(B)=01 and P(nC=03 
(iit) P(A)=06, P(ANB)=0°5 
(b) Prove that for n arbitrary independent events Aj, Ao, ..., As 
P (A, U Az UA3U... UAx) + P (A) P (A2) ... P (A,) = 1. 
(c) Aj, A2, .... Aq aren independent events with 


PAS 1==, 12 1c: 
a 


Ans, 


Find the value of P (A; UA2UA3U... UA,). (Nagpur Univ. B.Sc., 1987) 


l 
(d) Suppose the events A), Ao, ..., A, are independent and that 
P (A) = ; + i for 1<i<n. Find the probability that none of the n events 


occurs, justiying each step in your calculations. 

Ans. if(n+1) 

20. (a) A denotes getting a heart card, B denotes getting a face card (King, 
Queem or Jack), A and B denote the complementary events. A card is drawn at 
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random from a full deck. Compute the following probabilities. 

(i)P (A), (ii)P(AQB), (iii)P(AUB), (iv) P(ANB), 

(vy) P(A UB). 

Assume natural assignment of probabilities. 

Ans, (i) 1/4, (ii) 5/26; (iti) 11/26, (iv) 3/5, (v) 21/26. 

(b) A town has two doctors X and Y operating independently. If the probability 
that doctor X is available is 0-9 and that for Y is 0-8, what is the probability that 
at least one doctor is available when needed? [Gorakhpur Univ. B.Sc., 1988] 

Ans. 0-98 

21. (a) The odds that a book will. be favourably reviewed by 3 independent 
critics are 5 to 2, 4 to 3 and 3:to 4 respectively. What is the probability that, of the 
three reviews, a majority will be favourable? [Gauhati Univ. BSc., 1987] 

Ans. 209/343. 

(b) A, B and C are independent witnesses of an event which is known to have 
occurred. A speaks the truth three times out of four, B four times out of five and C 
five times out of six. What is the probability that the occurrence will be reported 
truthfully by majority of three witnesses? 

Ans. 31/60. 

(c) A man seeks advice regarding one of two possible courses of action.from 
three advisers who arrived at their recommendations independently. He follows the 
recommendation of the majority. The probability that the individual advisers are 
wrong are 0-1, 0-05 and 0:05 respectively. What is the probability that the man 
takes incorrect advise ? (Gujarat Univ. B.Sc., 1987] 

22. (a) The odds against a certain event are 5 to 2 and odds in favour of another 
(independent) event are 6 to 5. Find the chance that at least one of the events will 
happen. (Madras Univ. B.Sc.,1987) 

Ans, 52/77. 

(b) A person takes four tests in succession. The probability of his passing the 
first test is p, that of his passing each succeeding test is p or p/2 according as he 
passes or fails the preceding one. He qualifies provided he passes at least three tests. 
What is his chance of qualifying. [Gauhati Univ. B.Sc. (Hons.) 1988] 

23. (a) The probability that a 50-years old man will be alive at 60 is 0-83 and 
the probability that a 45-years old woman will be alive at 55 is 0-87. What is the 
probability that a man who 1s 50 and his wife who is 45 will both be alive 10 years 
hence? 

Ans. 0-7221. 

(b) It is 8:5 against a husband who 1s 55 years old living till he is 75 and 4:3 
against his wife who is now 48, living till she is 68. Find the probability that (i) the 
couple will be alive 20 years hence, and (ii) at least one of them will be alive 20 
years hence. 

Ans (i) 1591, (ii) 59/91. 
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(c) A husband and wife appear in an interview for two vacancies in the same 
post. The probability of husband’s selection is 1/7 and: that of wife’s selection is 
1/5. What is the probability that only one of them will be selected ? 

Ans, 2/7 (Dethi Univ. B.Sc.,.1986] 

24. (a) The chances of winning of.two race-horses are 1/3 and 1/6 respective- 
ly. What is the probability that at least one will win when the horses are running 
(a) in different races, and (b) in the same race? 

Ans. (a) 8/18 (6)1/2 

(6) A problem in statistics is given to three students whose chances of solving 
itare 1/2,1/3 and 1/4. What is the probability that the problem will be solved? 

Ans. 3/4 (Meerut Univ. B.Sc., 1990] 

25. (a) Ten pairs.of shoes are in a closet. Four shoes are selected at random. 
Find the probability that there will be at least one paw among the four shoes 
selected? 

a OF a yy 
aC; 

(b) From 100 tickets numbered 1, 2, ... , 100 four are drawn at random. What 
is the probability that 3 of them will bear number from 1 to 20 and the fourth will 
bear any number from 21 to 100? 

af OF x “Cc, 
Ans, 4 ———— 


100 
Cy 


26. Asix faced die is so biased that it is twice as likely to show an even number 
as an odd rumber when thrown. It is thrown twice. What is the probability that the 
sum of the two numbers thrown is odd? 

Ans, 4/99 

27. From a group of 8 children, 5 boys and 3 girls, three children are selected 
at random. Calculate the probabilities that selected group contains (i) no girl, 
(ii) only cne girl, (iii) one particular girl, (iv) at least one girl, and (v) more girls 
than boys. 

Ans. (i) 5/28, (ii) 15/28, (iii) 5/28, (iv) 23/28, (v) 2/7. 

28. If, three persons, selected at random, are stopped on a street, what are the 
probabilities that : 

(a) all were born on a Friday; 
(6) two were bom ona Friday and the other on a Tuesday; 
(c) none was bom ona Monday. 

Ans. (a) 1/343, (6) 3/343, (c) 216/343. 


29. (a) A and B toss a coin alternately on the understanding that the first who 
obtains the head wins. If A starts, show that their respective chances of winning are 
2/3 and 1/3. 

(b) A, B and C, in order, toss a coin. The first one who throws a head wins. If 
A Starts, find their respectivre chances of winning. (Assume that the game may 


Ans. 1 
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continue indcfinitely.) 

Ans. 4/7, 2/7, 1/7. 

(c) Aman alternately tosses a coin and throws a die, beginning with coin. What 
is the probability that he will get a nead before he gets a ‘5 or 6’ on die? 

Ans. 3/4. 

30. (a) Two ordinary six-sided dice are tossed. 

(i) Whiat is the probability that both the dice show the number 5. 
(ii) What is the probability that both the dice show the same number. 

(titi) Given that the sum of two numbers shown is 8, find the conditional 
probability that the number noted on the first dice is larger than the number noted 
on the second dice. 

(6) Six dice are thrown simultaneously. What is.the probability that all will 
show different faces? 

31. (a) A bag contains 10 balls, two of which are red, three blue and five black. 
Three balls are drawn at random from the bag, that is every ball has an equal chance 
of being included in the three. What is the probability that 

(i) the three balls are of different colours, 
(ii) two balls are of the same colour, and 

(iii) the balls are all of the same colour? 

Ans. (i) 30/120, (ii) 79/120, (iit) 11/120. 

(b) A is one of six horses entered for a race and is to be ridden by one of the 
two jockeys B and C. It is 2 to | that B rides A, in which case all the horses are 
equally likely to win, with rider C, A’s chance is trebled. 

(i) Find the probability that A wins. 
(ii) What are odds against A’s winning? 
{Shivaji Univ. B.Sc. (Stat. Hons.), 1992] 

Hint. Probability of A’s winning | 

=P (Brides A and A ‘wins )+P(C ndes A and A wins) 
ee ee 
3 6 3 6 18 
Probability of A’s losing= I — 5/18 = 13/18. 
Hence odds against A’s winning are: 13/18: 5/18, i.e., 13:5. 


32. (a) Two-third of the students in a class are boys and the rest girls. It is 
known that the probability of a girl getting a first class is 0-25 and that of boy getting 
a first class is 0-28. Find the probability that a student chosen at random will get 
first class marks in the subject. 

Ans, 0-27 

(6b) You need four eggs to make omelettes for breakfast. You find a dozen eggs 
in the refrigerator but do not realise that two of these are rotten. What is the 
probability that of the four eggs you choose at random 

(i) none is rotten, 
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(ii) exactly one is rotten? 

Ans. (i) 625/1296 ¢ (ii) 3500/1296. 

(c) The probability of occurrence of an event A is 0:7, the probability of 
non-occurrence of another event B is 0-5 and that of at least one of A or B not 
occurring is 0-6. Find the probability that at least one of A or B occurs. 

[Mysore, Univ. B.Sc., 1991] 

33. (a) The odds against A solving a certain problem are 4 to 3 and odds in 
favour of B solving the same problem are 7 to 5. What is the probability that the 
problem is solved if they both try independently? [Gujarat Univ.B.Sc., 1987] 

Ans. 16/21 

(b) Acertain drug manufactured by acompany is tested chemically for its toxic 
nature. Let the event ‘the drug is toxic’ be denoted by E and the event ‘the 
chemical test reveals that the drug is toxic’ be denoted by F. Let P(E) =9, 
P (F | E)= P(F | E)= 1-6. Then show that probability that the drug is not toxic 
given that the chemical test reveals that it is toxic is free from @ . 

Ans. 1/2 (M.S. Baroda Univ. B.Sc., 1992] 

34, A bag contains 6 white and 9 black bails. Four balls are drawn at a time. 
Find the probability for the first draw to give 4 white and the second draw to give 
4 black balls in each of the following cases : 

(i) The balls are replaced before the second draw. 
(ii) The balls are not replaced beforé thé second draw. 
[Jammu Univ. B.Sc., 1992] 
6 9 6 9 
Ans. (i) a x = (it) a x 2 
4 4 4 

35. The chances that doctor A wiil diagnose a disease X correctly is 60%. The 
chances that a patient will die by his treatment after correct diagnosis is 40% and 
the chance of death by wrong diagnosis is 70%. A patient of doctor A, who had 
disease X, died. What is the chance that his disease was diagnosed correctly? 

Hint. Let us define the following events: 

E, : Disease X is diagnosed correcily by doctor A. 
E.: A patient (of doctor A) who has disease X dies. 


Then we are given : 


P (E\)= 06 => P(E)= 1-06=04 
and P(E,|Eij)=04 and P(E,|E,)=0-7 
P(E, N E2) P(E, TN E2) 6 


We want P(E; | E2) = P(E) | P(E, NE) +P(Q,AE) 13 

36. The probability that ai least 2 of 3 people A, B and C will survive for 10 
years is 247/315. The probability that A alone will survive for 10 years is 4/105 and 
the probability that C alone will die within 10 years is 2/21. Assuming that the 
events of the survival of A, B and C can be regarded as independent, calculate the 
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probability of surviving 10 years for each person. 

Ans. 3/5, 5/7, 7/9. 

37. A and B throw alternately a pair of unbiased dice, A beginning. A wins if 
he throws 7 before B throws 6, and 8 wins if he throws 6 before A throws 7. if A 
and B respecuvely denote the events that A wins and B wins the series, and a and 
b respectively denote the events that it is A’s and B’s turn to throw the dice, show 
that 


(i) P(Ala)= a+ 2 P(A lb), @) P(Al 6)= = P(Ala), 


re _5 7 5. B 
(iii) P (B \.a)= : P(B | b),and (iv) P(B | b)= qet ae P(B | a), 


Hence or-otherwise, find P (A | a) and P (B | a). Also comment on the result 
that P(Aja)+P (B\a)=1. 

38. A bag contains an assortment of blue and red balls. If two balls are drawn 
at randon, the probability of drawing two red balls is five times the probability of 
drawing two blue balls. Furthermore, the probability of drawing one ball of each 
colour is six times the probabililty of drawing two blue balls. How many red and 
blue balls are there in the bag? 

Hint. Let number of red and blue balls in the bag be r and b respectively. Then 

pi= Prob. of drawing two red balls = Pre Tey ey 
_6(6-1N) 
(r+ b)(r+6-1) 


- | . a 
ps= Prob. of drawing one red and one blue ball -| (r+by(r+b— 7 


P2= Prob. of drawing two blue balls = 


Now pi=5p2 and p»=6p2 
r(r—1)=5b(6-1) and 2br=6b(b- 1) 

Hence 6=3 andr=6. 

39. Three newspapers A, B and C are published in a certain city. It is estimated 
from a survey that 20% read A;16% read B, 14% read C, 8% read'A and B, 5% read 
A and C, 4% read B and C and 2% read all the three newspapers. What is the 
probability that a normally chosen person 

(i) does not read any paper, (it) does not read C 
(iti) reads A but not B, (iv) reads only one of these papers, and 
(v) reads only two of these papers. 

Ans, (i) 0-65, (ii) 0-86, (iii) 0-12, (iv) 0-22, (v)0-11. 

40. (a) A die is thrown twice, the event space S consisting of the 36 possible 
pairs of outcomes (a,b) each assigned probability 1/36. Let A, B and C denote the 
following events : 

A=((a,b) | a is odd}, B = ((a,b) | b is odd}, C = ((a,b) | a+ bis odd.} 

Check whether‘A, B and C are independent or independent in pairs only. 

{Calcutta Univ. B.Sc. Hons., 1985] 
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(b) Eight tickets numbered 111, 121, 122, 122, 211, 212, 212, 221 are placed 
in a hat and stirred. One of them is then drawn at random. Show that the event A : 
"the first digit on the ticket drawn will be 1”, B : “the second digit on the ticket 
drawn will be 1," and C : "the third digit on the ticket drawn will be 1", are not 
pairwise independent although 

P(ANBQC)= P(A) P (B) P(C) 

41. (a) Four identical marbles marked 1, 2,3 and 123 respectively are put in 
an um and one is drawn at random. Let A;, (i= 1, 2,3), denote the event that the 
number i appears on the drawn marble. Prove that the events A,, A, and A; are 
pairwise independent.but not mutually independent. 

[Gauhati Univ. B:Sc. (Hons.), 1988] 


Hint. P-(A\) == =P (A2)=P (As)*;, P(Ar A2) = P (A, As) = P (Az. As) =5 


P (A, A2 A3) = = 


(b) Two fair dice are thrown independently. Define the following events : 
A : Even number on the first dice 
B : Even number on the second dice. 
C : Same number on both dice. 

Discuss the independence of the events A, B and C. 

(c) A die is of the shape of a regular tetrahedron whose faces bear the numbers 
111, 112, 121, 122. Ai, A2, As are respectively the events that the first two, the 
last two and the extreme two digits are the samé, when the die is tossed at random. 
Find whether or not the events Ai, A2, As are (i) pairwise independent, (ii) mutually 
(i.e. completely) independent. Determine P (A, | A243) ) and explain its value by 
by argument. [Civil Services (Main), 1983] 

42. (a) For two events A and B we have the following probabilities: 


P(A)=P(A|B)=> and P(B | A)= 


Check whether the following statements are true or false : 
(i) A and B are mutually exclusive, (ii) A and B are independent, (iii) A is a 


subevent of B, and (iv) P(A | B) == 


Ans. (i) False, (ii) True,, (iii) False, and (iv) True. 

(b) Consider two events A and B such that P (A)= 1/4, P (B | A) = 1/2, 
P(A | B) = 1/4. For each of the following statements, ascertain whether it is true 
or false : 

(i) Aisasub-eventofB, (ii) P(A | B)=3/4, 
(iii) P(A\B)+ P(A|B)=1 


43. (a) Let A and B be two events such that P (4) = 3 = and P (B)= 


aoln 
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Show that 
(i) P(AUB)2=, (i) <P (ANB) <2 ,and (iil) = = <P(AMB)S 


[Coimbatore Univ. B:E., Nov. 1990; Delhi Univ. a die ee 
(b) Given two events A and 8. If the odds against A are 2 to 1 and those in 
favour of AU B are3 tol, that 
> 
D* < P(B)<s 
Give an example in aia P (B)=3/4 and one in which P (B) = 5/12. 
44. Let A and B be events, neither of which has probability zero. Prove or 
disprove the following events : 
(i) IfA and B are disjoint, A and‘B are independent, 
(ii) If A and B are independent, A and B are apa 


45. (a) It is given that P(A;UA2)=—> 2p (A, NA2) => > and P (4) == 


where P(A.) stands for the probability that . does not me iaieies 
P (A) and P (A2). 
Hence show that A, and A2 are independent. 


Ans. P(A;) =4, P (A) = 5 
(6) A and B are events oa that 
P(AUB)=3, P(A NB)=5. and P (A)=-. 


3 
Find (i) P(A), (ii) P a and (iii) P(A vB). 
(Madras Univ. B.E., 1989) 


i 


Ans. (i) 1/3, (ii) 2/3 (iii) 1/12. 

46. A thicf has a bunch of n-keys, exactly one of'wnich fits a lock. If the thief 
ines to open the lock by trying the keys at random, what is the probability that he 
requires exactly & attempts, if he rejects the keys already tried? Find the same 
probability if he does not reject the keys already tried. 

(Aligarh Univ. B.Sc., 1991) 
. | | : 

Ans. (i) n? — 

(b) There are M ums numbered | to M and M balls numbered | to M. The balls 
are inserted randomly in the urns with one ball in each um. If a ball is put into the 
ur bearing the same number as the ball, a match is said to have occurred. Find the 
probability that no match has occurred. ’ (Civil Services (Main), 1984] 

Hint. See Example 4:54 page 4-97. 

47. If n letters are placed at random in n correctly addressed envelopes, find: 
the probability that 

(i) none of the letters is placed in the correct envelope, 


-1 
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(ii) Atleast one letter goes to the correct envelope, 
(iti) All letters go to the correct envelopes. 
[Delhi Univ. B.Sc. (Stat Hons.), 1987, 1984] 
48. An urn contains n white and m black balls, a second um contains N white 
and M black balls. A bail is randomly transferred from the first to the second um 
and then from the second to the first urn. If a bal] is now selected randomly from 
the first urn, prove that the probability that it is white is 
ney mN — nM 
n+m (n+m)(N+M+41) 
[Delhi Univ. B.Sc. (Stat.Hons.) 1986] 
Hint. Let us define the following events : 
B; : Drawing of a black ball from the ith urn, i= 1, 2. 
W; : Drawing of a white ball from the ith um, ¢.= 1, 2. 
The four distinct possibilities for the first two exchanges are B, W2, Bi B2, 
W, B>, W; W2 . Hence if E denotes the event of drawing a white ball from the first 
urn after the exchanges, then 


P (E)= P (B,W.E) +P (8B, BE) +P (W, Br E)+P (Wi W2 E) .{*) 

We have: 

P(B, W, E) = P(B;) . P(W;.| B,) P(E | By W;) =—@—-x—%— y 241 
m+n *M+N41° m+n 
m M+1 n 

P(B, Bz E)= P(B;) . P(B2 | Bi). P(E | Bi Ba) = aaa a 

; M n-] 

P(W, Bz E) = P(W;) . P(B2 | Wi). PEI a” rs hg or 

N+] n 


P(W,W2 E) = P(W,) P(W2 | W,) P(E |W W2) = min *M4N4] m+n 


Substituting in (*) and simplifying we get the result. 


49. A particular machine is prone to three similar types of faults Ai, A2and 
A3. Past records on breakdowns of the machine show the following : the probability 
of a breakdown (i.e., at least one fault) ir 0-1; for each i, the probability that fault 
A; occurs and the others do not is 0-02 ; for each pair é, j the probability that A; and 
Aj occur but the third fault does not is 0-012. Determine the probabilities of 

(a) the fault of type A, occurring irrespective of whether the other faults occur 
or not, 

(b) a fault of type A, given that A2 has occurred, 

(c)faults of type Ar and A2 given that A; has occurred. 

(London U. B.Sc. 1976] 
50). The probability of the closing of each relay of the circuit shown below is 


given by p. If all the relays function independently, what is the probability. that a 
circuit exists between the terminals L and R? 


l 2 
Ss 
: be 
St 
3 4 


Ans. p’(2-p’). 
4:9, Bayes Theorem. /f E;, £2,....E, are mutually disjoint events with 
P(E) #0, @= 1,2, ...,) then for any orbitrary event A which is a subset of 


n 
U E, such that P (A) > 0, we have 
i=] 
P(E) P(A | Ed) 
n 


P (Ei | A= , 6=1,2,..,0n. (4-12) 


= P(E) P(A! Ei) 
i=] 
Rn 
Proof. Since Ac vu E; ,we have 
i=] 
nA n 
A=AN( VU E,)= VU (ANE;) (By distributive law] 
i=] i=] 
Since (AN Ej) c Ei, (= 1, 2, ...., n) are mutually disjoint events, we have by 
addiuon ne of probability = Asioi 3 of propabiaty) 


P (A) = Plu (AV E)) = EP (ANE) = 5 P(E) P(A | E), wa(*) 
i=] i=1 i=] 
by compound theorem of probability. 
Also we have 
P(AQME) = P(A)P (E;| A) | 
P(AQE)) P(E)P(A\E; 2 
P(E,| y= Eo = eA [From (+)] 
ZP(E)P(A\ Ed : 
i=] 

Remarks. 1. The probabilities P (E), P (E2). ..., P (E,) are termed as the ‘a 
priori probabilities’ because they exist before we gain any information from the 
experiment itself, 

2. The probabilities P (A | E), i= 1,2, ..., n are called ‘likelihoods’ because 
they jndicate how likely the event A under consideration is to occur, given cach 
and every a priori probability. 

3. The probabilities P(E; { A), = 1, 2, ...,n are called ‘posterior probabilities’ 
because they are determined after the results of the experiment are known. 

4, From (*) we get the following important résult: 

"If the events E), E>, ..., E, constitute a partition of the sample space S anc 
P (E)) #0, i= 1,2, ..., 1, then for any event A in S we have 
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i=1 i= 1 
Cor. (Bayes theorem for future events) 
The probability of the materialisation of another event C, given 


PIC I ANE), P(C|AQNE),...,P(C|ANE,) is 


P (A)= EP(ANE)= E P(E) P (AE) (4-12 a) 


5 P(E) P(A E)P(C| EAA) 


P(C | A)= SS (4-12 b) 


Z P(E) P(A E) 
\ j= 
Proof. Since the occurrence of event A implies the occurrence of one and only 
one of the events E), E2, ..., E,, the event C (grantéd that A has occurred) can occu 
in the following mutually exélusive ways: 
CONE,,COE2,..,COE, 
L.e., C=(CNE))UV(CNOE)VU...U(COE,) 
=> ClA=[(CNE)|AJULI(CAE)| Alu... U(CAE) LA] 
- P(C|A)=P[(CNE) | Al+P (COE) | Al t..t PCE, | Al 


= ZP[(CNE) 1A) 


i=l 


= © P(E;| A) P[C| (E:NA)] 
i=1 


t= 


Substituting the value of P (E; | A) from (*), we get 


5 P(E) P(A | B) P(C | E;NA) 
P(C | A)= =+— 
x P(E) P(A\| E) 

i= 
Remark. It may happen that the materialisation of the event E; makes C 

iidependent of A, then we have 

. P(C|E; NA)= P(C|E), 
and the above formula reduces to 


5 P(E) P(A1E) P(C | E)- 
t=] 


P(C|A)= (4-12 €) 


EP E)PCALED 


The event C can be eased: in regard to A, as Future Event. 
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Example 4-30. /n 1989 there were three candidates for the position of 
principal - Mr. Chatterji, Mr. Ayangar and Dr. Singh = whose chances of getting 
the appointment are in the proportion 4:2:3 respectively. The probability that Mr. 
Chatterji if selected would introduce co-education in the college is 0-3. The 
probabilities of Mr. Ayangar and Dr. Singh doing the same are respectively 0-5 
and 0-8. Whatis the probability that there was co-education in the college in 1990? 


[Delhi Univ. B.Sc.(Stat. Hons.), 1992; Gorakhpur Univ. B.Sc., 1992] 


Solution. Let the cvents and probabilitics be defined as follows: 
A : Introduction of co-cducation 
E, : Mr. Chatterji is selected as principal 
E,: Mr. Ayangar is selected as principal 
E;: Dr. Singh is selected as principal. 
Then 


4 
P(E)= 4. P(E)== and P(Es)*2 


; 3 oS ok 8 
P(Al &)=55, P(Al B= and P(A Bs) =75 


P (A)=P (AN BE) U (AN £),U (AN E))] 
= P(ANE,) + P(A.NE2) + P(AQNE;) 
= P(E,)P(A| E\) + P(E) P(A | E2) + P(Es)P(A | Es) 
43,2 5,3 8 23 © 
~9°10 9°10 9°10 45 
Example. 4-31. The contents-of urns I, II and III are as follows: 
! white, 2 black and 3 red balls, 
2 white, 1 black and I red balls, and 
4 white, 5 black and 3 red.balls. 
One urn is chosen at random and two balls drawn. They happen io be white 
and red. What is the probability that they come from urns I, Il or Ill ? 


{Delhi Univ. B.Sc. (Stat. Hons.), 1988] 

Solution. Let £1, £2, and E; denote the events that the umm I, I] and III is chosen, 

respectively, and let A be the event that the two balls taken from the selected urn 
are white and red. Then 


P (E\)= P (E2)= P(E) = = 
1x3 1 2x1 1 
= == E>)= == 
P(A | E)) cG. 5 PAl 2) iG, = 3° 
4x3 2 


and P(A|E;)= 
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Hence - 
PE ay= PPL Ey 
EP (E)P(A\ BE) 
i=] - 
11 
_ meee 5S: 
3 95 3 3 3 «2121 
Similarly 
SA dane. 3° 11 _ 30 
ici ak CA A BR a1 
3 5 3 3 3 11 
~y7- 28 . 30 _ 33 
P(E, | A)= 1 118 118 118 


Example. 4-32. /n answering a question on a multiple choice test a student 
either knows the answer or he guesses. Let p be the probability that he knows the 
answer and 1-p the probability that he guesses. Assume that a student who guesses 
at the answer .vill be correct with probability 1/5, where 5 is the number of 
multiple-choice alternatives. What is the conditional probability that a student 
knew the answer to a question given that he answered it correctly? 

[Dethi Univ. B.Sc. (Maths Hons.), 1985] 

Solution. Lét us define the following-events: 

E, : The student knew the right answer. 
E2: The student guesses the nght answer. 
| A : The student gets the nght answer. 
Then we are given 
P(E,\)=p, P(Ex)=1-p, P(A\ E.)=1/5 
P (A | E;) =P [student gets the right answer given that he knew the night 
answer] = 1 

We want P (Ei | A). 

Ustng Bayes’ rule, we get : 

P(E, | 4)= P(E;):P(A\| E,) Z px __5p 
P(E;) P(A | Ey) + P(E2) P(A | E2) 4p+l 


px1+(I-p)x= 


Example 4-33. /n a bolt factory machines A,B and C manufacture respectively 
25%, 35% and 40% of the total. Of their output 5,4, 2.per cent are defective bolts. 
A bolt is drawn at random from the product and is found to be defective. What are 
the probabilities that it was manufactured by machines A, B and C? 


Theory of Probability 473 


Solution. Let £,,E2 and E; denote the events that a bolt.selected at raiidom 
js manufactured by the machines A, B and C respectively and let E denote the event 
of its being defective. Then we have 

P (Ey) = 0:25, P (£2) = 0-35, P (E3) = 0-40 

The probability of drawing a deféctive bolt manufactured by machine A is 
P(E | £:)=005. 

Similarly, we have 

P(E | Ex) =0-04, and P (E | E3)=0-02 

Hence the probability that a defective bolt selected-at random is manufactured 

by machine A is given by 


P(E, | E)= PE)P EE) 

E P(e) PE |B) 

ne 

_ 0-25 x 0-05 _ 125 _ 25 

0:25 x 0-05 + 0-35 x 0-04 +0-40 x 0:02 345 69 

Similarly 
a 0-35 x 0-04 _ 140 _ 28 
P (Eb B= 0:25 x 0-05 + 035x004 + 040x002 ~ 345 ~ 69 
and 

es _,_25  28_ 16 

P(E: |E)=1-(P(E, |E+P(E.| B= 1 rar ger 


This example illustrates one of the chief applications of Bayes Theorem. 


EXERCISE 4 (d) 

1. (a) State and prove Baye’s Theorem. 

(b) The set of events A, , (k= 1, 2, ..., 2) are (i) exhaustive and (ii) pairwise 
mutually exclusive. If for all k the probabilities P (Az) and P (E | Ax) are known, 
calculate P (A,;| E), where E is an arbitrary event. Indicate where conditions (i) and 
(i) are used. 

(c) The events F,, £,...,E, are mutually exclusive and E=E£,U £,uU.. 
VE,. Show that if P(A| &) = P(B| E);i=1, 2, .., n, then P(A | E) = 
P(B'| E). Is this conclusion true if the events,E; are not mutually exclusive? 

(Calcutta Univ. B.Sc. (Maths Hons.), 1990] 

(d) What are the criticisms against the use of Bayes theorem in probability 
theory. [Sri. Venketeswara Univ. B.Sc., 1991] 

(e) Using the fundamental addition and multiplication rules of probability, 


show that 
___.__P(B)P(A|B) 
P(B|A)= P (B) P (A|B)+ P (8B) P (A\|B) 


where B is the event complementary to the event B. 
[Dethi Uniy. M.A. (Econ.), 1987] 
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2. (a) Two groups are compcting for the positions on the Board of Directors 
of a corporation. The.probabilities that the first and second groups wil) wifi are 0-6 
and 0-4 respectively. Furthermore, if the first group wins the probability of 
introducing a new product is 0-8 and the corresponding. probability if the second 
group wins is 0-3. What is the probability that the new product will be introduced? 
Ans. 0:6 x 0-8 +04 x 0-3 =0-6 


(b) The chances of X, Y, Z becoming managers ol a certain company are 4:2:3, 
The probabilities that bonus scheme will be introduced if X, Y, Z become managers, 
are 0:3, 0-5 and 0-8 respectively. If tie bonus scheme has been introduced, what is 
the probability that X is appointed as the manager. 

Ans. 0-51 

(c) A restaurant serves two special dishes, A and B to its customers consisting 
of 60% men and 40% women. 80% of men order dish A and the rest B. 70% of 
women order dish B and the rest A. In what ratio of A to B should the restaurant 
prepare the two dishes? (Bangalore Univ. B.Sc., 1991) 

Ans. P(A)=P(AANM)UCZNW)]= 06x08 4+0-4 x 0-3 = 0-6 

Similarly “P (B) = 0-4. Required ratio = 0-6 : 0-4 = 3 : 2. 

3. (a) There are three ums having the following compositions of black and 
white balls. 

Um 1:7 white, 3 black balls 
Um 2:4 white, 6 black balls 
Urn 3-: 2 white, 8 black balls. 

One of these urns is chosen at random with probabilities 0-20, 0-60 and 0-20 
respectively. From the chosen ur two balls are drawn at random without replace- 
ment. Calculate the probability that both these balls are white. 

Ans. 8/45. (Madurai Univ. B.Sc., 1991) 


(b) Bowl I contain 3 red chips and 7 blue chips, bowl II contain 6 red chips 
and 4 blue chips. A bowl is selected at random and then 1 chip is drawn from thiis 
bowl. (i) Compute the probability that this chip is red, (ii) Relative to the hypothesis 
that the chip is red, find the conditional probability that it is drawn from bowl IT. 

[Delhi Univ. B.Sc. (Maths Hons.)1987] 


(c) Ina factory machines A and B are producing springs of the same type. Of 
this production, machines A and B produce 5% and 10% defective springs, 
respectively. Machines A and B produce 40% and 60% of the total output of the 
factory. One spring is selected at random and it is found to be defective. What is 
the possibility that this defective spring was produced by machine A ? 

[Delhi Univ. M.A. (Econ.), 1986] 


(d) Um A contains 2 white, 1 black and 3 red balls, um B contains 3 white, 2 
black and 4 red balls and urn C contains 4 white, 3 black and 2 red balls. One urn 
is chosen at random and 2 balls aré drawn. They happen to be red and black. What 
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js the probability that both balls came from urn ‘B’ ? 
[Madras U. B.Sc. April; 1989] 

(e) Urn X;, X2, X3, each contains 5 red and 3 white balls. Ums Yj, Y2, each 
contain 2 red and 4 white balls. An urn is selected at random and a ball is drawn. 
It is found to be red. Find the probability that the ball comes out of the urns of the 
first type. [Bombay U. B.Sc., April 1992] 

(f) Two shipments of parts are received. The first shipment contains 1000 parts 
with 10% defectives and the second shipment contains 2000 parts with 5% 
defectives. One shipment is selected at random. Two parts are tested and found 
good. Find the probability (a posterior) that the tested parts were selected from the 
first shipment. [Burdwan Univ. B.Sc. (Hons.), 1988] 

(g) There are three machines producing 10,000 ; 20,000 and 30,000 bullets 
per hour respectively. These machines are known to produce 5%, 4% and 2% 
defective bullets respectively. One bullet is taken at random from an hour’s 
production of the three machines. What is the probability that it is defective? If the 
drawn bullet is defective, what is the probability that this was produced by the 
sccond machine? [Delhi Univ. B.Sc. (Stat. Hons.), 1991] 

4. (a) Three urns are given each containing red and white chips as indicated. 

Um 1:61red and 4 white. 
Um 2:2red and 6 white. 
Um 3:1 red and 8 white. 

(i) An urn is chosen at random and a ball is drawn from this urn. The ball is 
red. Find the probability that the urn chosen was um 1. 

(ii) An urn is chosen at random and-two balls are drawn without replacement 
from this urn. If both balls are red, find the probability that urn I was chosen. Under 
these conditions, what is the probability that urn III was chosen. 

Ans. 108/173, 112/12, 0 [Gauhati Univ. B.Sc., 1990] 

(b) There are ten urns of which each of three contains 1 white and 9 black balls, 
each of other three contains 9 white and 1 black ‘ball, and of the remaining four. 
each contains 5 white and 5 black balls. One of the urns is selected at random and 
a ball taken blindly from it turns out to be white. What is the probabililty that an 
um containing | white and 9 black oe was selected? / — Univ. B.Sc., 1991) 


Hint: P(E\)= = , P(E.)= 19 and P (E3) = 


Let A be the event bs drawing a an ball. 
re Oe: ic Le 
se 10 x70 * 10% 10 * 10°10" 2 


P(A | B)= = 9 and PEL |A)= 2 —s 


(c) It is known ies an um containing jena 10 balls was: filled in, the 
following manner: A coin was tossed.10 times, and according as it showed heads 
Or tails, one white or one black ball was put into:the urn. Balls are drawn, from this 
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urn one at a time, [0 times in succession (with replacement) and every one turns 
out to he white. Find the chance that the urn contains nothing but white balls. 

Ans, 0-0702. 

5. (a) From a vessel containing 3 white and 5 black balls, 4 balls are 
transferred into an empty vessel. From this vessel a ball is drawn and is found to 
be white. What is the probability that out of four balls transferred, 3 are white 
and | black. [Delhi Uni. B.Sc. (Stat. Hons.), 1985] 

Hint. Let the five mutually exclusive events for the four balls transferred be 
Ey, E;, Ez, E3, and Ey, where E; denotes the event that i white balls are 
transferred and let A be the event of: drawing a white ball from the new vessel. 


5C 3C, x 5C 3C x 5C 
Then P (Eo) =g¢t» PEs) =—“4 — P (Ex) = 4 
30, x5C 
P (Ey) —— and P (Es) =0 


Also P(A l'Eg) =0,P (A iE) =, P(Al Ee = (A lEjee, 
4 4 4 


and P(A bE,) = 1. Hence P(E; 1A) == 


(b) The contents of the urns | and 2 are as follows : 
Urn 1 : 4 white and 5 black balls. 
Urn 2 : 3 white and 6 black balls. —' 

One urn is chosen at random and a ball is drawn and its colour noted and 
replaced back to the urn. Again a ball is drawn from the same urn, colour noted 
and replaced. The process is repeated 4 times and as a result one ball of white 
colour and three balls of black colour are obtaine J. What is the probability that 


the urn chosen was the urn 1 ? (Poona Univ. B.E., 1989) 
Hint. P(E,) =P (E,) = 1/2, 
P (ALE) = 4/9, 1-P(AIE,)=5S/9 


P(AIE)) = 1/3, 1-P(AIE,) = 2/3 
The probability that the urn chosen was the urn | 


156) 
£Q 44-8 


(c) There are five urns numbered | to 5. Each urn contains 10 balls. The ith 
urn has i defective balls and 10—- é non-defective balls; ¢= 1, 2,... 5. An urn is 
chosen at random and then a ball is selected at random from that urn. (() What 1s 
the probability that a defective ball is selected ? 

(ii) 1f the selected ball is defective, find the probability that it came from 
urn i, (i= 1,2,..., 5). [Delhi Univ. B.Sc. (Maths Hons.), 1987] 

Hint.: Define the following events : 

E;: ith urns selected at random. 
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A : Defective ball is selected. 
P-(E) = 1/5; i= 1,2, ..., 5. 
P(A|E)=P ets ball from ith urn] = i/10, (i= 1, 2, ..., 5) 


P(E).P(A| E)= aXe, (i= 1,2,..., 5). 
~ 5 
_ i) 142434+44+5 3 
(i) P(A)= EPE)PALED= E 35 |= 42se sc LS 


: | PE)PAIE) _ i/50_ 4, 1 
(ii) P(E;| A)= EP P(E)P (ALE) > 3/107 15’ = 1,2,...,5. 


For example, the probability that the defective ball came from Sth um 
= (5/15) = 1/3. 

6. (a) A bag contains six balls of different colours and a ball is drawn from it 
at random. A speaks truth thrice out of 4 times and B speaks truth 7 times out of 10 
times. If both A and B say that a red ball was drawn, find the probability of their 
joint statement being true. 

[Delhi Univ. B.Sc. (Stat. Hons.),1987; Kerala Univ. B.Sc.1988] 

(b) A and B are two very weak students of Statistics and their chances of 
solving a problem correctly are 1/8 and 1/12 respectively. If the probability of their 
making a common mistake is 1/1001 and they obtain the same answer, find the 
chance that their answer is correct. [Poona Univ. B.Sc., 1989] 

ie Wx V2 2 

Ans. Reqd. Probability = Vex Vi2+(1-¥s).(1= M2) Mion 

7. (a) Three boxes, practically indistinguishable in appearance, a two 
drawerseach. Box I.containsa gold‘coin in one and a silver coin in the other drawer, 
box II contains a gold coin in each drawer and box III contains a silver coin in each 
drawer. One box is chosen at random and one of its drawers is opened at random 
and a gold coin found. What is the:probability that the other drawer contains a coin 
of silver? (Gujarat Univ. B.Sc., 1992) 

Ans. 1/3, 1/3. 

(b) Two cannons No. I and 2 fire at the same target. Cannon No. I gives.on 
an average 9 shots in the time in which Cannon No. 2 fires 10 projectiles. But on 
an average 8 out of 10 projectiles from Cannon No. | and 7 outof 10 from Cannon 
No. 2 strike the target. In the course of shooting, the target is struck by one 
projectile. What is the probability of a projectile which has struck the target 
belonging to Cannon No. 2 ? (Lucknow Univ. B.Sc., 1991) 

Ans. 0-493 

(c) Suppose 5 men out of 100 and 25 women out of 10,000 are colour.blind. 
A colour blind person is chosen at random. What is the probability of his being 
male? (Assume males and females to be in equal number.) 

Hint. E, = Person isa male, E2= Person is a female. 
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A =.Person is colour blind. 

Then P(E\)=P(E,)=%2, P(A| E:)=005, P(A| Ex) = 00025. 

Hence find P (£, | A). 

8. (a) Three machines X, Y, Z with capacities proportional to 2:3:4 are 
producting bullets. The probabilities that the machines produce defective are 0-1, 
0-2 and 0-1 respectively. A bullet is taken from a day’s production and found to be 
defective What is the probability that it came from machine X ? 

[Madras Univ. B.Sc., 1988] 

(b) In a factory 2 machines M, and M2 are used for manufacturing screws 
which may be uniquély classified as good or bad. M; produces per day m, boxes of 
screws, of which on the average, p:% are bad while the corresponding numbers for 
M2 are n2 and po. From the total production of both M, and M2 for a certain day, a 
box is chosen at random, a screw taken out of it and it is found to be bad. Find the 
chance that the selected box is manufactured (i) by Mi, (ii) M2. 

Ans. (8) ny pi/(m pitn2p2), (ti) n2 p2e/(m pi t+ mp2). 


9. (a) A man is equally likley to choose any one of three routes A, B, C from 
his house to the railway station, and his choice of route is.not influenced by the 
weather. If the weather is dry, the probabilities of missing the train by routes A, B, 
C are respectively 1/20, 1/10, 1/5. He sets out on a.dry day and misses the train. 
What is the probability that the route chosen was C ? 

On a wet day; the respective probabilities of missing the train by routes A, B, 
C are 1/20, 1/5, 1/2 respectively. On the average, one day in four is wet. If he misses 
the train, what is the probability that the day was wet? 

[Allahabad Univ. B.Sc., 1991] 

(b) A doctor is to visit the patient and from past experience it is known that 
the probabilities that he will come by train, bus or scooter are-respectively 3/10, 
1/5, and 1/10, the probabililty that he will use some other means of transport being, 
therefore, 2/5. If he comes by train, the probability that he -will be late is-1/4, if 
by bus 1/3 and if by scooter 1/12, if he uses some other means of transport it can 
be assumed that he will not be late. When he armves he is late. What is the 
probability that (i) he comes by train (ii) he is not.late? 

[Burdwan Univ. B.Sc. (Hons.), 1990] 

Ans. (i) 1/2, (ii) 9/34 


10. State and prove Bayes rule and expalin why, in spite of its easy deducubility 
from ‘the postulates of probability, it has been the subject of such extensive 
controversy. 

In the chest X-ray tests, it is found that the probability of detection when a 
person has actually T.B. is 0-95 and probabililty of diagnosing incorrectly as having 
T.B. is 0-002. In a certain city 0-1% cf the adult population is suspected to be 
suffering from T.B. If an adult is selected at random and is diagnosed as having 
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T.B. on the basis of the X-ray test, what is the probability of his actually having a 
T.B.? (Nagpur Univ. B.E., 1991) 
Ans. 0-97 
11. A certain transistor is manufactured at three factories at Barnsley, Bradford 
and Bristol. It is known that the Barnsley factory produces twice-a$ many transistors 
as the Bradford one, which produces the same number as the Bristol one (during 
the same period). Experience also shows that 0-2% of the transistors produced at 
Barnsley and Bradford are faulty and so are 0-4% of those produced at Bristol. 
A service engineer, while maintaining an electronic equipment, finds a defec- 
tive transistor. What is the probability that the Bradford factory is to blame? 
(Bangalore Univ. B.E., Oct. 1992) 
12. The sample space consists of integers from 1 to 2n which are assigned 
probabilities proportional to their logarithms. Find the probabilities and show that 
the conditional probability of the integer 2, given that an even integer occurs, is 
log 2 
[nlog2+ log(n!)] (Lucknow Univ. M.A., 1992) 
[Hint. Let E; : the event that the integer 2i is drawn, (i = 1, 2, 3, ...,”). 
A : the event of drawing’ an even integer. 


=> A=E,VUE.U...UE, => °#P(A)= E P(E) 
i=] 


But P(E;)=k log (2) (Given) 
aA n 
P(A)=k & log (2i) =k log [1 (2 = k[ nlog2+ log (n!) ] 
i=] t=] 


log (2i) 
[n log 2+ log (n!)] 

13. In answering a question on a multiple choice test, an examinee either 
knows the answer (with probability p), or he guesses (with probability 1 - p). 
Assume that the probability of answering a question correctly is unity for an 
examinee who knows the answer and 1/m for the examinee who guesses, where m 
is the number of multiple choice alternatives. Supposing an examinee answers a 
question correctly, what is the probability that he really knows the answer? 

[Delhi Univ. M.C.A., 1990; M.Sc. (Stat.), 1989] 

Hint. Let E, = The examinee knows the answer, 

E,= The examinee guesses the answer, 
and A= The examinee answers correctly. 
Then P(E,)=p, P(E2)=1-p, P(A | £,)=1 and P(A | E2)= 1/m 
Now use Bayes theorem to prove 


_ mp 
P(EIA)= Toe 


14. Die A has four red and two. white faces whereas die B has two red and four 
white faces. A biased coin is flipped once. If it falls heads, the game continues by 


P(E; | A)= 
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throwing die A, if.it falls tails die B is to be used. 
(i) Show that the probability of getting a red face at any throw is 1/2. 
(ii) If the first two throws resulted in red faces, what is the probability of 
red face at the 3rd throw? 
(iii) If red face tums up at the first n throws, what is the probability that die 
A is being used? 
ss Be ee 

Ans, (ii) 3/5 (iii) al 

15. A manufacturing firm produces steel pipes in three plants with daily 
production volumes of 500, 1,000 and 2,000 units.respectively. According to past 
experience it is known that the fraction of defective outputs produced by the three 
plants are respectively 0.005, 0.008 and 0.010. Ifa pipe is selected at random from 
a day’s total production and found to be defective, from which plant does that pipe 
come? 

Ans. Third plant. . 

16. A piece of mechanism consists of 11 components, 5 of type A, 3 of type 
B, 2 of type C and 1 of type D. The probability that any particular component will 
function for a period of 24 hours from the commencement. of operations without 
breaking down is independent of whether or not any other component breaks down 
during that period and can be obtained from the following table: 

Component type:ABCD 

Probability:0-60-70-30-2 

(i) Calculate the probability that 2 components chosen at random from the 11 
components will both function for a period of 24 hours from the commencement 
of operations without breaking down. 

(ii) If at the end of.24 hours of operations neither of the 2 components chosen 
in, (i) has broken down, what is the probability that they are both type C com- 
ponents. 

Hint. 


I 


(i) Required probability = ic [ °C2 x 0-6)" + °C2 (0-79 + 7C2 (0-3)? 


2 
+ °C, x7*C; x 0-6 x 0-7 + °C; x 2C, x (0-6) x (0-3) 
+ °C, x1Cy x (0-6) x (0-2) + °C, x ?C; x 0-7 x 03 
47°C, x'C,x0-7 x0-2+7C, x1C, x 0-3 x 0-2] 
=p (Say). 
(ii) Required probability (By Bayes theorem) 
_ 2G. x (03) _ 0.09 


Pp - p 
4-10. Geometric probability. In remark 3, § 4-3-1 it was pointed out that the 
classical definition of probability fails if the total number of outcomes of an 
experiment is infinite. Thus, for example, if we are interested in finding the 
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probability that a point selected-at random in a given region will lie in a:specified 
part of it, the classical definition of probability is modified and extended to what 
is called geometrical probability or probability in continuum. In this case, the 
general expression for probability ‘p’ is given by 


_ Measure _of specified part of the_region 


° Measure of the whole region 


where ‘measure’ refers to the length, area or volume of the region if we are dealing 
with one, two or three dimensional space respectively. 

Example 4-34. Two points are taken at random on the given straight line of 
length a. Prove that the probability of their distance exceeding a given length 


c (< a),is equal to | 1 - ; J 


[Burdwan Univ. B.Sc. (Hons.), 1992; Delhi Univ. M.A. (Econ.), 1987] 
Solution. Let P and Q be any two points taken at random on the given straight 
line AB of length ‘a’. Let AP =x and AQ =y, 
(O<x<a,0<ysa). 
Then we want P{| x—y | >cl. 
The probability can be easily calculated geumetrically. Plotting. thé lines 
x-y=cand y~x=c along-the co-ordinate axes, we get the following diagram: 


SinceO<x<a,0<y <a, total aréa=a.a=a’. 
Area favourable to the event | x — y {> c is given by 


A LMN + ADEF => LN .MN +2 EF . DF 


ual a 
=5 (a-c)’+5 (a~-c)'=(a-cy 


— p\* 
P(Ix-yl>= 259 -(1-5] 
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Example 4-35. (Bertrand’s Problem). [f a chord is taken at random in a 
circle, what is the chance that its length | is not less than ‘a’ , the radius of the 
circle? 

Solution. Let the chord AB make an angle 6 with the diameter AOA ’ of the 
circle with centre O and radius OA=a. Obviously 6 lies betwseen — 1/2 and 7/2. 
Since all the positions of the chord AB and 
consequently all the values of 8 are equally 
likely, © may be regarded as a random vari- 
able which is uniformly distributed cf. § 8-1 
‘over (= 1/2, ©/2) with probability density 
function 


f(®%)= > —/2<O0< n/2 


Z ABA’, being the angle in a semi- 
circle, is aright angle. From A ABA’ we have 
AB once 
AA’ = 09S 
= l= 2acos 8 
The required probability ‘p’ is given by 

p=P(l 2 a)=P (2acos® 2 a) 
=P(cos@ > 1/2)=P(|6| < 7/3) 


1/3 \ 1/3 ) 
=| f@d=- | ao=5 
Tt 3 

-n/3 - 7/3 


Example 4-36. A rod of length ‘a’ is broken into three parts at random. What 
is the probability that a triangle can be formed from these parts? 
Solution. Let the lengths of the three. parts of the rod'be x, y and a — (x + y). 
Obviously, we have 
x>0; y>Oandx+y<a = y<a-x ..(*) 
In order that these three parts form the sides of a triangle, we should have 


: a 
xt+y>a-(x+y) => ar ae | 
and Xt+a-(x+y)>y => y<5 


..(**) 


a 
yta-(x+y)>x => y<5 
since in a triangle, the sum of any two sides is greater than the third. Equivalently, 
(**) can be written as 


5 A O<x<5 wo ( ¥**) 


Hence, on using (*) and (**#), the required probability is given by 


oHxcy< 
a) J 
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a/2 a2 a/2 
a@_{a_ 
[ J aa f E E x |e 
Q (a/2)-x ; _ 9 
a a-x ~ a 
| | dy dx | @-x) dx 
0 0 0 
"x? Ja/2 
_ _l2lo- _ a8 _1 
—(a-x)’|@ @’/2 4 
2 0 


Example 4-37. (Buffon’s Needle Problem). A vertical board is ruled with 
horizontal parallel lines at constant distance ‘a’ apart. A needle of length | (< a) 
is thrown at random on:the table. Find the probability that it will intersect one of 
the lines. 

Solution. Let y denote the distance from the centre of the needle to the nearest 
parallel and be angle formed by the needle with this parallel. The quantities y 
and @ fully determine the position of the needle. Obviously y ranges from 0 to 
a/2 (since |< a) and 6 from 0 ton. 

Since the needle is dropped randomly, all possible values of y and $ may be 
regarded as equally likely and consequently the joint probability density function 
f(y, >) of y and is given by the uniform 
distribution.( c.f. § 8.1 ) by 

fO,O=k; SOS, 
O<ysa/2,__...(*) 
where & is a constant. 

The needle will inte:sect one of the 
lines if the distance of tts centre from the 
line is less than 3 / sind, i.e., the required 
event can be represented by the inequality 
O<y< > sin. Hence.the required probability p is given by 

x (Ising)/2 


4 


f(y, 9) dy do 


x 
J singe 
0 
(a/2) .% 
= ge = a cS 
an 0 an 
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EXERCISE 4 (e) 

1. Two points are selected at random in a line AC of length ‘a’ so as to lic on 
the opposite sides of its mid-point O. Find the probability that the distance between 
them is less than a/3 . 

2. (a) Two points are selected at random on a line of length a. What is the 
probability that acne of three sections in which the line is thus divided is less than 
a/4? 

Ans. 1/16. 

(b) A rectilinear segment AB is divided by a point C into two parts AC=a, 
CB=b. Points X and Y are taken at random on AC and C8 respectively. What is the 
probability that AX, XY and BY can form a triangle? 

(c) ABG is a straight line such that AB is 6 inches and BG is 5 inches. A point 
Y is chosen at random on the BG part of the line. If'C lies between B and G in such 
a way that AC=1 inches, find 

(i) the probability that Y will lie in BC. 
(ii) the probability that'Y will lie in CG. 

What can you say about the sum of these probabilities? 

(d) The sides of a rectangle are taken at random each less than a and all lengths 
are equally likely. Find the chance that the diagonal is less than a. 

3. (a) Three ports are taken at random on the circumference of a circle. Find 
the chance that they lie on the same semi- circle. 

(b) A chord is drawn at random in a given circle. What is the probability that 
itis greater than the side of an equilateral triangle inscribed in that circle? 

(c) Show that the probability of choosing two points randomly from a line 
segment of length 2 inches and their being at a distance of at least 1 inch from each 
other is 1/4. {Delhi Univ. M.A. (Econ.), 1985] 

4. A point is sclected at random inside a circle. Find the probability that the 
point is closer to the centre of the circle than to its circumference. 

5. Onc takes at random two points P and Q on a segment AB of length a 

(i) What is the prooability for the distance-PQ being less than b ( <a )? 
(ii) Find the chance that the distance between them is greater than a given 
length 0. 

6. Two persons A and B, make an appointment to meet on a certain day at a 
certain place, but without fixing the time further than that it is to be between 2 p.m. 
and 3 p.m and that each is to wait not longer than ten minutes for the other. 
Assuming that each is independently equally likely to arrive at any time during the 
hour, find the probability that they meet. 

Third person C, is to be at the same place from 2-10 p.m. until 2-40 p.m. on 
the same day. Find the probabilities of C being present when A and B are there 
together (i) When A and B remain after they meet, (ii) When A and B leave as soon 
as they meet. 
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Hint. Denote the times of arrival of A by x and of B by. y. For the meeting to 

take place it is necessary and sufficient that 
| x-y|<10 

We depict x and y as Cartesian coordinates in the plane; for the scale unit we 
take one minute. All possible outcomes can be described as points of a aquare with 
side 60. We shall finally get (c.f. Example 4-34, with a = 60, c = 10] 

P[|x-y|<10]= 1-(5/6)?= 11/36 

7. The outcome of an experiment are represented by points in the square 
bounded by x=0, x= 2 and y=2 in the xy-plane. If the probability is distributed 
uniformly, determine the probability that x*7+ y’ > 1 

Hint. 


Required probability P (£) = J ; dx dy=1- J : dx dy 
z E’ 


where E is the region for which x*+)’>1 and E” is the region for which 
2 2 
xt+y si. 
11 
3 
ap (Ey=4-J J a& dy=3 = P(E)=2 
0 0 
8. A floor is paved with ules, each tile. being a parallelogram such that thé 
distance between pairs of opposite sides are a and b respectively, the length of the 
diagonal being /. A stick of length c falls on the floor parallel to the diagonal. Show 


that the probability that it will lie entircly‘on one tile is 
}-£ } 
l 
If a circle of diameter d is thrown on the floor, show that the probability that 


it will lie on one tle is 


(8) (9 


9. Circular discs of radius r are thrown at random on to a plane circular table 
of radius R which is surrounded by. a border of uniform width r lying in the same 
plane as the table. If the discs are thrown independently and at random, end N stay 
on the table, show that the probability that a fixed point on the table but not on the 
border, will be covered is 


r? 
= Cr 
SOME MISCELLANEOUS EXAMPLES 


Example 4-38. A die is loaded in such a manner that for n=1, 2, 3,4,5,6.the 
probability of the face marked n, landing on top When the die is rolled is propor- 
tional to n. Find the probability that an odd number will appear on tossing the die. 

(Madras Univ. B.Sc. (Stat. Main), 1987] 
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Solution: Here we-are given 
P(njx<xn- or P (n)=kn, where k is the constant of proportionality. 
Also P(1) + P(2)+..P(6)=1 => A14+24+34+44+5+6)=1ork=1/21 
Required Probability = P(1) + P(3) + P(5) = “ae = S 
Example 4:39. In.terms of:probability : 
pi=P(A), po=P(B), pp=P(ANB), (Pr, pr, ps > 9) 
Express the following in terms of p:. .p2, ps 
(a)P(AUB), (6) P(AUB), (c)P(AQB), (d)P(AUB), (e)P(AQB) 
(f) PCACB), (g) P(A|B), (h)P (BIA), (i) PIAN(AVUB)] 
Solution. 
(a) P(AUB)= 1- P(AUB)= 1~-[P(A)+ P@)— P(AB)}. 
=l- Pi~ prt p3.- 
(b) P(AUB)= P(ANBF 1- P(ANB)= 1- ps 
(c} P(AQB)= P (B-AB)=P (B)—P (ANB)=m-ps 
(d): P(A UB)= P (A)+ P(B)- P(AnB)=1- Pit Pr- (P2- Ps) 
=l- Pit ps 
(ec) P(AQB)= P(AUB)= 1- pi- Pot Ps. [Part (a)] 
(f) P(AQB)= P(A- AQNB)= P(A)— P(ANB)= pir ps * 
(g) P(A\B)= P(ANB)/P(B)= ps/pr 
(hk) P(B}A)= P(ANB)/P(A)= (2- ps)/(1- pr) 
(i) P[AN(AUB)]= P[(ANAYU(ANB)] 
= P(ANB)= pr— ps (ANA=9] 
Example 4-40. Let P(A) = p, P(A|B)=4q,P (B|A) =r. Find relations be- 
tween the numbers p, q, 1 for the following cases : 
(a) Events A and B are mutually exclusive. 
(b) Aand B are mutually exclusive and collectively exhaustive. 
(c) Ais a subevent of B; B is a subevent of A. 
(d) A and Bare mutually exclusive. 


(Delhi Univ. B.Sc. (Maths Hons.) 1985] 
Solution. From given data: P (A)=p, P (A\\B)=P (A) P (B|A)=rp 


_ P(ANB)_ 
PO="P (AIB) 
(a) P(AQNB)= 0 => rp= 0. 
(b) P(A™B)=0 and P(A)+ P(B)= 1 
=> piq+n=q. p=0 => pq=zq => p=1V q=9. 
(c) ACB > ANB=A or-P(AN B)=P(A) = rp=p > r=1Vp=9. 
BoA = AQNB=B or P(ANB)=P(B) 
=> rp=(rp/q) oO rp(q-l=0 => qe! 
(dq) P(ANB)=1-P(AUB) = O=1-[P(A)+P(B)-P(AN8)| 
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So P(A)+P(B)=14+P(ANB) = p[l+#(r/g)) = 1+rp 
p(a+ r= q(I+ pr). 

Example 4-41. (a) Twelve balls are distributed at random among three boxes. 
What is the probability that the first box will contain 3 balls? 

(b) If n biscuits be distributed amiong N persons, find the chance that a 
particular person receivesr (<n) biscuits. [Marathwada Univ. B.Sc. 1992] 

Solution. (a) Since each ball can go'to‘any one of. the-three boxes, there.are 3 
ways in which a ball can go to any one of the three boxes. Hence there are 3'? ways 
in which. 12 balls can be placed in the three boxes. 

Number of ways in which 3 balls out of 12 can go to the first box is '*C3. Now 
the remaining 9 balls are to be placed in 2 boxes and this can be done in 2’ ways. 
Hence the total number of favourable cases = '?C; x 2°. 

j ml x yh 
Required probability = ——.— 

(b) Take any one biscuit. This can be given to any oné of the N beggars so that 
there are N ways of distributing any one biscuit. Hence the total number of ways 
in which n biscuit can be distributed at random among N beggars 

=N.N...N(ntimes')= N". 

r’ biscuits can be given to any particular beggar in "C, ways. Now we are left 
with (n—r) biscuits which are ‘to be distributed among the remaining (N — 1) 
beggars and this can be done'in(N- 1)" ways. 

Number of. favourable cases = "C, .(N—- 1)?" 
"C.(N-1)"" 
N R 

Example 4-42. A car is parked among N cars in a row, not.at either end. On 
his return the owner finds that exactly r of the N pjaces are still occupied.What is 
the probability that both neighbouring places are empty? 

Solution. Since the owner finds on return shat exactly r of the N places 
(including-Owner’s car) are occupied, the exhaustive number of cases for such an - 
arrangement is “~'C,_, [since the remaining r ~ 1 cars are to be parked in the 
remaining N — 1 places and this can-be done in “~'C,_, ways]. 

Let A denote the event that both the neighbouring places to owner’s car are 
empty. This requires the remaining (r — 1) cars to be parked in ‘the remaining 
N~3 places and hence the humber of cases favourable to A is “~°C,-1. Hence 

a: C4 (N-r)(N-r- 1) 
OS IGN INN 2) 


Hence, required probability = 


Example 4-43. What is the probability that at least two out of n people have 
the same birthday? Assume 365 days ina year and that all days are equally likely. . 
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Solution. Since the birthday of any person can fall on any one of the 365 days, 
(he exhaustive number of cases for the birthdays of n persons is 365". 
If the birthdays of all the n persons fall on different days, then the number of 
favourable cases is 
365 ( 365 - 1 ) (365 — 2) .... [365 —-(n-1)], 
because in this case the birthday of the first person can fall on any one of 365 days, 
the birthday of the second person can fall on any one of thé remaining 364 days 
and so on. 
Hence the probability (p) that birthdays of all the » persons are different js 
given by : 
_ 365 (365 — 1) (365 —2) ... [365 —(n—- 1)] 
365” 


] 2 3 n-1 
(1-36 | (1-365) (1-355 }-~(F- 365 


Hence the required probability thai at least two persons have the same birthday 


1’\(, 2° 3 n~1 
ne eae (1 55 |(!-es } (365 }--(1- 365 
Example 4-44. A five-figure number is formed by the digits-0, 1, 2, 3,4 
(without repetition), Find the probability-that the number formed is divisible by 4. 


[Delhi Univ. B.Sc. (Stat. Hons.), 1990] 


Solution. The total number of ways in which the five digits 0, 1, 2,3, 4 can be 
arranged among themselves is 5!. Out of these, the number of arrangements which 
begin with O (and, therefore, will give only 4-digited numbers) is 4!. Hence the 
total number of five digited numbers that can be formed from the digits 0, 1, 2,3, 
4is 


1S 


5!~4!:=120-24=96 

The number formed will be divisible by 4 if the number formed by the two 
digits on extreme right (i.e., the digits in the unit and tens places) is divisible by 4. 
Such numbers are : 

04 ,12,20,24,32 ,and 40 

If the numbers end in 04, the_remaining three digits, viz.,1, 2 and 3 can & 
arranged among themselves in 3 ! ways. Similarly, the number of arrangements of 
the numbers ending with 20 and 40 is 3 ! in each case. 

If the numbers end with 12, the remaining three digits 0 , 3 , 4 can be arranged 
in 3 ! ways. Out of these we shall reject those numbers which start with 0 (i.e., have 
O as the first digit). There are (3 - 1) !=2! such cases. Hence, the number of fivé 
digited numbers ending with 12 is 

31!-2!=6-2=4 
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Similarly the number of 5.digited numbers ending with 24 and 32 each is 4. 
Hence the total number of favourable cases is 
3x3!+3x«4 = 18412 = 30 
30. =65 
Hence required probability = 06 16 
Example 4-45. (Huyghen's problem). A and B throw alternately with a pair 
of ordinary dice. A wins if he throws 6 before B throws’7, and B wins if he throws 
7 before A throws 6. If A begins, show that his chance-of winning is 30 ( 6] 
[Delhi Univ. B.Sc. (Stat. Hons.), 1991; Delhi Univ. B.Sc.,1987] 
Solution. Let E, denote the event of A’s throwing ‘6’ and E, the event of B’s 
throwing ‘7 witha pair of dice. Then E, and E, are the complementary events. 
‘6’ can be obtained with two dice in the following ways: 
(1,5), (5, 1), (2, 4), (4, 2), (3, 3), ie., in 5 distinct ways. 
2. ae By-j)-2- 3h 
‘7’? can be obtained with two dice, as follows: 
(1, 6), (6, 1), (2, " (5, 2), (3, 4), (4, 3), é. i in 6 distinct ways. 


P(E3) = and P.G)=1-+= 2 


6 — 
36 : : 6 
If A starts the-game, he will win in the following mutually exclusive ways: 
(i) E, happens (ii) E, 0 E, 0 E, happens 
(iti) E, AE, QE,QE,QE, happens, and so on. 
Hence by addition theorem of probability, the required probability of A’s 
winning, (say), P (A) is given by 
P (A)=P (i) +P (ii) + P (ati) + ... 
= P(E\)+P(E,QE,QN Ei: + P(E, NE: NE, 0 £2 0E}) + 
= P(E,) + P(E,) P(E2) P(E,) + P(Ei) P(E2) P(Ey) P(E2) P(E) + .. 
(By compound probability theorem) 
5... 31.3. 3... 3k..5..31'35 5 


= 36 36636 366 366 36°" 


_ 5/36 30 
Tle oO! 
36 6 


Example 4-46. A player tosses a coin and is'to score one point for every head 
and two points for every tail turned up. He is to play on until his score reaches or 
passes n. If p, is the chance of attaining exactly n score, show that 

Pa=> [ Pa-1 + Pa-2 1, 
and hence find the value of pn. {Delhi Univ. B.Sc. (Stat. Hons.), 1992] 
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Solution. The score n can be reached in the following two mutually exclusive 
ways: 
(i)By throwing a tail when score is (m — 2), and’ 
(ii)By throwing a head when score is (n - 1). 
. Hence by addition theorem of probability, we get 


Pa= Pi) +P (ii) = 3. Pa-2+ 5 -Pa-1= 5 (Pa-i + Pa-2) 


(#) 
To find p, explicitly, (*) may. be re-written as 
Pat + Pa 1=Pa-1 +3 Da-2 
= Pa-2+ ; Pa-3 
ie 
= Pot oP (4%) 


Since the score 2 can be obtained as 
(i)Head in first throw and head in 2nd throw, 
(ii) Tail in the first throw, we have 


pay styaatyes and obviously py = > 
Hence, from (**), we get 
1 1 1 2). d. 2k 2 
a aaa iy maa ee ae 


Multiplying all the above equations, we get 
Pr -2=(-i)""' (pi-3 


aa Gals 


Example 4-47. A coin is tossed (m+n) times, (mn). Show that the probability 


: _ A+ 2 
of at least m consecutive heads is “peat 


- [Kurukshetra Univ. M.Sc. 1990; Calcutta Univ. B.Sc.(Hons.),1986] 
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Solution. Since m >n , only one scquence of m consecutive heads is 
possible. This sequence may start either with the first toss or second toss or third 
toss, and so on, the last-one will be starting with (m + 1)th toss. 

Let E; denote the event that the sequence of m consecutive heads starts with 
ith toss. Then the required probability is 


P(E) +P (E2) +... + P (Ensi) (*) 
Now P(E) = P [Consecutive heads in first m tosses and head or tail in the rest] 
_{1 
(2 


P (E,)=P [Tail in the first toss, followed by m consecutive heads and 
head or tail in the next] 


ee ee 
~ 32] ~ geet 
In general, 


P (E,) = P [tail in the (r — 1)th trial followed by m consecutive heads 
and head or tail in the next ] 


iy” 1. 
=3(2) > anal? V = 2, 3;04N 4 1. 
Substituting in (*), 
Required probability = wit gers ot 


Example 4-48. Cards are dealt one by one from a well-shuffled pack until an 
ace appears. Show that the probability that exactly n cards are dealt before the 
first ace appears is 

4(51- n)(SO0—- n)(49—- n) 
52.51.50.49 [Delhi Univ. B.Sc. 1992] 
Solution. Let E; denote the event that an ace appears when the ith card is 
dealt. Then the required probability ‘p’ is given by 
p=P [Exactly n cards are dealt before the first ace appears] 
=P [The first ace appears at the (n + 1)th dealing] 
= P(E, N Ex NEA eee N En NEW Enai) 
= P(E) P(E.\E,) P (Es IE: ED)... 
x P (En 1 Ei AE... OE, 1) <P (Ess i IE; E20... OE,) 
.(*) 

‘Now 
= 
52 
P(RIE)== = PG&IB)=S 


P (E;)= = P@=S 
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. P (Es | EN ED= = = P (EEE) = 2 
P(E. EAB. OB.-d= Gly 
P(E, EOE... OB d= 5 Gy 
P(E NEYOE, ON... OE,-1)= sa 
P(E ME OEN..0B)= Gp 


. _— 7 4 
and) P( Ext | E; EZ... OE,) = S2- h 


,~ Hence, from (*) we get 


5 = |B xB xB xB xB xB x ye te ee 
S52 51 SO 49 48 47 °"  §2-(n - 4) 
San 50-49 = 
52~- (n— 3) S2- (n—- 2) 52-(n- 1) 52-2 


_ Gl—- n(50- n)(49- n)4 
7 52 x 51 x 50 x 49 
Exam ple 4-49. If four squares are chosen at random on a chess-board, find 
the chance ‘that théy should be in a diagonal line. 
[Delhi Univ. B.Sc. (Stat. Hons.), 1988] 
Solution. In a chess-board there are 8 x 8= 64 squares as shown in the 
following diagram. 
p_ Let us consider the number of ways in 
A which the 4 squares selected at random are 
in a diagonal line parallel to AB. Consider 
the A ABC. Number of ways in which 4 
selected squares are along the lines A, B,, 
-A3 B3, A2 B2, A, B; and AB are "Ci "Ci 
~€C,, ‘Cand *C, respectively. 
Similarly, in AABD there are an 
equal number of ways of selecting 4 
squares in a diagonal line parallel to AB. 
Hence, total number of ways in which 
the 4 selected squares are in a diagonal 
line parallel to AB are 
2 ( “Cat *Ca+ Cat Cs) + "Cy. 
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Since there is an equal number of ways in which 4 selected squares are in a 
diagonal line parallel to CD, the required number of favourable cases is given by 
2 [ 2(4Ce + °Ca + Ca + "Ca) + °Ca] 


Since 4 squares can be selected out of 64 in “C, ways, the required probability 


_ 2[2¢4Ca + Cy + Ca + 'Ca) + °Ca] 


“C; 
— (4¢14+54+154+35)+140]x4h 9 
7 64 x 63-x 62 x.61 ~ + 158844 


Example 4-50. An urn contains four tickets marked with numbers 112, 121, 
211, 222 and one ticket is drawn at random. Let Ai, (i=1,2, 3) be the event that ith 
digit of the number of the ticket drawn is 1. Discuss the independence of the events 
A,,A2 and Az. [Delhi Univ. B.Sc.(Stat. Hons.),1987; Poona Univ. B.Sc.,1986] 

Solution. We have 

P(A:) = 2 = 3 = P(42) = P(As) 


A, (Az is the event that the first two digits in the number which the selected tucket 
bears are each equal to unity and the only favourable case is ticket with number 
112. 


P(A, 0 Ag) = 7 = +4 
= Py) P(A2) 
Similarly, 
P(A. 0 As) = 7 = P(A2) P(A3) 
and P(A3 0 Ai) = 4 = P(As) P(Ai) 


Thus we conclude that ie events A;, Az and A; are pairwise independent. 
Now P(A; 9:A37VA3) = P {all the three digits in the number are 1’s) 
= P(9) 
= 0 # P(A,) P(A2) P(As) 
Hence A,,A2and A; though pairwise independent are not mutually inde- 
pendent. 
Example 4-51. Two fair dice are thrown independently. Three events A, B 
and C are defined as follows: 
A : Odd face with first dice 
B : Odd face with second dice 
C : Sum of points on two dice is odd. 
Are the events A, B and C mutually independent? 


[Deihi Univ. B.Sc. (Stat. Hons.) 1983; M.S. Baroda Univ. B.Sc.1987] 
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Solution. Since each of the two dice can show any one of the six faces 1, 2, 3, 
4,5, 6, we get: 


_3x6_1 ee 
3x6 1 Pell? : 


The sum of points on two dice will be odd if one shows odd number and the 
other shows even number. Hence favourable cases for C aze: 
(1,2), (1,4), (1,6); (4,1), (4, 3), 4,5) 
(2, 1), (2, 3), (2, 5); (5,2), (5,4), G, 6) 
(3,2), (3,4), (3,6); (6, 1), (6, 3), (6, 5) 
i.e., 18 cases in all. 


Hence P(C) = = =. 


Cases favourable to the events ANB, AQC,BOC and ANBOC are 
given below : 


Favourable cases 


ACB (1,1), (1, 3), (1, 5), (3,1), (3, 3), 3, 5), (5, 1) (5, 3) 
(5, 5), i.e., 9 in all. 

ANC (1, 2), (1, 4), (1, 6), (3, 2), (3, 4), (3, 6), (5, 2), (5, 4) 
(5,6),ie.,9in all, | 

BOC (2, 1), (4, 1), (6, 1) (2, 3), (4, 3), (6, 3), (2, 5), (4, 5), 


(6. 5), i.e., 9 in all 
ANBAC Nil, because ANB implies that sum of points on two dice is 
even and hence (ANB)QNC = 6 


P(ANB) = = z ; = P(A).P(B) 
P(AAQ = “ : = P(A) P(C) 
PBAG = = Z : = P(B) P(C) 


and P(AANBOC) = P(g) = 0# P(A) P(B) PC) ; 

Hence the events A, B and C are pairwise independent but not mutually 
independent. 

Example 4-52. Let Aj, A2,....4. be independent events and P (Ax) = pr. 
Further, let p be the probability tha; nune of the events occurs; then Show that 


pse = [Agra Univ. M.Sc., 1987] 
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Solution. We have 
p= P(ALQALN...OA,) 
n oes 3 ; n 
= [] P(A)= IT (1-P(4)) = I -p) 
i=1 i=l i=1 
(since A; ’s are independent] 


< Tle? [--l-x < &% for0S x <1 
=> Pp s exp [- E pil 
i=] 
as desired. 
Remark. We have 
l-x<e* for0<x<1 wo *) 


Proof. The inequality (*) is obvious for x = 0 and x = 1. Consider 0 < x< 1. 
Then a 


log (1~x)! = -log(1-x) 


wlxeeaeyhy le 
2 3 4 ‘ 
the expansion being valid since 0 < x <1. Further since x>0, we get from (**) 
log qa- xy’ >x 
= ~log(1-x) > x 
=> log (1-—x) < -x 
=> Il-x<e%, 


as desired. , 
Example 4-53. In the following Fig.(a) and (b) assume that the probability of 


a relay being closed is P and that @ relay is open or closed independently of any 
other. In each case fi fi nd the probability that current flows from L to R. 


5 6 F19¢Cb) 


Solution. wh zz denne the eal that the relay i, ( i = 1, 2, ..., 6) is closed. 
Let E be the event that current flows from L to R. 

In Fig. (a) the current will flow from L to R if at least one of the circuits from 
LtoR is closed. Thus for the current to flow from L to R we have the following 
favourable cases: 
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(i) A, Ag= By, (ii) Ay VAs= Bo, ? 
(iti) Ai NAN As= B;, (iv) Aa CO A3 1 A2 = Bz, 
The probability p, that current flows from L to R is given by 
Pi = P(B, UV B21 U B3 U-Ba) = XT P(B)-— 2 PBiOB)+ LT PBB OB) 


i i<j b<ypck 


— P(B, N B20 B37 Ba) (*) 
Since the relays operate independently of each other, we have 
P (B,)= P(A. QA2) = P(A)).P(A2) = p.p =p" 
P (Br) = P (Aa As) = P (As). P (As) = pp =p 
P (B3)= P (Ai) P (As) P (As) = p 
P (Bs) = P (As) P (As) P (A2) = p 
Similarly 
P(By (Ba) = P(A, CQ Az A Ag OAs) = P(A) P(A2) P(Aa) P(As) = p* 
P (Bi NB20Bs)= P(A, NAN Ax As As) =p 
and so on. Finally, PHO He in (*), We Set 
=(p'+p'+p+p —@'+ pit pi +p +p +p) 
. +(p°+p'+p'+p)-p 
=2p+2p-S5p'+2p 
In Fig.(b). Arguing as in the abové Case, the required probability p2 that the 
current flows from L to R is given by 
p2=P (E, U EU E;U Ea) 
where 
E,= Ai M Ao, E,=A3;OAd2, E;=Az, E, =A; Ag 
pr= EP (E)- E P(E; E) + © PEN EE) 
i<j i<j<k 
— P(E: NE2z2N E32 Ey) 
=(p°+p'+p+p)-(p'+p'+p'+p+p'+p) 
+(p\+p +p +p)-p" 
=pt 3p’-4p’-p'+3p’-p* 
Matching Problem. Let us have n letters corresponding to which there exist 
n envelopes bearing different addresses. Considering various letters being put in 
various envelopes, a match is said to occur if a leer goes into the right envelope. 
(Alternatively, if in a party there are n persons with n different hats, a match is 
said to occur if in the process of selecting hats at random, the ith person rightly 
gets the ith hat.) 


A match at the kth position for k=1, 2, ..., n.-Let us first consider the event 
Ax when a match occurs at the kth place. For better understanding let us put the 
envelopes bearing numbners 1, 2, ..., = mn ascending order. When Ax occurs, & th 
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letter goes to the kth envelope but (m — 1) letters can go to the remaining (n — 1) 
envelopes in (n — 1) ! ways. 
— 1)! 
Hence P (A;) = fa— Dt — , 
; n! n 
where P (A;) denotes the probability of the Ath match. It is interesting to see that 
P (Ax) does not depend on k. 

Example 4-54, (a) ‘n’ different objects 1,2, ...,n are distributed at random 
inn places marked 1, 2,...,n. Find the probability that none of the objects occupies 
the place corresponding to its number. [Calcutta Univ. B.A.(Stat.Hons.)1986; 

Delhi Univ. B.Sc.(Maths Hons.), 1990; B.Sc.(Stat.Hons.) 1988] 


(b) If n letters are randomly placed in correctly addressed envelopes prove 
that the probability that exactly r leters are placed in correct envelopes is given by 


ie as | 
a ae 1) bt r=1,2,....” 


[Bangalore Univ. B.Sc., 1987] 

Solution (Probability of no match). Let E;, (i= 1, 2, ..., n) denote the event 
that the ith object occupies the place corresponding to its number so that £, , is the 
complementary event. Then the probability ‘p’ that none of the objects occupies 
the place corresponding to its number is given by 

p= PCE, AELAE A we» En) 
1-—P {at least one of the objects occupies the place corresponding 

to its number} 

l- P(E, VUEL,UESU sae U E;) 


n n n 
1-—[ 2 P(E;) - > P(E; O.E;) + LLL P(E; NE; O Ex) - ... 


t=1 iol ij, k=l 
J<j Kick 
+ (- 1)" PE NELN...NE)] At) 


Now P(E) = . Vi 
P(E; A E;)= P(E;) P(E; | E;) 


1 ee 
ao Y ij(i</ 
P(E: VE; 0 E;) = P(E) P(E; | E)) P(Ex (Ein E) 
=o, Vik G<i<k 
and so on. Finally, 
1 1 l l 
P(E, NE,N Es n...N£,)== .= a mee 
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Substituting in (*), we get 
p=1-["c4- “Cy 1 a 1 


n nn— 1)" C3Tin-lykn-2)~ _ 


° r-l I 
7) ara 
_ he of, alle ana tb 
= 1-|1-+ hte + 
1 | 1 ae | 
=a 3rtar en tO 1) al 
mr: 
k=0 *! 
Remark. For large n, 
ee oe aes 
Pee U3 at 
= ¢' = 0-36787 
Hence the probability of at least one match is 
ee ere - 1)" 
eS oy aa! al 
=1-+ (for large n) 


/ e 

(b) [Probability of exactly r matches {r < (nm — 2) }} Let Aj, (é= 1, 2,..., n) 
denote the event that ith letter goes to the correct envelope. Then the probability 
that none of the n letters goes to the correct.envelope is 


n 
P(A, NAN... NA, = © (- 17K! (**) [(cf. part (a)] 
k=0 
The probability that each of the ‘r’ letters is in the right envelope is 
and the probability that none of the remaining 


n(n—1)(n—-2)...(n-—r+1)’ 
(n — r) letters goes in the correct envelope is obtained by replacing n by (mn —r) in 


Aa-r = | y 
(+*) and ts thus given by 2 co “ae Hence by compound probability theorem, 
k=0 *: 


the probability that out of n letters exactly r letters go to correct envelopes, (in a 
specified order), is 
a-r k 
an ae Dy aa ; rsn-2., 
n(n-1)(n-2)...@-—r+ Dj ,29 &! 
Since r letters can go to n envelopes in *C, mutually exclusive ways, the 


required probability of exactly r letters going to correct envelopes, (in any order, 
whatsoever), is given by 
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1 se Oe | Ae pasa 
AGG 2)a. Gere) oo RL rlqce 

Example 4-55. Each of the n urns contains ‘a’ white balls and ‘b’ black balls. 
One ball is transferred from the first urn to the second, then one ball from the latter 
into the third, and so on. If px is the probability of drawing a white ball from the 
kth urn, Show that 


"C,X 


_ _ ati 
Prev Te ba tt oo 
Hence for the last urn, prove that 
a 
P»= a+b [Punjab Univ. B.Sc.(Maths Hons.),1988] 
Solution. The event of drawing a white ball from the &th urn can, materialise 
in the following two ways: 
(i) The ball transferred from the (k — 1)th urn is white and then a white ball is 
drawn from the kth urn. 
(ii) The ball transferred from the (K — 1)th urn is black and then a white ball is 
drawn from the kth urn. 


(1 — pr) 


a+l 
a+b+1’ 
since the probability of drawing a white ball from the (k— 1)th um is pe-, and 
then the probability of drawing white ball from the Ath urn is 
a+1 
at+b+1- 
Since the probability of drawing a black ball from thé (k- 1)th um is 
[1 — pe-1] and then the probability of drawing a white ball from the Ath urn is 
a 

a+b+1’ 

the probability of case (ii) is given by 


are UP 


Since the cases (i) and (ii) are mutually exclusive, we have by addition theorem 
of probability 


The probability of case (i) is pr-1 X 


_ 


a+1 


a 
Pe obo tae pe (4) 
a 
Pe abe ober l = 
Replacing k by k+ 1.in (*) we get the required result. 
Changing ktok-1,k=-2,... and so on, we get 
Dewi = = pPr-2+———— ..(2) 
a+b+1 a+b+1 
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Pr-2 = (3) 


oe 
1 eo ay b+ 1 


at+b+ 


(k=) 


But p: = Probability of drawing a white ball from the first urn = a 


, 3) by rics } wy and (k= Vth 


Po tbe tosh sl 


° . _ 1 
Multiplying (1) by 1, (2) by aa bal 


-2 
equation by } and adding, we get 


a+b+1 


5 . +————| 1+ ee 
Pr at+b+1} " a4+b41 a+b+1 (a+b+1y 7" 
l ie 
iessrstl | 
1 k-1 
( 1 y a a abrrese 


a+b+1) ~(@+b) atb+l fal 
at+b+1 


a a “acth-(ota “| 
a+b\a+b+1 a+b a+b+1 
“illetal’ l-Getal 
~ at+bllatb+! a+ob+1 

a 


Since the probability of drawing a white ball from the kth urn is independent 
of k, wechave 


i 


Example 4-56. (i) Let the probability pn that a family has exactly n children 


be a p"whenn>landp.=1-ap(l+p+p' +... Suppose that all sex distribu- 
tions of n children have thé same probability. Show that for k = 1, the probability 
that a family contains exactly k boys is 2.0. p'/(2 -p)"*'. 

(ii) Given that a family includes at least one boy, show that the probability 


that there are two or more boys is p/(2 - p). 
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Solution. We are given 
Pp, = P (thata family has exactly n children] 

ap", n2 1, 

and po = l-ap(h #p+p t+...) 

Let E; be the event that the number of children in 4 family is j and let A be 
the event that a family contains exactly k boys. Then 

P(E) = p,s J=9,1,2,. 

Now, since cach child can have any of the two sex x distributions (either boy or 
girl), the total number of possible distributions for a‘family to have ‘/’ children 
is 2’. 


C, 
P(A|E) = —,j/2k 
(A| Ej) ad 
eo : 6o 
and = P(A) = & P(E) P(ALE) = © pyP(AlED 
j=k j=k 
oo |i # 
_ al S| jzkel 
j=k 2 
ee ee 
na (| ' Wee 7 
oo p +7 R “y 
=0.5.°°'C, ( : (Putj-—k=r] 
r=G . 
al ae (5) si ed 
=ai-7= » C; os i C,= Cue) 
GF] Be (5 
We know that : me : 
G: = (- 4? 2G, = (- 1)-. wee neitad Bo 
* re (- 1V 7 #4, a sd BP Z 
Hence ; 
P(A)=a >» en @re.[5 | 
r=(0 2 


~ (k+ NC. (=2) 
0 ae 


p.y¥@ru 1 
i 


i 2 ap" 
(2=p)'*! © (Qh py! 


tt 
Q 


I 
R 
a a 


HIS wis Wis 


pe a i 
> | 
St 04 8 


NIS 


“| 
(b) Let B denote the event that a family includes at least one’boy and C denote 
the event that a family has two or more boys. Then 
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P(B) = 5 P (family has exactly k boys] 
k=1 


5 _ 20 7 ( p } 
1(2-p)" —P p=) \2-P 
p/(2—- p) Op 


* T-[p/2-pl  G—p) 2-p) 


[family has exactly boys] 


=z 2 op" RAC? 
2(2-p)*'  2-Pp ga \2-P 
“~‘ _b/2-py _ ae’ 
2>p°1-[p/(2-p)] “(2-p)’(1-p) 
Since C CBandBNC=C,P(BNC)=P(C) => P(B)P(C|B)=P(C) 
- Therefore, 


tl 
Pg 
ye 


to 
R 


Se) 
—— 


a) 
3. 
1 
gums. 


P(C) 0. p” (l-p)(2-p)_ p 
~ P(B) (2-p)*(1-p) Op =p 
Example 4-57. A slip of paper is given to person,A who marks it either with 
a plus sign or a minus sign; the probability of his writing a plus sign is 1/3. A passes 
the slip to B, who may either leave it alone or Change the sign before passing it to 
C. Next C passes the slip to D after perhaps changing the sign. Finally D passes it 
to a referee after perhaps changing the sign. The referee sees a plus sign on the 
slip. It is known that B, C and D each change the sign with probability 2/3. Find 
the probability that A originally wrote a plus. 
Solution. Let us define the following events: 
E,: A wrotea plus sign; E,: A wrote a minus sign 
E : The referee observes a plus sign on the slip. 


We are given: P (E,) = 1/3, P (E:)= 1-1/3 =2/3 
We want P (E, | E), which by Bayes'rule ts given:by : 


P(E1) P(E Ei) 
PEE) = 5B) PELE) +P (Ex) P ELE) ou 
P (E| E;) =P [Refetee observes the plus sign given that ‘A’ wrote 
the plus sign an the slip] 
= P [(Plus sign was not changed at all) U (Plus sign was 
changed exactly twice’in.passing from ‘A’ to referee 
through B, C and D )] 
=P (Es U E.) , (Say). 
= P (£3) + P (Ea), . ...(ii) 
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Let A;, Aj and A; respectively denote the events that B, C and D change the 
sign on the slip. Then we are given 
P (A) = P (Az) =P (Az) =2/3 3 P (Ai) =P (A2) =P (As) = 1/3 
We have ; 
P (Es) = P (A. Ar As) =P (Ai) P (Aa) P (As) = (1/3) = 1/27 
P (Es) = P [(Ai Az As) U (Ai Az As) U (Ai Az Aa) * 
— wad (Ai A2A3) +P (Ai A2 A3) + P (Ai A2A3) 
= P (A) P (Az) P (As) + P (Ai) P (A2) P (A3) + P (Ai) P (A2) P (As) 
2. 2 “1 ee 1 2 122 4 
ao a a Wa ae a ie el a a 
SubsQtuting in (ii) we 


P(E|E,)= xt +5 = ...(tii) 
Similarly, 
P (E |:E,) = P [Referee observes the plus sign given that ‘A’ wrote minus 
sign on the slip] 


= P [(Minus sign was changed exactly.once) 
U (Minus sign was changed thrice)] 
= P (Es U Eg), (say), 
= P (Es) + P (Es) ...(iv) 
P (Es) =P [(Ai A2 As) UV (Ai A2 As) U (Ai A2 A3)] 
= P (A:) P’(A2) P (As) + P (A1) P (A2) P (As) + P (Ai) P'(A2) P (As) 
2: iL... be kh. oe 8 


—_ — = — died 


“3 Reg 3°33 3°34" 6 


222 = 8 
P (Ec) = P (A; A2A3) = P (Aj) P (A2) P (As) = 3°3°3 °° 27 
Substituting in (iv) we get: 
2,8 _14 
P(E |E,) = 9 + oq = 97° ...(V) 
Substituting from (iii) and (v) in (i) we get: 
113 
= SoS Sopeee (eee 
CAVE) 113) 2.14 13 + 28 ~ 4i 
2° OT 3 . 


Example 4-58. Three urns of the same appearance have the following 
proportion of balls. 
‘First urn : 2 black I white 
SecondUrn : 1 black 2 white 
Third urn : 2 black 2 white 
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One of the urns is.selected and oné ball is drawn. It turns out to be white. What 
is the probability of drawing a white ball again, the first one not having been 
returned? 

Solution. Let us define the events: 

E;= The ‘event of sélection of ith urn, (i = 1,2,3) 
and A= The event of drawing a white ball. 
Then 
P(E\) = P(E2) = P(Es) = 1/3 | 

and =P(A/E,) = 1/3, P(A|E2) = 2/3 and P (A|E;3) = 1/2 

Let C denote the future event of drawing another white ball from the urns. 
Then 

P(C|E: A) = le) = Y,andP(C|E:NA) = 


: : P(E) P(A|E) P(C\E;NA) 


P(C{A) = = 


MISCELLANEOUS EXERCISE ON CHAPTER IV 


1. Probabilities of occurrence of n independent events £,, E2, ..., E, are pi, 
P2, .., Pa Tespectively. Find the probability of occurrence of the compound event in 
which Ei, E>,..., £, occur and E,41, E,+2, ..., E, do not occur. 
r Ar 
Ans. Il px II (1-p) 
i=] i=r+l 
2. Prove ie for any integer m 2 a 


(a) P ( A Ai) $ P(A) SP( U 3 A) S EP (A) 
i=1 i=] i=] 
m m 
(5) P( Br x P (Ai) 
t=]. i= 
3. Establish the inequalities : 
P(ANBOC) s P(ANB) S$ P(AUB) s P(AUBUC) s P(A) + P(B) + P(C) 
4, Let Ai, A2,...,A, be mutually independent events with P-(A,)= pr, 
k=1,2,...,” 
Let p be the probability that none of the events Aj, Aa, ..., Aa-occurs. Show 
that 


n 
p= I 


n 
(1-p) < exp |- E pr| 
k k=1 


l 
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Usc the above rclation to compute the probability that in six tosses of a fair dic, no 
"aces are obtained". Compare this with the upper bound given above. Show that if 
each p, is small compared, with 7, the upper bound is a good approximauon. 

5, Aand B play a match, the winner being the one who first wins two games 
in succession, no games being drawn. Their respective chances of winning a 
particular game are-p : q. Find 

° (i) A’s initial chance of winning. 
(ii) A’s chance of winning after having won the first game. 

6. A carpenter has a tool chest with two compartments, each one having a 
lock. He has two keys for each lock, and he keeps all four keys in thé ‘same ring. 
His habitual procedure in opening 4 compartment is to select a key at random and. 
try it. If it fails, hé selects one of the remaining three and tries it and so on. Show 
that the probability that he succeeds on the first, second and third try is 1/2,1/3, F/6 
respectively. (Lucknow Univ. B.Sc., 1990) 

7. Three players A, B and C agrce to play a series of games observing the 
following rules : two players participate in each game, while third is idle, and the 
game is to be won by one of them. The loser in each game quits and his place in 
the next game is taken by the player who was idle. The player who succeeds in 
winning over both of his opponents without interruption wins the whole series of 
games. . - 

Supposing the probability for each player to win a single game is 1/2, and that 
the first game is played by A and B, find the probability for A, B and C respectively 
to win the whole series if the number of games is unlimited. 

Ans. 5/14, 5/14, 2/7 , 

8. In-a certain group of mathematicians, 60 per cent have insufficient back- 
ground of modern Algebra, 50 per cent have inadequate knowledge of Mathemati- 
cal Statistics and 80 per cent are in either one or both of the two categories. What 
is the percentage of people who know Mathematical Statisti¢s aniong those who 
have a sufficient background of Modern Algebra? (Ans. 0:50) 

9. (a) If A has (n+ 1) and B has n fair coins, which they flip, show that 


the probability that A gets more heads than B is > 


(b) A student-is given a column of 10 dates and column-of 10 events and is 
asked to match the correct date to each event. He is not allowed to use ariy item 
more than once. Consider the case where the student knows. how to match four of 
the items but he is very doubtful of the remaining six. Hé.decides to:match these 
at random. Find the probabilities that.he will correctly match (i) all the items, 
(ti) at-least seven of the items, ‘and (iii) at least five. 

Ans, (ae (b) a (c) 1 

10. An astrologer claims that he can predict before birth the sex of a baby just 
to be born. Suppose that the astrologer has no real power but he tosses a coin just 
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once before every birth and if the head turns up he predicts a boy for that birth and 
if the tail tums up he predicts a girl. Let p be the probability of the event that‘at a 
certain birth a male child is born, and p’ the probability of a head turing up.in a 
single toss with astrologer’s coin. Find the probability of a correct ‘prediction and 
that of at least one correct prediction in‘n predictions. 

11. From a pack of 52 cards an even number of cards is drawn. Show that the 
probability of half of these cards being red is: 

{52 !/(26 !)? - 1] / (27-1) 

12. A sportsman’s chance of shooting an animal at a distance r (>a) is 
a’/r’.. He fires when r= 2a, and if he misses he reloads and fires when 
r = 3a, 4a... If he misses at distance na, the animal escapes. Find. the odds against 
the sportsman. 

Ans. n+1lin-1 

2 
Hint. P (Sportsman shoots at a distance ia] = —_ 7 = 
(iay i 
=> P [Sportsman misses the shot at a distance ia] = 1 - = 
i 


n n ° e 3 
P {Animal escapes] = [1 [1-73] I rie | 
i=2 é i=2 é 


n ° ‘ n ° 1 
= ex i ewer 
ie2\ ! Jis2i ! 2n 


Requiréd ratio = seal : (aera (a+ 1): (a-—1) 


2n 

13. (a) Pataudi, the captain of the Indian team, is reported to have observed 
the rule of calling ‘heads’ every time the toss was made during the five'matches of 
the Test series with the Australian team. What is the probability of his winning the 
toss in all the five matches? 

Ans. (1/2)° 

How will the probability be affected if x 

(i) he had made a rule of tossing a coin privately to decide whether to call 
"heads" or “tails” on each:occasion. 

(ii) the factors determining his choice were not predetermined -but he called 
Out whatever occurred to him on the spur of the moment? 

(b) A lot contains 50 defective and 50 non-defective bulbs. Two bulbs are 
drawn at random one at atime, -with replacement. The events A, B.C are 
defined as 

A= (The first bulb is defective) 
B= {The second bulb is non-defective) 
C'= {The two bulbs are both defective or both non-defective } 
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Determine whether 

(i)A, B,C are pairwise independent, , 

(ii)A. B,C are independent. 

14. A,B andC are three urns which contain 2 white, I black, 3 white, 2 black 
and 2 white and 2 black balls, respectively. One ball is drawn from um A and put 
into the urn B; then a ball is drawn from urn B and put into the urn C. Then a ball 
is drawn from um C. Find the probability that the ball drawn is white. 

Ans. 4/15. 

15. An um contains a white and b black balls and a series of drawings of one 
ball ata time is made, the ball removed being retrurned to the urn immediately after 
the next drawing is made. If pz denotes the probability that the nth ball drawn is 
black, show that 

Pr=(b—pa-1)/ (@+b-1). 

Hence find p, . 

16. A person is to be tested to see whether he can differentiate between the 
taste of two brands of cigarettes. If he cannot differentiate, it is assumed that the 
probability is one-half that he will identify a cigarette correctly: Under which of 
the following two procedures is there less chance that he will make all correct 
identifications when he actually cannot differentiate between the two brands? 

(i) The subject is given four pairs each containing both, brands-of cigarettes 
(this is known to the subject), he must identify for each pair which cigarette 
represents each-brand. 

(ii) The subject is given eight cigarettes and is told that the first four are of one 
brand and the last four of the other brand. 

‘How do you explain the difference in results despite the fact that eight 
Cigarettes are tested in each case? 

Ans. (i) 1/16 (ii) 1/2 5 

17. (Sampling with replacement). A sample of size r is taken from a 
population of n people. Find the probability U, that N given people will be included 
in.the: sample. é 


N ‘ 
Ans, U,= & (- 1)" ite 
m=0 m n 


18. Ina lottery m tickest are drawn at a time out of the total number of n 
tickets, and returned before the next drawing is made. Show that the chance that in 
k drawings, each of.the numbers. 1, 2, 3, ..., n will appear at'least once is given by 


moe Oa] 


[Nagpur Univ. M.Sc. 1987] 
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19.In a certain book of N pages, no page contains more than four errors, n, 
of them contain one error, n2 contain two errors, n3 contain three errors and ng 
contain four errors. Fwo copies of the book are opened at any two given pages. 
Show the probability that the number of errors in these two pages shall notexceed 
five is 


1 <4 (n3* + ny’ + 2n2 Ng + 2n3 Na) 
N 


Hint. Let E;1 : the event that a page of first book contains ierrors. . 
and E; II : the event that a page of second book contains / errors. 
P (No. of errors in the two pages shall not exceed 5) 
= 1-P (FE, IE W+E; 1B, W+b1 E,W 
+E; 1E; N+E,L Es 1+ &,1£. 10), 
20. (a) Of three independent events, the chance that the first only should 
happens i is a, the chance of the, second only is 6 and the chance of the third only 
is c. Show that the independent chances of the three events are réspectively ; 


Se ee 


' -  “@tx''b+x' cHx 
where x is the root of the equation 


(a+x)(b+x)(c+x =x’ 


Hint. P(E: VN Ex £3) =P (Ei) U- P(E) (1-P€s)J=a  * A(*) 
P (Ei VE, 9 Es) = (1 — P (E,)] P (E2) (1 - P (E)]=6 (#9) 
P(A NEE) =(1-P(E){i-P(E)IP(Es)=c | (#4) 


Multiplying (*),(**) and (t**), we get 
P (E;) P (Ex) P (Es) x? = abc, 
where. x= [1 — P (E,)] [1 — P (E,)] {1 —P (Es)] 
Multiplying (*) by [1 — P (E,)],-we get 
P(E,)= ae , and so on. 


(b) Of three independent events, the probability that the first only should 
happens is 1/4, the probability that the Second only should happen is 1/8, and the 
probability that the third only should happen is 1/12. Obtain the unconditional 
probabilities of the three events. 

Ans. 1/2, 1/3, 1/4. 

(c) A total of n shells are fired ata target. The probability of the ith shell hitting 
the target is pi;i=1,2,3,..,.n. Assuming that the » firings are n mutually 
independent events, find the probability that.at least two shells out of n hit the 
target. ; [Calcutta Univ. B.Sc.(Maths Hons.), 1988] 

(d) An um contains M balls numbered 1 to M, where the first K balls are 
defective and the remaining M-~ Ki are non-defective. A sample, of n balls is 
drawn from the um. Let A, be the event that the sample of'n balls contains exactly 
k defectives. Find P(A.) when the sample is drawn (i ) with replacement and, 
(ii) without replacement. [Delhi Univ. B.Sc. (Maths Hons.), 1989] 


‘theory of Probability 4.109 


21. For three independent events A, B and C, the probability for A to occur-is 
a, the probability that A, B and C will not occur is b, and the probability that at least 
one of the three events will not occur ts c. If p denotes the probability that C occurs 
but neither A nor B occurs, prove that p satisfies the quadratic equation 
ap’ + [ab - (1-a) (a+c-1)] p+b(1—-a)(1-c)=0 
2 
and hence deduce that c > it) Bal a - 

Further.show: that the probability of occurrence of:C is p/(p + 6), and that of 
B’s happening is (1 —c) (p + 6)/ap. 

Hint. Let P (A)=x, P (B)=y and P(C)=+z 

Then - x=a, (1-x)(1l-y)(1-z)=6, k-xyz=c 
and nats p=z(1— x) (1-y) 

Elimination of x, -y and z gives quadratic equation in p. 

22. (a) The chance of success in each trial is p. If pi is the probability ‘that 
there are even number of successes in k trials, prove that 

le Ge es (1 - 2p) 

Deduce that pr=4[1+(1- 2p)" ] 

(b) If a day is dry, the conditional probability that the following day will also 
be dry is Pi if.a day is wet, the conditional probability that the following day will 
be dry is p’. If u is the probability that the nth day will be dry, prove that 

—(p-p’)tm-1-p'=0 5 n22 

If the first day is dry, p= 3/4 and p’ = 1/4, find u . 

23. There are n similar biased dice such that the probability of obtaining a 6 
with each one of them is the same and equal to p. If all the dice are rolled once, 
show that p,, the probability that an odd number of 6’s is obtained satisfies the 
difference equation 

Pa + (2p ~ 1) pa-r=p ° 
and hence deriye’an explicit expression for px. 
| Ans. py =} [I'# (1 - 2p)" 

24. Suppose that each day the weather.can be uniquely classified as ‘fine’ or 
‘bad’. Suppose further that the probability of having fine weather on the last day 
of a certain yéar is Po and we have the probability p'that the weather on an arbitrary 
day will be.of-the same kind as On the preceding day. Let the probability of having 
fine weather on ai nth day of the following year t be Ps Show that 

Py= (2p -1) Py-1+(1—p) 


Deduce that : 
~(in-wip._l\,1 © 
P33 (2p 1) (Pr 3 )*3 P , 


25. A closet contains n pairs of shocs. If 2r shoes are chosen at random 
(with 2r <n), what is the probability. that' there will be (i)'no complete pair, 
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(ii) exactly one complete pair, (iii) exactly two complete pairs among them? 
Hint. (i) P(no complete pair)=[ i }o a ( 7 


2r 
ks oy n-f}t w-2 2n * 
(ii) P(exactly one complete pair)= n ( a) p : ( Dy 
- : n\( n-2 \aw-4 , ( 2n 
and (iti) P(exactly two complete pairs)=( , I or 4 }2 ; ( ry 


.26. Show.that the probability of getting no right pair out of n, when the left 
foot shoes are paired randomly with the ngth foot shoes; is the sum of the first 
(n + 1) terms in the expansion of e7’. 

27. (2) Ina town consisting of (m + 1) inhabitants, a person narrates a rumour 
to a second person, who in tum narrates it to a third person, and so on. At each step 
the recipient of the rumour is chosen at random from the n available persons, 
excluding the narrator himself. Find the probability. that the rumour will. be told r 
times without: 

(i) retuming to the originator, 
(ii) being narrated to any person more than once. 

(6b) Do the above problem when, at each step the rumour is told by one person 
to a gathering of NV randomly chosen people. 


-1 ° 
Ans. (a) () (1--] pte ee) ere) 


(+) | 
gi 


n 
N -1 
wa (-4] 5 w 
x | 

28. What is the probability that (i) the birthdays of twelve people will fall 
in twelve different calendar months (assume equal probabilities for the twelve 
months) and (ii) the birthdays of six people will fall in exactly two calendar months? 

Hint, (i) The birthday of the first person, for instance, can fall in 12 different 
ways and so for the second, and so on. 

-. The total number of cases= 12’. 

Now there are 12 months in which the birthday of one person can fall and 11 
months in which the birthday of the second person can fall and 10° months for 
another third person, and so on. 


.. The total number of favourable cases = 12.11.10...3.2.1 


Hence the required probability = wt 
12"? 


(ii) The total number of ways in which the birthdays of 6 persons can fall in 
any of the month = 12°. 
12 
ees) 


The-required probability = 95 
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29. An elevator starts with 7 passengers and’ stops at 10 floors. What is the 

probability p that no two passengers leave at the samé floor? 
[Delhi Univ. M.C.A., 1988] 

30. A bridge player knows that his two opporients have exactly five hearts 
between two of them. Each opponent has thirteen cards. What is the probability 
that there is thréé-two split on the hearts (that is one player has three hearts and the 
other two)? [Delhi Univ. B.Sc.(Maths Hons.), 1988] 

31. An um contains 2 white and 2 black balls. A ball is drawn at random. If 
itis white, it is riot replaced into thé um. Otherwise it is replaced along with another 
ball of ihe same colour. The process is repeated. Find the probability that‘the third 
ball drawn is black. [Burdwan Univ. B.Sc. (Hons.), 1990] 
23 
30 

32. There is a series of n urns. In the ith: um there are i ‘white and (n — i) 
black balls, i='1, 2, 3, ...,&. One um is chosen at random and 2 balls are drawn 
from it. Both turn out to be white. What is the probability that the jth urn was 
chosen, where j is a particular number between 3 and n. 

Hint. Let E; denote the event of selection of jth um, j=3,4,...,2 and A 
denote the event of drawing of 2 white balls, then 


paaley=(1 (451 r]; oe "yi pay= (SS) 


P (E;|A)=— = 


(2) 


_33. There are (N+ 1) identical urns marked 0, 1,2, ...,N each of which 
contains N white and red balls. The kth urn contains k red and N — k white balls, 
(k=0; 1,2, ... N). Anum is chosen at random and n random drawings of a ball are 
made from it, the ball drawn being replaced after cach draw. If the balls drawn are 
all red, show that the probability that the next drawing will also yield a red ball is 
approximately (n + 1) (n + 2) when N 1s large. 

34. A printing machine can print n letters, say 0, O2, ..., &, . It is operated 
by electrical impulses, each letter being prodyced by a diffezent impulse. Assume 
that p is the constant probability of printing the correct letter and:the impulses are 
independent. One of the n-impulses, chosen at random, was fed into-the machine 
twice and both times the letter a1 was printed. Compute the-probability that the 
impulse chosen was meant to print ). [Delhi Univ. M.Sc.(Stat.), i981] 

Ans. (n— 1) p’/(np’-2p +1) 

35. Two players A and B. agree to contest a match consisting. of a-Series of 
games, the. match to.be won by the player who first wins three games, ‘with the 
Provision that if the players win two games each, the-match is to continue until it 


Ans, 
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is won by one playcr winning two games more than his opponent. The probabililty 
of A winning any given game is p, and the games cannot be drawn. 

(i) Prove that f (p), ine initial probabuty of A DINE the match is given by: 

f(p) =p (4 — Sp + 2p’)/(t — 2p +-2p’) 

(ii) Show that the equation f(p).= p has five real roots, of which three are 

admissible values of p. Find these three-roots and explain their significance. 
[Civil Services (Main), 1986] 

36. Two. players A and B start playing a series of games with Rs. a and b 
respectively. The stake is Re. 1 on a game and no game can be drawn. If the 
probability of A winning any game is a constant p, find the initial probability of 
this exhausting the funds of B or his own. Also show that if the resources of B 
are unlimited then 

(i) A is certain to be ruined if p= 1% , and 

(ii) A has an even chance of escaping ruin if p=2'°/(1+2'”’). 

Hint. Let u, be the probability of A’s final win when he has Rs, 7. 

Then Un = Pune + (1 — p)a-i Where u,=0- and: ta+5= 1 

Un +1 ~u=(4=2 ](m— meet ) 
P JS 


XN 


Hence Un +1 — Un = orn J u,, by repeated application, 


eo weal (SA) / [1-737] 


Hence using 4+. = 1, w=fr-(452)]/ (oer a 


.. Initial probability of A’s win is us = ta p’ 


Probability.of A’s ruin = 1 — i, . 

For p=”’,u= 
if p=2°°/(1+2'”). 

37. In a game of skill a player has probability 1/3, 5/12 and 1/4 of scoring 0; 
l-and 2 points respectively at each trial, the game terminating on the first realization 


of a zero score at a trial, Assuming that-the trials are independent, prove that the 
probability of the,player obtaining a total score of npointsis  _' 


2/3) 4717 = 
“"13(4 )730(°3 J» 
Hint. Event can materialize in-the two mutually éxclusive ways:: 


(i) at the (n — 1)th. trial, a score of (n — 1). points, is.obtained and. a:score of 1 
point is obtained. at the nthtnal. a 


a 
+b 
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(ii) at the (n — 2)th trial, a score of (n — 2) points is obtained and a score of 2 
points is obtained at the last two trials. 


5 1 1 1 § DD: 
Hence ty # = Uy 1+ ZUn-2 Where Uo= 3, li = 3° 36 
1 ] 
Also Un = & 3 |e p+] Un-2 > ta ty Mann = 3 Maca +5 M2 


This equation can be solved as a homogeneous difference equation of second 

order with the initial conditions 
a 3 
Uo = = 

38. The following weather forecasting i is used by an amateur forecaster. Each 
day is classified as ‘dry’ or “wet’ and the probability that any given day is same as 
the preceding one is assumed to be a constant p, (0 < p < 1). Based on past records, 
itis supposed that January 1 has a probability B of being dry. Letting 

B.= Probability that nth day‘of the year is dry, obtain an expression for B, in 
terms of B and p. Also evaluate lim- B,. 


Hint. B. = p.Ba-1 + (1-p) (1-B.- 
=> B. = (2p- 1) B.-1 + (1-p) ; n= 2,3,4,... 
Ans. B, = (2p-1)""""B-’%) +h; lim B=” 


Rh —> co 

39. Two ums contain respectively ‘a white and b black’ ‘and ‘b white and a 
black’ balls, A series of drawings is made according to the following:rules: 

(i) Each me only one:ball is drawn and immediately retummed to the same um 
itcame from. : 

(ii) If the ball drawn is white, the next drawing is made from the first urn. 

(iii) If it is black, the next drawing is made from the second um. 

(iv) The first ball drawn comes from the first.umn. 

What is the probability that nth ball drawn will be white? 

Hint. p, = P [Drawing a white ball at the rth draw]. 


; " 6 
<a rate ue et ae 


b 
” = ae Prittyp - 
: l a-b : 
A ° Dp =- = —~ Oa i 
7 Png.” (3 +b | 
40. If a coinmis tossed repeatedly; show that the probability of getting nt heads 
before n tails is : : ; 
1 m+n-1 
prast > mtan- C. Sf 


i=m -_—« [Burdwan Univ. (Maths Hons.), 1991] 
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OBJECTIVE TYPE QUESTIONS 
I. Find out the correct answer from group Y for each item ‘of group X* 


Group X Group Y 
(a) At least one of the events A or B (i) AANB)U(ANB)U(ANB) 
occurs. 
(b) Neither A nor B occurs. (ii) (AU B)-(ANB) 
(c) Exactly one ofthe eventsAorB (iii) ACB 
Occurs. (iv) BCA 
(d) Ifevent A occurs, so does B. (vy) [A-(AQB)| VU [B-(ANB)] 
(e) Not more than one of the eventsA (vi) ANB 
or B occur: (vii) H-(AUB) 
(viii) AUB 
(ix) 1-(AU-B) 
II. Match the correct expression of probabilities on the left : 
(a) P (o),‘where $-is null set (i) 1—P (A) 
(b) P(A|B)P (B) (ii) P(AQNB) 
(c) P(A) (itt) P(A)-—P(ANB) 
(d) P(AXNB) (iv) 0 
(e) P(A~B) (v) 1-P (A)-P(B)+P (ANB) 


(vi) P(A)+P(B)-—P(ANB). 
aI. Given that A, B:aridsC are mutually exclusive events, explain why the 
following are‘not. permissible assignments of probabilities: 
: (i) P(A)=0-24, P (B)=0-4 and P (AU C)=0.2 
(ii) P (A)=0-4, P (8)=0-61 
(iti) P(A)=0-6, P (ANB) =0-5 
IV. In eachof the following, indicate whether events.A and B are : 
(i) independent, (ii) mutually..exclusive, (iii) dependent but not mutually ex- 
Clusive. 
(a) P(ANB) = 0 (b) P(A GB) = 03, P(A) = 0-45 
(c) P(AUB) = 085, P(A) = 0-3, P(B) = 06 
(d) P(AUB) =‘0-70, P(A) = 0-5, P(B) = 0-4 
(e) P(AUB) = 0-90, P(A|B) = 0-8, P(B) = 0S. 
V. Give the correct label as dnsWer like a or b'etc., for the following 
questions: 
(i) The probability of drawing any one spade card froin a pack of cards is 
1 1 4 ' “1 
(a) = (b) 55 (C) 35 (d) 7 
(ii) The.probability of drawing one ‘white ball from a’bag:containing.6 red, 
8 black, 10 yellow and 1 green balls is 
. @e (0 (1 Me (ee 
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(iii) A coin is tossed three times in succession, the number of sample points 
in sample space is 
(a) 6 (b) 8 (c) 3 
(iv) In the simultaneous tossing of two perfect coins, the probability. of 
having at least one head is 


(a)> (b)4  (cy2 (dd) 1 


(v) In ie Geuiudon tossing of two perfect dice, the probability of 
obtaining 4 as oe sum of the resultant cies 1S 
1 2 
(4) 75 r (6) 12 () 5 : (d) 12 
(vi) A single letter is selected at random from the word ‘probability’. The 
probability that it is a vowel is . 


(aj (b)% (c) + (a) 0 
(vii) An um contains 9 balls, two of which are red, three blue and four 


black. Three balls are drawn at random. The chance that they are of the same 
colour is 


@y> (bs (ye (as 
(viii) A number is nea at ne among the first 120 natural numbers. 
The brome of the number seh being a multiple of 5 or 15 is 


(at (be (cE 


(ix) If A-and B are mutually exclusive events, then 
(a) P (AU B)=P (A) .P (B) 
(b) P(AUB)=P (A)+P (B),, (c) P(AUB)=0. 
(x) If A ”_ B are two independent events,. .the ‘obsess ‘that both A 
and B occur is = + and the probability that neither of them occurs is. 2. The prob- 


ability of the occurrence of A is: : 
(a)>, (bx, (ed Gs (dg 
VI. Fill in the blanks : 


‘(i) Two events are gaid-to be equally likely if ...... 
(ii) A setof events is said to be independent if ...... 
(iti) If P(A) .P(B).P(C)=P(ANB NC), then the events A, B,:C are ...... 
(iv) Two events-A ‘and B are mutually Exclusive if P (A 7-:B)=... and are 
independent if P (A 7 B)= 
(v) The probability of getting a multiple of 2 in a throw of a dice is 1/2 and 
of getting a multiple of 3 is 1/3. Hence probability of getting a multiple of 2 or 3 


(vi) Let A and B be independent events and suppose the event C has prob- 
ability 0 or 1. Then A, B and C are...... events. 


(vii) If A,B,C are pairwise independent and A is independent of BUC, 
then A, B, C are ...... independent. 
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(viii) A man has tossed 2 fair dice. The conditional probability that he has 
tossed two sixes, given that he has tossed at least one six Is ...... 
(ix) Let A and B be twoevents such that P (A) =0-3 and P A U B)=0:8, 
If A and: B are independent events then P (B) = 
VII. Each of following statements ts either true or false. If it is true prove it, 
otherwise, give a counter example to show that it is false. 
(i) The probability of occurrence of at least one of two events js the sum 
of the probability of each of the two events. 
(ii) Mutually exclusive events are independent. 
(iii) For any two events A and B, P (A 7 B) cannot be less than either P (A) 
or P (B ). a 
(iv) The conditional probability of A given B is always greater than P (A). 
(v) If the occurrence of an event A implies the occurrence of another event 
B then P (A) cannot excced P (B). 
(vi) For any twoeventsA and B, P (A UB) cannot-be greater then either 
P (A) or P (B). 
(vii) Mutually exclusive events are not independent. 
(viii) Pairwise independence:does not necessarily imply mutual independ- 
ence. 
(ix) Let A and B be events neither of which, has probability zero. Then if.A 
and B are disjoint, A and B are independent. 
(x) The probability of any event is always a proper fraction. 
(xi) If O< P(B)< 1} so that P (A | B) and P(A 1B) are both defined, then 
P (A) =P (B) P (A|B) +P @)P (AB). 
(xii) For.two events A and B if 
P (A) =P-(A |B) =1/4 and P(A] BY= 1/2, then 
(a) A and B are mutually exclusive. 
“(b) A and B are independent. 
(c) Ais asub-event of B. 
(d) P (A|B)=3/4. [Delhi Univ. B.Sc.(Stat, Hois.), 1992] 
(xii) Two events can be independent and mutually exclusive simultaneously. 
(xiv) Let A and B be events, neither of which has probability zero. Prove or 
disprove the following : 
(a) If A and B are disjoint, A-and B are independent. 
(b) lf A and B are independent, A and B are disjoint. 
(xv) If P (A)=0, then A=6. 


t ( 


CHAPTER FIVE 
Random Variables — Distribution Functions 


5-1, Random Variable. Intuitively by a random variable (r.v) we mean a 
real number X connected with the outcome of a random experiment E. For 
example, if E consists of two tosses of a coin, we may consider the random vartable 
which is the number of heads ( 0, 1 or 2). 


Outcome : All HT TH TT 
Value of X : 2 Il 1 0 


Thus to cach outcome @ , there corresponds areal number X (w) . Since the 
points of the sample space S correspond to outcomes, this means that a real number, 
which we denote by X (a), is defined for each w € S. From this standpoint, we 
define random variable to be a real function on S as follows: 


" Let § be the sample space associated with a given random experiment. A 
real-valued function defined on S and taking values in R (- ~© , © ) is called a 
one-dimensional random variable. [f the function values are ordered pairs of real 
numbers (i.¢c., vectors in two-space) the function is said to be a two- dimensional 
random variable. More generally, an n-dimensional random variable is simply a 
function whose domain is S and whose range is a collection of n-tuples of reat 
numbers (vectors in n- space).” 


For a mathematical and rigorous definition of the random variable, let us 
consider the probability space, the triplet (S, B, P), where S is the sample space, 
viz., space Of outcomes, B is the o-field of subsets in §, and P is a probability 
function on B. 


Def. A random variable (r.v.) is a function X (@) with domain S and range 
(-©0 , ce) such that for every real number a, the event [w : X (@) < aje B. 


Remarks: 1. The refinement above is the same as saying that the function 
X (@) 1s measurable real function on (S, B). 


2. We shall need-to make probability statements about’a random variable X 
such as P {X<a). Forthe simple example given above we should write 
P{X< 1) =P (HH,HT,TH}.=3/4. Thatis, PX < a) is simply the probability 
pf the set of outcomes @ for which X (@) < aor 

PXs a)=P(@:X (W)S a} 
Since P is a measure on (S,B) i.e., P is defined on subsets of B, the above probability 
will be defined only if {[ @:X(@)< a) € B, which implies that X(w) is a nieasurable 
function on (S,B). 


3. One-dimensional random variables will be denoted by capital letters, 
X,Y,Z,...etc. A typical outcome of the experiment (i.e., a typical clement of the 
sample space) will be denoted by w or e. Thus X (@) represents the real number 
which the random variable X associates with the outcome @. The values whict: 
X,Y, Z, ... elc., can assume are denoted by lower case letters viz., x, y, z, ... etc. 


ww 
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4. Notations.-Ifx is a-rcal number, the set of all @ in S such that X(@ ) = x is 
denoted briefly by writing X = x. Thus 
P(X=x)=P{o:X (@)=x} 
Similarly P(X <al'=Piw:X (@) € [-~, al} 
and P.[a<X<b)=P(o:X(@)€ (a,b) ) 
Analogous meanings are given to 
P(X =aorX=b)=P{(X=a)uU(X=b)}, 
P(X=aandX=b)=P{(X=a)Q(X=b)}, ete. 
Iilustrations ; 1. If acoin is tossed. then 
§ = { @i, Oe | where ®= H, th= T 
1, if = H 
AON TG at oT 
X (w) is a Bernoulli random variable. Here X (@).takes only two values. A random 
variable which takes only a finite number of values is called single. 
2. An experiment consists of rolling a die and reading the number of points 
on the uptumed face. The most natufal random variable X to cunsider is 
X(@) = @;@ =1, 2, ..., 6 


- If we are interested in whether the number of points is even or odd, we consider 
a random, variable Y defined as follows : 
_ |0, if w is even 
Oy i if w is odd 
3. If a dart is thrown at a circular target, the sample space S is the set of all 
points w on the target. By imagining a coordinate system placed on the target with 
the origin at the centre, we can assign various random variables to this experiment. 
A natural one is the two dimensional random vaniable which assigns to the point 
@®, its rectangular coordinates (x,y). Another is that which assigns @ its polar 
coordinates (r, @ ). A one dimensional random variable assigns to each @ only one 
of the coordinates x or y (for cartesian system), r or (for polar system). The event 
E, “that the dart will land in the first quadrant" can be described by a. random 
variable which assigns to each point w its polar coordinate 6 so that X-(@) = 6 and 
then E ={:0<X (@) < 1/2}. 
4. Ifa pair of fair dice is tossed then S = {1,2,3,4,5,6} {1,2,3,4,5,6} and 
n(S)=36. LetX bea random variable with image set 
X(S) = {1,2,3,4,5,6) 
P(X = 1) =P{1,1} = 136 
P(X = 2) = P{(2,1),(2,2),(1,2)} = 3/36 
P(X = 3) = P(G,1),3,2),(3,3),(2,3),C1,3)} = 5/36 
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P(X =4)=P((4,1), (4,2), (4,3), (4,4), 3,4), (2,4), 1,4)} = 7/36 

Similarly P(X = 5) = 9/36 and P (X = 6) = 11/36 

Some theorems or Random Variables. Here we shall state (without proof) 
some of the fundamental results and theorems on random variables. 

Theorem 5-1. A funcuon X(@) from S to R (— © , 9) is a random variable if 
and only if 

{m:X(@)<a} € B 

Theorem 5-2. IfX,; and X2. aresandom variables and C is‘a constant then 
CX, ,X1+X2,X,X7 are also random vaniables. - 

Remark. It will follow that C:X:+C2X2 is a random variable for constants 
C, and C,. In parucular X, — X2 isar.v. 2 

Theorem 5-3. If {X,(@),22 1} arc random variables then 

sup X,(@), inf X,.(@), mn Weed X,(@) and lim inf X,(@) are-all ran- 
n n 


aso 
dom va:iables, whenever they are tii inite for all @. 
Theorem 5-4. If X is a random variable then 


(i) . where x oe if X (@)= 0 


(ii) X+(@) = max [0, X(@)] 
(ii) X-(@) = -—min [0, X(@)] 
(iv) |X | 
are random variables. 

Theorem 5-5. If X; and X2 are random variables then 
(i) max [X), Xz ] and (ii) min [X), X2 ] are also random variables. 

Theorem 5-6. If X isa r.v.andf(-) is acontinuous function, then 

f(X)isar.v. 

Theorem 5-7. If X is. a rv.and f(.) is an increasing function, then 
f(X)isar.v. 

Corollary. If f is a function of bounded variations on every finite interval 
[a,b], and X is ar.v. then f(X) isar.v. 

(proofs of the above theorems are beyond the scope of this book) 

EXERCISE 5 (a) 

1. Let X bea one dimensional random variable. (i) If a< b, show that the 
lwo events a < X S$ band X < a are disjoint, (ii) Determine.the union of the two 
events in part (i), (iti) show that P(a<X< b)=P(X< b)- P(X< a). | 

2. Let a sample space S consist of three elements @, , @, and @;. Let 
P(@,) = 1/4, P(@) = 1/2and P(@;) = 1/4. If X is arandom variable defined’ on 
S by X (@,) = 10, X (@) =-3, X (W ) =15, find P(-2< X< 2). 
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3. Let S=(e:, e2,..., en) be the sample space of some experiment and let 
E cS be some event associated with the experiment. 
Define we, the characteristic random variable of E as follows : 
lifeeE 
We (ei) = {0 ife, @E. 
In other words, We is equal to 1 if E occurs, and we is equal to0 if E does not 


occur. 
Verify the following properties of characteristic random variables : 


(i) Wy is identically zero , i.e., Ve (e)=0; f=1,2,...,n 
(ii) Ws is identically one , i.e., Ws(e:)=1; i=1,2,...,n 
(iii) E> F => we(e)= wee); '=1,2,...,2 and conversely 
(iv) If E CF then we(e) < We(e:); i=1,2,... ,n 
(Vv) We(e:)+ VE (e;) is identically 1: i= l, 2, ... 
(vi) Wenr (€:)= We(e:) Wr(e:)3 F=1,2,.., 
(vii) Weur (e) = We (6) + Wr (e) - latatal for i=1,2,. 
5.2. Distribution Function. Let X be ar.v. on (S,B,P). Then the ee 
Fy (x)=P(XSx)=P{@:X(@)< x}, — o<x<0 
is called the distribution function (d,f.) of X. 
Ifclarity permits, we may write F(x) instead of Fx (x). (5-1) 
5-2-1. Properties of Distribution Function. We now proceed to derive a 
number of properties common to all distribution functions. 
Property 1. [fF is the df. of the r.v.X and if a <b, then 
P(a<X<b)=F(b)- F(a) 
Proof. The events ‘a<X< 5’ and ‘X< a’ are disjcint and their union is the event 
‘X< b’. Hence by addition theorem of probability 
P(a<X< b)+ P(XS a)=P(XS 5b) 
=> P(a<xX<sb)= P(X<b)- P(Xsa)=F(b)- F(a) © ..5-2) 
Cor. 1. 
P(asX<b)=P\((X=a)U (a4<X<b) 
=P(X=a)+ P(a<X<b) 
(using additive property of P) 


=P(X=a)+[F(b)- F(a)] (5:2 @) 
Similarly, we get 
P(a<X<b) =-P(a<X< b)-P(X=b) 
=F(b)- F(a)- P(X= b) (5:2 5) 


P(as X<b)s P(a<X<b)+ P(X=a) 
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= (b)-F(a)—-P(X=b)+ P(X=a)_ ..(5-2¢) 

Remark. When P (X = a) = 0 and P(X = b) = 0, all four events a< X< b. 
a<X<b, af X <banda<X< bhave the same probability f(b) — F(a). 

Property 2. If F is the df. of one-dimensional r.v. X, then 
()OSF (x) SI, (ti) f (x) S Fy) if x<y. 

In other words, all distribution functions are monotonically non-decreasing 
and lie between 0 and 1. 

Proof. Using the axioms of certainty and non-negativity for the probability 
function P, part (i) follows triviality from the definition of F (x). 

For part (ii), we have for x < y, 


F(y)-F(x)=P(x<X¢ y)2 0 (Property 1) 
= F (y) > F(x) 
= F (x) < F(y) when x < y (5-3) 


Property 3. if F is af. of one-dimensional r.v. X, then 
F(-00)= lim F(x)=0 
x——00 
and F(c)= lim F(x)=1 
xX— oo 
Proof. Let us express the whole sample space S as a countable union of 
disjoint events as follows : 


S=[U (-n<Xs- nt+1)JULU (me X¥ n41)] 


a=] na=0 
=> P(S)= ¥ P(-n<Xs -n+1)+ D)-P(n<Xs +1) 
n= | a=0 


(-.* P is additive ) 


=> l=lim ) [F(-n+1)- F(-na)] 


a-— oo a=l 


b 
+ lim ) [F(n+1)-F(n)] 


bo n=O: 


lim [F(0)-F(-a)]+ lim [F(6+1)- F(0)] 


a@—0o bow 
= [F(0)- i at) Sa Cea a) 
l= F(0)- F(-0) wo *) 


Since — 00 < 00, F(—0o)< Foo). Also 
F(-0)2 0 and F(~)<1 ( Property 2 ) 
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O< F(-)S F(e)< 1 (**) 
(*) and (#*) give F(— 22) = 0 and F(c)= 1, 
Remarks. 1. Discontinuitics of F(x) are at most countable. 


2. F(a)- F(a-0)=_ lim P(a-hs Xa), h>0 
h-0 
. F(a)- F(a- 0)= P(X=a) 
and F(a+0)-F(a)= lim P(as X< at h)=0,h>O0 
h-0 
=> F(at+ 0)= F(a) 


5-3. Discrete Random Variable. _If a random variable takes at most a 
countable number of values, itis called a discrete random variable. /n other 
“words, a real valued function defined ona discrete sample space is called a discrete 
random variable. 
5-3-1. Probability Mass Function (and probability distribution of a 
discrete random variable)... 
_ Suppose X is a one-dimensional discrete random variable taking at most a 
countably infinite number of values x, x2,... With each possible outcome x; , 
‘we associate anumber pij= P(X = x;)= p(x; ), called the probability of x;. The 
numbers p(x;); i= 1,2,.... must —— the following conditions : 


(i) p(x)20 Vi, Ww E p(x)= 1 
i=] 

This function pis called the probability mass function of the random vanable 
X and the set (x; , p (x; ) ) is called the probability distribution (p.d.) of the r.v. X. 

Remarks: 1. The set of values which X takes is called the spectrum of the 
random variable. 

2: For discrete random: variable, a knowledge of the probability mass 
funetion enables us to compute probabilities of arbitrary events. In fact, if E is a 
set of real numbers, we have 


P(Xe E)= x p(x), where S is the sample space. 


xE ENS 


Illustration. Toss ofcoin, § = (H,T}. LetX be the random variable 
defined by 
X(H)= 1, ie, X= 1, if ‘Head’ occurs. 
X(T)= 0, i.e, X= 0, if ‘Tail’ occurs. 
If the coin is ‘fair’ the probability function is given by 
P((H) )=P((T})=4 
and we can speak of the probability distribution of the random variable X as 
P (X=1)=P((H} )=; 


2 
P(X=0)=P(({T) )= 


2 


2 


i 
2 
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§-3-2, Discrete Distribution Function. In this case there are a 


countable number of points x4, X2,.43,... and numbers p,2 0, 2 p,= t such 

that F(X )= = p;. For example if x; is just the integer i, F(x)isa 
(4:48 2X) = 

"step function" having jump p; at i, and being constant betwcen each pair of 

integers. 

meee B(x) 


17 2 3 4&4 § § 7 x 


Theorem 5-5. p(x;)= P (X= 4)= F(x,)- F(x%-1), where Fis the df. 
of X. 
Proof. Let x, < x.< ... We have 


F (x)= P(X$4x) | 
7 5 P(X= xi) = y P(x} 
f= 1 a 
and F(x-1)= P(XSx%-1)= DY p(x) 
F (xj) - F(4- e Prim ..(5-5) 


Thus, given the distribution function of discrete random variable, we can 
compute its probability mass function. . 
Example 5:1. An-experimeni consists of three independent tosses of a fair 
coin, Let 
X = The number of heads 
Y = The number of head runs, 
Z = The lenght of head runs, ° 
a head run being defined as consecutive occurrence of at least two heads, its length 
then being the number of heads occurring together in three tosses of the coin. 
Find the probability function of (i) X, (ii) Y, (tii) Z, (iv) X+Y and (v) XY and 
construct probability tables and draw their probability charts. 
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Solution.. 
Table 1 
S.No. . Elementary Random Variables 
event. —§. —_————————————————SSSS—ss——SSSFS— 
X+Y 
HHH 


HHT 
HTH 


OC~INDWRWHD = 

ear | 

= 

= 
Doe &@B NK NN WI) 
cooorcore} |< 
SDOONCON WIN 
o— We NW 
DOONTCON W/Z 


Here sample space ts. 

S = {HHH, HHT, HTH, HTT, THH, THT, TTH, TTT} 
(i) Obviously X is ar.v. which-can take the values 0, 1, 2, and 3 

p (3) =P (HHH) = (1/2) = 1/8 

p(2)=P [HHT U HTH UTHH} 

=P (HHT )+ P (HTH )+ P (THH ) = 1/8 + 1/8 +18 = 3/8 

Similarly p (1) = 3/8 and p (0) = 1/8. 
These probabilities could als6 be obtained directly from the above table 1. 


Table 2 
Probability table of X 


Values of X 
(x) 


p(x) 1g 38 38 18 
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Table 3 518 
ene 4 (8 
(ii) Probability Table of Y 


Values of Y, — 3/8 
(y) 28 


: a oad 
0 y 
This is obvious from table 1. pe Probably Puan 
(iii) From table 1 , we have 9 
Tabled ° 4/8 
Probability Table of 3/8 
Values of Z, ‘ 2/8 
(2) 1/8 
sel hat chart a Z 


p(u) 


(iv) LetU =X + Y. From table 1, we get 5/8 
Table 5 es 
ue 318 
Probability Table of U 
2/8 
se 


Probability chart of U = X+Y 
p(y) 


5/8 

(v) Let V = XY 418 

Table 6 3/8 

Probability Table of V 2/8 

Values of V; 1/8 
(v) 


Probability chart of V = XY 
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Example 5-2. A ‘random variable X has the following probability 
distribution : 
x 0 | 2 3 4 5 6 7 
p(x): OO ek 2k = 2k So 3BK kK?) oODK? Tk? +k 
(i) Find k, (ii) Evaluate P (X < 6), P (X 2 6), and P(0<X< _ 5), (iii) If 


P(X €c)> ry find the minimum value of c, and (iv) Determine the distribution 


function of X. [Madurai Univ. B.Sc., Oct. 1988] 
7 


Solution. Since 2, p (x) = 1, we have 
x= 


=> k+2k+2k+3k+k?+ 2k? + 7k +k=4 
=> 10k? + 9kK-1=0 
=> (lOkK-1) (kK +1)=0 =sk= 1/10 
[-. k=—l, is rejected, since probability canot be negative.] 
(ii) P(X <6) =P(X=0)4+P(X=!1)+...+P(X=S) 
l 2 2 3 l 8] 


ee, Se ——$ 


—10* 10* 10* 10* 100~ 100 

19 
P(X26)=1~P(X<6)=705 
P(O0<X<5)=P(X=1)+P(X=2)+P(X=3)+ P(X =4) = 8k’= 4/5 
(iil) P(X Sc) > 5. By trial, we get c = 4. 


(iv) 


Fy (x)= P(X $x) 


0 0 

1 ; = 1/10 
2 3k =3/10 
3 Sk = 5/10 
4 8k =4/5 

5 8k +k? =81/100 
6 8k + 3k? = 83/100 
7 9k + 10k? =1 


EXERCISE 5 (b) 


1. (a) A student is to match three historical events (Mahatma Gandhi's 
Birthday, India’s freedom, and First World War) with three years (1947, 1914, 
1896). If he guesses with no knowledge of the correct answers, what is the 
probability distribution of the number of answers he gets correctly ? 


(b) From a lot of 10 items containing 3 defectives, a sample of 4 items 1s 
drawn at random. Let the random variable X denote the number of defective items 
in the sample. Answer the following when the sample is drawn without 
replacement. 
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(i) Find the probability distribution of X, 
(ii) Find P(X <1),P(X< 1) and P(O<X <2) 


Ans. (2) x 0 1 2. 3 
l 1 
p(x) ° 0 6 


2. (a) A random variable X can take all non-negative integral values, and the 
probability that X takes the value r is proportional to o” (0< a< 1). Find 
P(X= 0 ). [Calcutta Univ. B.Sc. 1987] 

Ans. P(X =r)=AqQ’; r=0,1,2,...;A=l-a;P(X=0)=A=l1l-a 

(b) Suppose that the random variabic X has possible values 1, 2, 3, ... and 
P(X=j)= 2’, j= 1,2,... (i) Compute P(X "is even), (ii) Compute 
P(X2 5), and (iti) Compute P(X is divisible by 3). 

Ans. (i) 1/3, (ti) 1/16, and (i) 1/7 

3. (a) Let X be a random variable such that 

P(X= -2)= P(X=-1), P(X=2)= P(X= 1) and 
P(X>0)= P(X< 0)= P(X= 0). 
Obtain the probability mass function of X and its distribution function. 


Ans. xX ~2 ms | 0 ] 2 
- Poiogoa4 4 
P 6 6 3 6 6 
1 2 4 5 
= . = 2 ' 
ae) 6 6 6 


(b) A.random variable X assumes the values -3, -2, -1, 0, 1,2, 3 such that 
P(X= -—3)= P(X=-2)= P(X=-1), 
PUX= 1s F(X = 2)=-P (X= 3); 
and P(X=0)= P(X>0)=P(X< 0), 
Obtain the probability mass function of X and its distribution function, and find 
further the probability mass function of Y= 2X 24 3X44, 


[Poona Univ. B:Sc., March 1991] 
Ans. xX 


s 22 =f OO 1 2 3 
oao1t © 2 2 a 2 a 
P(x 9 9 9 3 9 9 9 
Y 3 6 3 4 9 18 31 
ty) 1 oa 31-61 +231 ~=321 ~«21 
PY 9 9 9 3. 9 9 9 
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4.(a) A random variable X has the following probability function : 


Values of X,x : -2 -] 0 | 2 3 
p(x): 0-1 k 0:2 2k = «0:3 k 
_ (i) Find the value of k, and calculate mean and variance. 
(ii) Construct the c.d.f. F(X) and draw its graph. 


Ans. (i) 0-1, 0-8 and 2-16, (ii) F (X ) = 0-1, 0-2, 0-4, 0-6, 0-9, 1-0 
(b) Given the probability function 
x 0 1 2 3 
p(x) 0-1 03 05 0-1 


Let Y= X7+ 2X, then find (i) the probability function of Y, (ii) mean and 
variance of Y. 
Ans. (i) y 0 3 8 15 


P(y) | 01 03 05 0-1 


5. A random variable X has the following probability distribution : 
Values of X,x |O0 1 2 3 4 § = 6. 7 8 
p(x) a 3a 5a Ta 9a lla 13a 15a 17a 
(i) Determine the value of a. 
(ii) Find P(X< 3),P(X2 3),P(0< X< §). 
(iii) What is the smallest value of x for which P(X < x)> 0-5? and 
(iv) Find out the distribution function of X ? 
Ans, (i) a= 1/81, (ii) 9/81, 72/81, 24/81, (iii) 6 


(iv) x 0 ] 2 3 4 5 6 7 8 
F(x) a 4a 9a l6a 2a 36a 49a G4a 8la 
6. (a) Let p(x) be the probability function of a discrete random 
variable X which assumes the values x, , x2, x3 ,X4, such that 2 p (x) =3 p (x) 


= p (X3) =5 p (x4). Find probability distribution and cumulative probability dis- 
tribution of X. (Sardar Patel Univ. B.Sc. 1987) 


me CO 
ee) [ite [ime [ie | Ae 


(b) The following is the distribution function of a discrete random 
variable X : 


, (ii) 6-4 ,16-24 


-3 -1 0 | 2 3 5 8 

0:10 030 045 OS 075 090 095 1-00 
(i) Find the probability distribution of X. 

(ti) Find P(X is even) and P(1< X< 8). 

(iii) Find P(X = —3|X< 0) and P(X2 3| X>0). 

[ Ans. (ii) 0-30, 0-55, (iii) 1/3, 5/11]. 
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7. If p(x)= 753 x= 1,2,3,4,5 
= 0, elsewhere 
Find (é) P{X = lor2}, and (i) P 3 <X< 2 X> i} 
[Allahabad Univ. B.Sc., April 1992} 


Hint. (i) P{X= 1 or 2}= P(X= 1)+ P(X= 2)=—4 == ; 
Pita <x <2 ax> 1} 
(i) PS <X<3|x>1}5 P(X> 1) 
_PA(Xslor2)0X>t P(x=2) Ms 
P(X>1) 1-P(X=1) 1-(Ms) 7 


8. The probability mass function of a random variable X is zero 
except at the points x= 0,1,2. At these points it has the values p (0) = 3c’, 
p(1)=4c— 10c’ and p (2)=S5c—1 for somec>0. % 

(i) Determine the value of c. 

(ii) Compute the following probabilities, P(X <2) and P(1<X <2). 
(iit) Describe the distribution function and draw its graph. 
(iv) Find the largest x such that F (x)< 2. 

(v) Find the smallest x such that F (x)24%3. [Poona Univ. B.Sc., 1987) 


; eee 
Ans. (i). (iz) (i) 1,(v) 1. 


9. (a) Suppose that the random variable X_ assumes three values 0,1 and 2 
with probabilities 5, 4 and > respectively. Obtain the distribution function of 
X.. [Gujarat Univ. B.Sc., 1992] 


(b) Given that f(x)=k(1/2)' is a probability distribution for a random 
variable which can take on the values x = 0, 1, 2, 3, 4, 5, 6, find & and find an 
expression for the corresponding cumulative probabilities F (x). 

[Nagpur Univ. B.Sc., 1987) 

5-4. Continuous Random Variable. A random variable X’ is said to be 
continuous if it can take all possible values between certain limits. /n other words, 
a random variable is said to be continuous when its different values cannot be put 
in 1-1 correspondence with a Set of positive integers. 

A continuous random variable is a random variable that (at least concep- 
tually ) can be measured to any desired degree of accuracy. Examples of continuous 
random variables are age, height, weight etc. 

5:4-1. Probability Density Function (Concept and Definition ). Consider the 
small interval (x, x + dx) of length dx round the point x. Let f(x) be any continuous 
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function of x so that f(x) dx represents the probability that X falls in the in- 
finitesimal interval (x, x + dx). Symbolically 
PasXsxt+dx)=afr (yd ... (5-5) 
In the figure, f(x) dx represents 
the area bounded by the curve 
y =f (x), x-axis and the ordinates at 
the points x and x+ dx. The func- 
tion fx (x) so defined is known as 
probability density function or simply 
density function of random variable 
X and is usually abbreviated as 
p.df. The expression, f (x) dx , usually written as dF (x), is known as the prob- 
ability differential and the curve y= f(x) is known as the probability density 
curve or simply probability curve. 
Definition. p.d.f. fx (x) of the r.y. X is defined as : 
fe = lim POs Heaton) (55.4) 


bx >> 


- f (X).d x Pes 
? 


x- Ox x+dXx 
2 2 


_ The probability for a variate value to lie in the interval dx 1s f(x) dx and hence 
the probability for a variate value to fall in the finite interval [a , B) is: 


P(asX<B)= fe f(x) dx + (5-5 b) 
which represents the area between the curve y = f (x), x-axis and the ordinates at 
x=aqandx=f. Further since total probability is unity, we have ’ f(x) dx=1, 


where [ a, 5 } is the range of the random variable X . The range of the variable may 
be finite or infinite. 

The probability density function ( p.djf.) of a random variable (7.v.) X 
usually denoted by fx (x) or simply by f(x) has the following obvious properties 


(i) f(x)2 0, -0< x< ... (55 €) 
(ii) | fe) de= 1 (55d) 
(iii) The probability P(E) given by 
P(E)= | f(x)de 1. (55) 
E 


is well defined for any event E. 

Important Remark, In case of discrete random variable, the probability ata 
point, .e.,P (x= c)is not zero for some fixed c. However, in case of continuous 
random variables the probability at a point is always zero, i.e.,P (x= c)= 0 for 
all possible values of c. This follows directly from (5-5 6) by taking a= B= c. 
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This also agrees with our discussion earlier that P(E )= 0 does not imply that 
the event E is null or impossible cvent. This property of continuous r.v., viz., 


P(xX=c)=0, Ve OSS) 
leads us to the following important result : 
PiasXs B)=P(as X< B)=P(a<X< B)=P(a< X<B) ...(5-5g) 
i.e., incase of continuous r.v., it does matter whether we include the end points of 
the interval from o to B. 
However, this result is in general not true for discrete random variables. 
5-4-2. Various Measures of Central Tendency, Dispersion, Skewness, and 
Kurtosis for Continuous Probability Distribution. The formulae for these 
measures in case of discrete frequency distribution can be easily extended to the 
case of continuous probability distribution by simply replacing p;= f,//N by 
f (x) dx, x; by x and the summation over ‘i’ by integration over the specified range 
of the variable x. 
Let fx (x) or f(x) be the p.df. of a random variable X where X is defined 
from a to b. Then 
: b 
(i) adunele mean = I? xf(ixyyad& (5-6) 
(ii) Harmonic mean. ae mean H is given by 


b 
Hole é ys eer (56 a) 
(iii) Geometric mean. sees mean G is given by 
log G =| log xf (x) dx 


(56 b) 
(iv) ps? (about origin) =|? Y f(x) de a 
u’ (about the pointx= A) =|” (x— AY f(a) dx a 
and pW, (about mean) = i (x— mean) f (x) dx AST b) 
In particular, from (5-7), we have 
ji’ (about origin) = Mean = |” x f(x) dx 
and w=? Lia 
2 
Hence poe p=? Pf Qdr-| ? fede] ...5-70) 


From (5: 7), on putting r=3 and 4' respectively, we get the values of 
Ms’ and 1’ and consequently the moments about mean can be obtained by using 
the relations : 
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[ls = [ls — Byte’ py’ + 2k)” 
we a = pla’ — 4ps” py’ + Ole” pa? — 3p," ee) 
and hence B; and B2 can be computed. 
(v) Median. Median is the point which divides the entire distribution in two 
equal parts. In case of continuous distribution, median is the point which divides 
the total area into two equal parts. Thus if M is the median, then 


Js dx= | FG) dx=4 


... (5-8) 
Thus solving 
M ! b 1 
[e faydr=t or ly fx) de=4 (58 a) 
for M, we get the value of median. 
(vi) Mean Deviation. Mean deviation about the mean 1,’ is given by 
b 
M.D. = pr | x— mean | f (x) dx (59) 
(vit) Quartiles and Deciles. Q; and Q3 are given by the equations 
Q1 _1 Qs =3 
ie PERS sane fe POG ... (5-10) 


D;, ith decile is given by 
PY fadc= = 1 (5-104) 


(viii) Mode. Mode is the ics of x for which f (x) is maximum. Mode is thus 
the solution of 
f'(x)= 0 and f’"(xX)< 0 . (5-11) 
provided it lies in [a,b]. 
Example 5-3. The diameter of an electric cable, say X, is assumed to be a 
continuous random variable with pdf. f(x)= 6x(1- x), OS x< 1. 
(i) Check that above is p.d f., 
(ti) Determine a number b such that P (X < b) =P (X > b) 
(Aligarh Univ. B.Sc. (Huns). 1990] 
Solution. Obviously, forO< x< 1, f(x)2 0 


Now pf@d=6l x(1~x) dx 


= 6J (x— 


Hence f (x) 1s the p.df. of r.v. X 
(ii) P(X <b)=P{(X>b.) al?) 
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=> Io fey de= JP fey dx 
=> 6 J) x(l-a)dr=6f) x(1-a) dx 
2s x elo |e xy) 
2 3lo0 |2 3le 
” be \_{fl_i)_(e_& 
2 3)] ||2 3 2 3 
= 3b? — 2b’ = [1 - 3b? + 26°] 
=> 4b°-6b°+1=0 
(2b—- 1) (2b? -2b- 1)=0 
= 2b-1=0 or 2b6’-2b-1=0 


Hence b= 1/2 is the only real value lying between 0 and 1 and satisfying (*). 
Example 5-4. A continuous random variable X hasa pdf. 
f(x)= 3%, OS.x¢ 1. Findaand b such that 
(i) P{X<S a)=P{X> a}, and 


(ii) P{X>b)=005. [Calicut Univ. B.Sc., Sept. 1988] 
Solution. (i) Since P(X S$ a)= P(X> a), 


each must be equal to 1/2,-because total probability is always one. 


1 a ae 
PK<a=5 = Jp f@a=5 
7 rae cl geet 
= Jovan = 313 02 
oe. = at 
ios a?) = 9 
(ii) P(x>6)=005 = J? ray dr=005 
=> x oe => 1- ‘a 
31, 20 ; 20 
,_ 19 1 
=> = 00 => o=(59 Ae 


Example 5:5. LetX be acontinuous,random variate with p.df. 
f(x)= ax, O< x¢ 1 
=a, Ilsxe2 
=~ axr+ 3a, 2<x<3 
= 0, elsewhere 
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(i) Determine the constant a. 
(ii) Compute P(X < 1-5). [Sardar Patel Univ. B.Sc., Nov.1988] 
Solution. (i) Constant ‘a’ is determinéd from the consideration that total 
probability is unity, ¢.e., 


J f@)de= I 
= [9 forde+ bf det [re det Brandes Je f() dx=1 
7 [jax dx+ Fi ade + lp (— ax + 3a) dx=1 


=l 


2 
x 
me aa 


} x |? 
+al|x +a 
0 ] 


= 5+ a+a [$+ 2} 2+ ae 
l 


a a : = 1 
=> at at a= => W=1 => a= 5 
(i) PIX S15)=J!? foydr=J > fix)drt fh fx) de+ J? fon ax 
1 1-5 
=a), xdx+J", a.dx 
xy |1 15 a 
a ral xl! =5 +054 
=a=5 [a= 4, Part(i)] 


Example 5-6. A probability curve y= f(x) has a range from 0 to ~. If 
f (x) = e-", find the mean and variance and the third moment about mean. 
[Andhra Univ. B.Sc. 1988; Delhi Univ. B.Sc. Sept. 1987] 
Solution. 


u, (rth moment.about origin ) = Pe x" f(x) ad 
: ola x’e* d= T(r+Near! 
(Using Gamma Integral) 
Substituting r = 1, 2 and 3 successively, we get 
Mean = p,’=1!=1, py’ =2!=2, ps’=3!=6 
Hence variance = (l2= pe — p,?=2-T=1 
and Ms = ps’ — 3 plo pn’ +2 py = 6-3xX24+2=2 
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Example 5-7. Ina continuous distribution whose relative frequency density 
is given by 

f()=yo.x(2-x), OSxS2, 
find mean, variance, B, , and Bo and hence show that the distribution is symmetri- 
cal. Also (i) find mean deviation about mean and (ii) show that for this distribution 
[n+1=Q, (iti) find the mode, harmonic mean and median. 
[Delhi Univ. B.Sc.(Stat. Hons.), 1992; B.Sc., Oct. 1992] 
Solution. Since total probability is unity, we have 


Jef) de= 1 
=> yo fy x(2-x)dr= I => you 
f(x)= 3x(2- x) 


, y 3 re 2 
w= fi xfoide= 2) x visa 


(r+ 2) (r+ 3) 
In parucular 
; = - z. ,_ 3:2? 6 
Mean= w= 37 = 1, He = 5» 
ee, ae ge 
5G 5 677 
Hence varience= p2= bh — py)/?= 2-1=% 
M3 = Us ~ 3 Wy’ + m= 2 - 3 2.14 2= 0 
, pg Psa 16 8 6 3 
= p’- 4 + 6 *— 3/8 = S- 4.2. Re ee 
Ha = ps Hs Hh M2 Wi - 3th 7 5 1tO.5.1-3.1a35 


Since B, = 0, the distributior is symmetrical. 
Mean deviation about mean 


= Ik lx-1] f(x)de 


J lx-1l pqyart Ro la-tl pende 


3 2 
=| ; (l-x)x(2-x)dx+ iF (x- I)x-2) de | 


the integrand ¢ 
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3041 
= 3[f) (Qx- 33°42) dx + fi) 2-2 -2x) ae | 
-3| i Bike |! | x ¥_ mw P)_ 3 
=4lle- a t+ lot i337 a7 hd ss 
Most = Jo (x-mean)"*! F(x) dx 
oS ce 2n+1 as 4 
== fF te 1**! 5 (2—x) de 
= : ae emt e+e Dl -dadt (x-l=r) 


_3 fi 2n+1 — 72 
=o fied yde 


Since ¢”*' 1 isan odd function of ¢ and(1-~/7) is an even function of ¢, 
+11 _¢?) is an odd function of ¢. 
Hence Mn+ = 0. 


Now f= 52-29) = 0 => x=] 


and f"(a)= 3(-2)=-3<0 


Hence mode = 1 
Harmonic mean H is given by 


= H= > 
If M is the median, then 
M 1 
Jo f@ar= 5 
3 ¢M 1 ‘ 
=> = Jo x(2~x)de= 5 x 
31M 
Bo Res 2 
= xo- 31073 
=> 3M?- M®=2 
= M*?-3M*+2=0 


(M-1)(M*-2M-2)= 0 


U 
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The only value of M lying in [0,2] is M = 1. Hence median is 1. 
Aliter. Since we have proved that distribution is symmetrical, 
Mode = Median = Mean = 1 

Example 5-8. The elementary probability law of a continuous random 

variable X is 
f@= yer", asx<~, b>0 
where a,b and y. are constants. 

Show that y= b= Vo and a= m-— 6, where mand G are respectively the 
mean and standard deviation of the distribution. Show also that B, = 4 and 
B.= 9. [Gauhati,Univ. B.Sc.,.1992] 

Solution. Since total probability is unity, 


JP f@)ar=1 => yf et deal 


owl ok we ened cee 
a opBml 

=> Yo = b 

H,’ (rth moment about the point ‘x = a’ ) 


= | @-ay f@ae= oJ Gay eh dx 


=> Yo 


= 5b < eat [ On putting x- a= ¢] 
. ! 
= b as os [ Using Gamma Integral ] 


In particular 
t= 1/b, ps’ = 2/6", pts’ = 6/b", py’ = 24/7b* 
m= Mean =a+ Uh) =a+(1/b) 


=> ge and m=a+—-£a+o 
b b- 
Hence Yea b =e and a=m-o 
, , , 23 1 2 
Also Ls = Us’ ~ 3 pe’ wit 2u1 = 5 (6-3242)=-G= 20° 
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and Pa = pls — 4 ls’ pa + 6 pe’ i’? - 3 ph 
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== (24- 46.1+621-3)=4= 9 9 o* 


Hence Bi= us/t2 = 40°/0°= 4 and B2= p/p = 90/0" = 9 
Example 5-9. For the following probability distribution 
dF = yee (7! dx, —w2<x<oco 


show that yo= 5 , th’ = 0, o= V2 and mean deviation about mean = 1. 
Solution. We have ine f@)d= 1 


=> Yo hon elad=1 = 2y, [ é'*! d= 1, 


‘since e"|*! is an even function of x) 


= 2] et de=1, (sincein0< x< 0, |x]=x) 
-x loo 
=. “2y5 = ao => 2y= 1, ie, y= = 


1’ ( about origin ) = Jo. x f(x) dx= 5 Jo rel ds 


= Q, 


(since the integrand x .e a 


is an odd function of x ) 
u’ = ae ¥ f(x) dx= =I. goes te 
=o 21, x ol ay 


[since the integrand x” e"'*! 


is an even function of x ] 
[2’ = 1p x e*dx=1T(3) (onusing Gamma Integral ) 

=> io = 2!= 2 

Now O = b= pr’ - p= 2 

M.D..about mean = |* | x—mean | f(x) dx 


biel -ecthennabels 
1 ass -\x\ 
et lxl ee! ax 


= |p xefde= 1(2)=1 
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Example 5-10. A random variable X has the probability law: 
a8) = 55 er ay O<x<00 


Find the distance between the quartiles and show that the ratio of this distance to 
the standard devation of X is independent of the patameter ‘b’. 
Solution. If Q; and Qs are the first and third quartiles respectively, we 


have 
OQ, = 1 0; -2'/2 = 1 
\ f(x) d= 4 J xe dx 4 
2 
x 
Put = ag then dy= B 
2 2 742 
OY/20* -» , 1 e” |Qi/2b 1 
iF e ~ dy 4 = ah 4 
a 1 zo7;y? 3 
Qi/2 _ = Q/%> _ A 
=> l-e | => e€ eT 
=> i= V 2b Vlog (473 ) 
Again we have ie fQ@da= 2 which, on proceeding similarly, will give 
2.2 2 2 
1-e@ ?@M%=-3/4 > €8™* = 1/4 
=> Q3= Vv 2b v log (4 ) 


The distance between the quartiles is given by 
O3-Q,= V2b [Vlog 4 - ee ] 


in’= | x f(x) dx= ge 4 Xt ay 
= |y° V 2by’? &? ay y= 35 | 
“Vale er pate 
2013 } V2... 23 ) 1238 = bV(n/2) 

Hr" = Io x* f(x) dx= Io rs en dy 


= 26 | y €’ dy [y=] 


= 267 1(2)= 267. 1!= 26° 
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GO? = p= py’ - p’?= 267- B.S (2-2 


2 2 
= o= b V2-(n/2) 
H Q:- Qi V2[ Vlog 4 - Vlog (473) ] 
ence = 
= 2— (x/2) 


which is independent of the parameter ‘b’. 
Example 5-11. Prove that the geometric mean G of the distribution 
dF =6(2-x)(x~ 1) dx, 1$x<2 
is given by 6 log (16G)= 19. [Kanpur Univ. B.Sc., Oct. 1992] 
Solution. By definition, we have 


log G= iF log x f(x) d= 6 log x (2-x) (x- 1) dx 
--6f* (x° ~ 3x + 2) log x dx 


Integrating by parts, we get 


2 
log G= -6| [2-32 425 hog x| 
3 
-f [23% sae] u ax| 
2 x 
=-—4 log 24 6x32 (on simplification) 
19 4 19 
log G+ 4 log 2=—— => log G + log 2 =— 


=> log G + log 16=-— => log (16 G)="2 


= 6 log (16'G) = 19 


Example 5-12. The time one has to wait for a bus at a downtown bus stop 
is observed to be random phenomenon (X) with the following probability density 
function : 


fx(o= 0, for x<0 
(x+1), for 0S x< 1 
(x-1), for 1s x<3 
(3-x), for 35 x<2 
(4-x), for 25 x< 3 
, for 35 x<6 


| 
wli-ol= wle wile wl]— 
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= 0, for 6<x, 
Let the events A and B be defined as follows : 
A : One waits between 0 to 2 minutes inclusive: 
B : One waits between 0 to 3 minutes inclusive. 
(i) Draw the graph of probability density function. 


(ii) Show that (a) P(B\A)= :, (b) P(A NB)=: 


sd | hae 


Solution. (i) The graph of pa. is given below. 
(x) 


ufo 


3/9 


ae 
— 


> (on simplification ) 


P(ANB)= P(1SXS2)=]? f(x) dx 
w4( 1 4(5 
=i 5[x-5 ber Fh, §(S-* 
0.204 P23 
“9L2° 2) "9127 2m 3 


: P(B\A)= PLAOB) 1/3 2 


P(A) 1/23 
(b) ACY B means that waiting time is more than 3 minutes. 
is P(AnB)=P(X>3)=Js" sacs Je f(x) det | fx) de 


SE Sae$ Valte! 


N 
(on simplification ) 


5-26 Fundamentals of Mathematica} Statistics 

Example 5:13. The amount of bread (in hundreds of pounds) X that a certain 
bakery is able to sell in a day ts found to be a numerical valued random 
phenomenon, with a probability function specified by the probability density 
functionf (x), given by 


f(@=A.x, for O< x< § 
= A(10-x), for 5< x< 10 
= 0, otherwise 


(a) Find the valué of A such that f (x) is a probability density function. 
(b) What is the probability that the number of pounds of bread that will be sold 
tomorrow I$ 
(i) more than 500 pounds, 
(ii) less than 500 pounds, 
(iii) between 250 and 750 pounds? [Agra Univ. B.Sc., 1989} 
(c) Denoting by A,B,C the events that the pounds of bread sold are as inb 
(i), b (ii) and b (iii) respectively, find P(A\B),P(A|C). Are (i) A and B 
independent events? (ii) Are A and C independent events ? 
Solution. (a) [n order that f (x) should be a probability density function 


[°° f@) de=1 
| 5 10 — 
ie, |p Axde+ F A(10-x) d= 1 
=> A= oe (On simplification) 


(6) (i) The probability that the number of pounds of bread that will be sold 
tomorrow is more than 500 pounds, i.e., 


10° 2 x? {10 
p(ssxsio)= fl°4 (10-x)de= £ | 1-2 | 

1 (25 

-4+[2 i 2= 95 


(ii) The probability that the number of pounds of bread that will be sold 
tomorrow is less than 500 pounds, i.e., 


- (5 1 a GE [Pe Th ae! 
P(OSXS5)= Jo pe -xde= Ze} [y= 7 = 05 
(iii) The required probability is given by 
_f5 1 _ 3 
P(25sKS75)= Jo. so x dea JO? 5 (0-x) de= 3 


(c) The events A, B and C are given by 
A:5< X< 10; B:0< X< 53; C:2:5< X< 75 
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Then from parts 6 (i), (ii) and (iii), we have 


P(A)= 0-5, P(B)= 05, P(C)= 


& |W 


The eventsAMB andANC are given by 
ANB=0 and ANC: 5< X< 7-5 
P(AQNB)= P(b)= 0 
and P(ANC)= i fla) de= 3 je (10 — x) dx 
1 75 


P(A). P(Qh=; Exgs 25 PIANC) 


= A andC are independent. 
Again P(A). P(B)= t # P(ANB) 


=> A andB are not independent. 


P(A|B)= pa = 0 
palo PANO. 381 


P(C) 3/4 2 


Example 5-14. The mileage C in thousands of miles which car owners get 
with a certain kind of tyre is a os variable having probability density function 


fae 5 — 5 ¢° for x>0 


= " , for x <0 
Find the probabilities that one of these tyres will- last 
(i) at most 10,000 miles, | 
(ti) anywhere from 16,000 to 24,000 miles. a 
(iti) at least 30,000 miles. (Bombay Univ. B.Sc. 1989) 
Solution. Letr.v. X denvte the mileage (in ’000 miles) with a certain kind 
of tyre. Then required probability is given by: 


(i) — PXS10= J Foae= m7 ye? dx 
_ | ’ Sia 10 l-e” 
~ 20 |-1/20 |o — 


L- 90-6065 = 0-3935 
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° 1 24 _ x _ | _ vm {24 
(ii) P(lW6<SX <24) 70 J16 exp ( x ax | e : 


-16/ -2A/D 
sig (Ag = ¢ 


0-4493 — 0-3012 = 0-1481 


~ 4/5 - 6/5 
= 


7 1 en 7/2 oo 
(iii) P(X230)= Jan f@) a= 55 | T7790 30 
= ei*= 0-2231 

EXERCISE 5 (c) 


1. (a) A continuous random variable X follows the probability law 
fQ)= Ax’, O<xS1 

Determine A and find the probability that (i) X lies between 0-2 and 0-5, 
(i) X is less than 0-3, (iii) 1/4<X < 1/2 ana qv) X >3/4 given X >1/2. 

Ans. A= 0-3, (i) 0-117, (ii) 0-027, (iii) 15/256 and (iv) 27/56. 

(b) If a random variable X has the density function 

_ Jls4, -2<x<2 
f@)= 0, elsewhere . 
Obtain (i) P(X <1), (ii) P(| X |>1) (iti) PQX+3>5) 
(Kerala Univ. B.Sc., Sept.1992) 


Hint. (ii) P(\X|>1)=P(X>1 or X<-1=J7) f(x) d+ Jo f(a) de 


or PUAX|>D=1- P(LX|<l)=1- P-1SX<1) 
Ans. (i) 3/4, (ii) 1/2 (iii) 1/4. 
2. Are any of the following probability mass or density functions? 
Prove your answer in each case. 
1 3 1 1 
(a) f(x)= x; x= 16° 16°4°2 
(6) f(y=hO™; x20; A>0 
| 2x, O< x< I 
(c) f(y= 5 4- 2x, 1< x< 2 
0, elsewhere , 
(Calicut Univ. B. Sc., Oct. 1988) 
Ans. (a) and (b) are ,.m.f./p.d.f.’s, (c) is not. 
3. Iffi andf are p.d.f.’s and 6:+ 6,= 1, check. 


g(xy=Ofitx)+ f(x), isap.df. 
Ans. g(x) is a p.df. if 0<(61,6.)<1 a-° 4+ ®= 1. 
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4. Acontinuous random variable X has the probability density function : 
f(@)= A+Bx, O<x1. 
If the mean of the distribution is 5, findA andB. 


Hint: Solve Jf f(x) dx= 1 and Ik x f(x) dx = 4. Find A and B. 


5. For the following density function 
f= ex (1-x), 0<x<l, 
find (i) the constant c, and (ii) mean. 
[Calicut Univ. B.Sc.(subs.), 1991] 
Ans. (i)c=12; (ii) mean= 3/5. 
6. A continuous distribution of a variable X in the range (—3, 3) is defined by 
f= + B+x’, -3sxs-1 
+ (6-2x), -1<xs1 
+ (3-x), 1sxs3 
(i) Verify that the area under the curve is unity. 


(ii) Find the mean and variance of the above distribution. 
(Madras Univ. B.Sc., Oct. 1992; Gujarat Univ. B.Sc., Oct. 1986) 


Hint: [°, fa@yde= J~) anaes J, perae+ JP pera 


Ans. Mean=0, Variance=1 
7. If the random variable X has the p.d.f., 

f@= 5 @+)), -l<x<l 

= 0, elsewhere, 

find the coefficient of skewness and kurtosis. 
8.(a) A random variable X_ has the probability density function given by 

f= &(i-x), O<x<1 
Find the mean ut , mode and S.D. o , Compute P (u — 206 < X <UL + 20). 
Find also the mean deviation about the median. 

(Lucknow Univ. B.Sc., 1988) 


(b) For the continuous distribution 
dF = y.(x-x*) dx ; O<x<1, y, being aconstant. 
Find (i) arithmetic mean, (ii) harmonic mean, (iii) Median, (iv) Mode and (v) rth 
moment about mean. Hence find B, and B2 and show ‘that the distribution is 
symmetrical. (Dethi Univ. B.Sc., 1992; Karnatak Univ. B.Sc.,'1991) 


Ans. Mean = Median = Mode =i 
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(c) Find the mean, mode and median for the distribution, 
dF (x)= sin xdx, OSx<n/2 
Ans. 1, 2/2, 2/3 


9. Ifthe function f(x ) is defined by 
f@=ce*", 0<x<0, a>0 
(i) Find the value of constant c. 
(ii) Evaluate the first four moments about mean. 
[Gauhati Univ. B.Sc. 1987] 
Ans. (i) c=0, (ii) 0, 1/0’, 2/7, 9/a%. 
10. (a) Show that for the exponential distribuuon 
dP= y,.e€”%° dx, 0Sx<~, o>0 
tne mean and S.D. are both equal to o and that the interquartile range is 
6 loge 3. Also find 1,’ and show that B= 4, B= 9. 
[Agra ‘:niv. B.Sc., 1986 ; Madras Univ. B.Sc., 1987] 
(b) Define the harmonic mean (H.M.) of variable X as the reciprocal of the 
expected value of 1/X , show that the H.M. of variable which ranges from 0 to 


co with probability density z 2? &* is3. 


11. (a) Find the mean, variance and the co-efficicnts 8B, , B2 of the distribu- 
tion, 
dF=k x &* dx, 0<x<0, 
Ans. k= 1/2; 3,3, 4/3 and S. 
(b) Calculate B, for the distribution, 
adF=kxe*dx, )<x<e 
Ans, 2 [Delhi Univ. B.Sc. (Hons. Subs.), 1988] 
12. A continuous random variable X_ has a p.d.f. given by 
fw=kxe * x20, 2>0 
= Q, otherwise 
Determine the constant & , obtain the mean and variance of X . 
[Nagpur Univ. B.Sc. 1990] 
~ [3. For the probability density function, 
2(b6+ x) 


f(x)= Tere t -b<x<0 
_ 2(a~-~x) , é 
~ a(at+b)’ Os xsa 
Find mean, median and variance. [Calcutta Univ. B.Sc. 1984] 


Ans. Mcan=(a-6)/3, Variance=(a’+ 6+ ab)/18, 
Median = a— ¥ a(at b) /2 
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(ii) Show that, if terms of order ( a ~ 6 )*/a’ are neglected, then 
mean ~ median = (mean — mode) / 4 
14, A variabic X can assume valucs only between 0 and 5 and the equation 
of its frequency curve is 
y= A sin ; mx, 0<x<5 


where A is a constant such that the area under the curve is unity. Determine the 
value of A and obtain the median and quartiles of the distribution. 

Show also that the variance of the distribution is 50 : ~ < : 

Ans. 1/10; 2-5, 4/3, 10/3 

15. A continuous variable X is distributed over the interval (0, 1] with p.d.f. 
ax'+ bx, wherea, areconstants. If the arithmetic mean 6f X is 0-5, find the 
values of a and b. 

Ans. — 6,6 

16. A man leaves his house at the same time every moming and the time 
taken to journey to work has the following probability density function : less than 
30 minutes, zero, between 30 minutes and 60 minutes, uniform with density k ; 
between 60 minutes and 70 minutes, uniform with density 2k ; and more than 70 
minutes, zero. What is the probability that on one particular day he arrives at, work 
later than on the previous day but not more than 5 minutes later. 

17. The density function of sheer strength of spot welds is given by 

f(x)= 2x/160,000 for 0< x< 400 
= (800- x) /160,000 for 400< x< 800 
Find the number a_ such that 
Prob. (X < a) =0-50 and the number 6 such that 
Prob. (X < 6) =0-90. Find the mean, median and variance of X. . 
{Dethi Univ. B.E., 1987] 

18. A batch of small calibre ammunition is accepted as satisfactory if none 
of a sample of five shot falls more than 2 fect from the centre of the target at a 
given range. If X , thé distance from the centre of the target to a given impact 
point, actually has the density 


2 
f(x)=k.2xe~* , O< x< 3 
where k is a number which makes it probability density function, what is the value 
of k and what is the probability that the batch will be accepted? =z. 
[Nagpur Univ. B.E., 1987) 


Hint. \ f@Md=1 > k= 1K 1~e’) 
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Reqd. Prob. = P [ Each of a sample of 5 shots falls within a distance of 2 ft. 
from the centre ] 


1~-e* 
= rocte2y | [2 roar} = [455 
19. A random variable X_ has the p.d.f. : 


_ | x, W< x< il 
x)= | 0, otherwise 


° ° l oe I l . eee 3 | 

Find (P[X< 1), (ii) p{i< X< 1], (ii) P{ X> 7 | X> 1]. and 
: 3 i 
fv) P(X < a | X> i], 


(Gorakhpur Univ. B.Sc., 1988) 


‘: .P(X>%) Yo 7... P(*W<X< %) 
Ans. (i) 1/4, (ii) 3/16, (lit) Bx Wy wm 12° (™)  Bx> A) 5) 


5-4-3. Continuous Distribution Function. If X is a continuous random 
variable with the p.d.f. f(x) , then the function 
Fx(Qx)= P(XSx)= ]*_ fdt, -o<x<e, (5-12) 


is called the distribution function (d:f.) or sometimes the cumulative distribution 
function (c.d.f.) of the random variable X. 


Remarks 1. Os F(x)s 1, -wKc xc om, 


2. From analysis (Ricmann integral), we know that : 
F'(i)= < F (x)= f(x) 20 [- f(x) is pd} 
=> F (x) is non-decreasing function of x . 
3, F(-e)= lim F(x)= tim [* f(x)de= [7 p(xrdr= 0 
x3 @ x0 


and F(+e0)= lim F(x)=lim J* f(x)de= J" f(x) d= 1 


xe xXx 0 


4. F(x) 18a continuous function of x on the right. 
5. The discontinuities of F (x ) are at the most countable. 
6. It may be noted that 


P(as Xs b)= J]? payac= J? peyae- J pow 


=P(X< b)- P(Xs a)=F(b)- F(a) 
Similarly 


P(a<X<b)=P(a< Xs b)=P(asX< b)= J? feryae 
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7. Since F’(x)= f(x), wehave 
(x)= f(x) = dF(x)= f(x) de 
This is known as probability differenual of X. 


Remarks. I. It may be pointed out that the properties (2), (3) and (4) above 
uniquely characterise the distribution functions. This means that any function F(x) 
satisfying (2) to (4) is the distribution function of some random vartable, arid any 
function F(x) violating any one or more of these three properties cannot be the 
distribution function of any random variable. 

2. Often, one can obtain a p.d.f. from a distribution function F (x) by 
differentiating F (x) , provided the derivative exists. For example, consider 

0, for x < 0 
Fy(x)=)x, for 0< x< 
1, for x> 1 

The graph of F (x) is given by bold lines. Obviously we see ihe F (x) Is 

continuous from right as stipulated in (4) and we also see that F (x) is_not 
continuous at x= 0 andx= 1 and hence is not derivable atx = 0 andx= 1. 


F(x) 


Differentiating F (x) w.r.t. x, we get 


d , J 1,0<x< 1, 
i PY 9: otherwise 


{Note the strict inequality inO0< x< 1, since F(x) is not derivable at 
x= 0 andx= 1] 
Let us define 
_ »09< x< 1 
f(x)= ie 0, otherwise a 
Then f(x) isa p.d.f. for F. 
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Example 5-15. Verify that thé following is a distribution function: 
(), 
i(x x<-a 
F (x)= (3+ , ~ asgxea 
‘a 3s 
X> a 


(Madras Univ. B.Sc., 1992) 
Solution. Obviously the propertics (i), (ii), (iii) and (iv) arc satisfied. Also 
we ohserve that F-¢x ) Is continuous atx= a andx= — a. as well. 
Now 


‘ - | 
d "— -~acxs 
af x05 be Series 
0, otherwise 
=f(x), sa 


In order that F (x) is a distribution function, f(x) must be a p.d.f. Thus we 
have to show that 


JP) dea 


Now J" pts) de= J% poy de= 5-J% Lede= | 


Hence F ( x ) isad.f. 
. Example 5-16. Suppose the life in hours of a certain kind of radio tube has 
the probability density we 


{ose “3 , when x2 100 


= 0, when x< 100 
Find the distribution function of the distribution. What is the probabilty that none 
of three such tubes in a given radio set will have to be replaced during the first 150 
hours of operation ? What is the probability that all three of the original tubes will 
have been replaced during the first 150 hours? —(Dethi Univ. B.Sc, Oct. 1988) 
Solution. Probability that a tube will last for first 150 hours is given by 


P(X< 150)= P(O< X< 100)+ P(100< X< 150) 
l 150 100, 1 
= Sigg £0) 4e= Sidg 2 


~ 3, 
Hence the probability that none of the three tubes will have to be replaced 
aiuring the first 150 hours is (1/39 = 1/27. 


The probability that a tube will not last for the first 150 hours is 1 - z= 
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Hence the probability that all three of the original tubes will have to be ee 
during the first 150 hours is (2/3) = 8/27. 
Example 5-17. Suppose that the time in minutes that a person has to wait at 
a certain station for a train is found to be a random phenomenon, a probability 
function Specified by the distribution function, 
F(x)=0, for x< 0 
for OS x< 1 


for l< x< 2 


=-, for 2<x< 4 
=1, for x24 

(a) Is the Distribution Function continuous ? If 50, give the formula for its 
probability density function ? 

(b) What is the probability that.a person will have to wait (i) more than 3 
minutes, (ii) less than 3 minutes, and (iii) between 1 and 3 minutes ? 

(c) What is the conditional probability that the person will have to. wait for-a 
train for (i) more than 3 minutes, given that itis more than I] minute, (ii) less than 
3 minutes given that itis more than 1 minute ? (Calicut‘Univ, B.Sc., 1985) 

Solution. (a) Since the value of the distribution function ts the same at the 
points x= 0,x= 1,x= 2, and x= 4° given by the two forms, of" F (x) -for 
x< 0 and 0< x< 1, O€ x< Land lg x< 2,15 x<2 and 2< x<4, 
2<x< 4 andx2 4, the distribution function is continuous. 


I 


& |x Nl ws 


7) 


26 


Probability density function = f(x) = < F(x) 


f(x)= 0, for x< 0 
=, for OS x< 1, 
=0, for l<x<2 
=i, for 2< x< 4 
£0, forx2>4 


(b) Let the random variable X represent the waiting time in minutes. 
Then 


(i),Required probability = P (X> 3&)= 1-P(X< 3)=1- F (3) 
3 


© x ss ]— 


a, 
e 


(ii) Required probability = P(X < 3)= P(X< 3)» P(X= 3) 
3 


Ul 
" 
a 
oe) 
ws 

II 


(Since, the probability that a continuous variable takes a.fixed value 1 Is 


7250) a 
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P(1< X¥< 3)=P(1< X¢ 3) 

oN i of ce 

F (3) F()= 4 OG 

(c) LetA denote the event that,a person has to wait for more than 3 minutes 


and B the event that he has to wait for more than 1 minute. Then 
P(A)= P(X>3)=4 [ cf. (b), (i) ] 


(iii) Required: Probability 


P(B)= P(X>1)=1-P(X¥ 1)= 1-F(l)= 1- s= 
P(AN B)= P(X>3NX>I=P(X>3)=5 
(i) Required probability is 

P(ANB) 


| . 1421 
fe P(B) — 1/2 2 


(ii) Required pronenanyy CALE —P(B) 


Now P (A.0 B)= P(X< 3AX> =P (P< XS 3=FG)- F(I)= 2-5 


P(A(B y= — = 
Example 5-18. A petrol pump is supplied with petrol once a day. If its 
daily volume X of sales in thousands of litres is distributed by 
f(x)=5(l-xy, O< xsl, 
what must be the capacity.of its tank in order that the probability that its supply 
will be exhausted in a given day shall be 0-01 ? (Madras Univ. B.E., 1986) 
Solution. Let the capacity of the tank ( in ’000 of litres) be ‘a’ such that 


P(X>a)=001 + f) f(x) de= 001 


; ~x)|l- 
= i S(1-x)' de=001 9 or [5 aay = 0-01 


=> (l-a)’= 1/100 or 1- a= (1/100) 
= 1- (1/100)'" = 1- 0:3981 = 0.6019 
Hence the pee of the tank = 0-6019 X 1000 litres = 601-9 litres. 


Exam ple §-19.: Prove. that mean deviation is least when measured from the 
inedian. ' [Delhi Univ. B.Sc. (Maths. Hons.), 1989] 


Solution. If f(x) is thé probability function of a random variable X, 
as X< 6, then mean deviationM(A), say, about the point x= A is given 
by " 

: je 

M(A)=] | x-Al fa 
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= [A px- Al serdar + JP bx- Al fae 


= [AC A= x) f(xyae + [2 (x- AvFUy ar (1) 


We want to find the value of ‘A’ so thatM(A) is minimum. From the 
principle of maximum and minimum in differential calculus, M(A) will be 
minimum for variations inA if 

0M(A) 
0A 

Differentiating (1) w.r.t.’ A’ under the integral sign, since the functions 
(A- x) f(x) and(x- A) f(x) vanish atthe point x=, A”, we get 


eA = JA paar? foxy dx 8) 


2 
= 0 and _ 


> 0 ses (2) 


aMca), 


Also = [4 p(x) der = | f(x)de]., 


b 
| [? exp aee 7 
= 2 JA s(x) de- b= 2A) 1, 
where F(.) is the distribution function of X. Differentiating again w.r.t. A, we, get + 


0 as a 
aa MMs 27(A) ... (4) 


MCA _  “ 


= 0, on using (3) gives 


JO pxyde= J? f(x) de 


i.e., A is the median value. 
Also from ee we see that 


0 M(A ae 

0A? 
assuming that f(x ) does not vanish at the median value. Thus mean deviation is 
least when taken from median. 


+lf f(x ,®) is a continuous function of both variables x and 8, possessing 


2 
ms as and a and b are differentiable 


continuous partial derivatives axd m * 000x 


functions of 8, then 


2 [ [8 peu Je [2 Hace oe) S-som 
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EXERCISE 5 (d) 
1. (a) Explain the terms (i) probability differential, (ii) probability ‘density 
function, and (iii) distribution function. 

(b) Explain what is meant by a random variable. Distinguish between a 
discrete and a, continuous random variable. Define distribution function of a 
random variable and show that it is-monotonic non-decreasing everywhere and 
continuous on the nght at every point. 

[Madras Uniy. B.Sc. (Stat Main), 1987] 

(c) Show that the distribution function F (x) of a random: variable X isa 
non-decreasing funcuon of x. Determine the jump of F (x) at a point x, of its 
discontinuity in terms of the probability that the random variable has the value Xx. 

(Calcutta Univ. B.Sc. (Hons.), 1984] 
2. The length (in hours ) X of a certain type of light bulb may be supposed to 
be a continuous random variable with probability density function : 


f(x)= =, 1500 < x< 2500 
x 
= 0, elsewhere. 


Determine the constant a, the distribution function of X , and compute the 
probability of the event 1,700< X< 1,900. 


' Pe) (pa re 
Ans. a= 70,31, 250; F(x)= i mane 5) and 
P (1,700 <X < 1,900 )= F (1,900 )- F (1,700) = $(sean00- vane 


3. Define the “distribution function” (or cumulative distribution function) of 
a random variable and state its esscntial properties. 
Show that, whatever the distribution function F (x) of a random variable 
X,P(as F(x)s b)]= b-a, 0S a, b< 1. 
4. (a) The distribution function of a random variable X is given by 
- 1—~ (1+x e *, for x= 0 
ae for x< 0 
Find the corresponding density function of random variable X. 
(b) Consider the distribution for X defined by 


b 


0, for x< 0 
POD Lh ae for x> 0 
Determine P(x= 0) and P(x> 0). 


: [Allahabad Univ. B.Sc., 1992] 
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5. (a) Let X be a continuous random variable with probability dcnsity 
fynction given by 
ax, O< x< 1 
a, 1$x<2 
— axt 3a, 25 xS 3 
0, elsewhere 
(i) Determine the constant a. 
(ii) Determine F (x), and sketch its graph. 
(iii) If three independent observations are madc , what is the probability that 
exactly one of these three numbers is larger than 1-5 ? 
[Rajasthan Univ. M.Sc., 1987] 


f(x)= 


Ans. (i) 1/2, (iii) 3/8. 
(b) For the density fx (x) = ke **(1- e77*) [o,. (x), find the normalising 
constantk , fx (x) and evaluate P(X > 1). 
[Delhi Univ. B.Sc. (Maths Hons.), 1989] 
Ans. k= 2a: F (x)= 1-26 +67"; P(X>1)= 2€°-e 7" 
6. A random variable X has the density function : 
] 
f(x)= KT 
= Q, otherwise 
Determine K and the distribution funcuion. 
Evaluate the probability P (X 2 0). Find also the mean and variance of X . 
[Karnatak Univ. B.Sc. 1985] 


3, If -~<x<o 
x 


Ans. K=1, F(x)= + {wx a P(x20)= 1/2, Mean=0, 
Variance does not exist. y 
7. A‘continuous random variable X has the distribution function 
' 0, if x< 1 
F(x)=| k(x- 1), if l< xs 3 
1, if x> 3 


Find (i) k , (ii) the probability density function f (x ) , and (iii) the mean and 
the median of X. 


Ans. (i) k= =. (i) f(x) = ; (xt ty, 1g x83 


8. Given f(x)= i 1~ x), peas l 


Show that 
())k= 1/5, (ii) F (x)= 0 forxs 0 andF(x)=1-e”, forx> 0 
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Using F (x), show that 
(iti) P(3 <X < 5)= 0-1809, (iv) P(X < 4) = 0-5507, (v) P(X > 6) = 0.3012 
9. A bombing plane carrying three bombs flics directly above a railroad track. 
If a bomb falls within 40 feet of track, the track will be sufficiently damaged to 
disrupt-the traffic. Within a certain bomb site the points of impact of a bomb have 
the probability ‘density function : 
f(x)= (100+ x)710,000, when -100< x< 0 
= (100- x)/10,000, when 0< xs 100 
= 0, elsewhere 
where x represents the vertical deviation (in feet) from the aiming point, which is 
the track in this case. Find the distribution function. If all the bombs are used, what 
is the probability that track will be damaged ? 
Hint. Probability that track will be damaged by the bomb is given by 
P(|.X| < 40)= P(- 40< X< 40) 


= [o, Fad aes Jo fede 


=f 100+x 4, (40 100-x 16 
= J-40 10,000 0 10,000 5 


‘. Probability that a bomb will not damage the track = 1 ~ ie. 


Probability that none of the three bombs damages the track 


9. 
(2 3) = 0-046656 


Required probability that the track will be damaged = 1 - 0.046656 = 0.953344. 


10. The length of time (in minutes) that a certain lady speaks on the telephone 
is found to be random phenomenon, with a probability’function specifiéd by the 
probability densuy function f (x ). as 

f(x)= Ae’, for x20 
= 0, otherwise 

(a) Find the value of A that makes f(x) ap.df. 

Ans. A=1/5 

(b) What is the probability that the number of minutes that she will talk over 
the phone is 

(i) More than 10 minutes, (ii) less tian 5 minutes, and (iii) between_5 and 10 
minutes ? (Shivaji Univ. B:Sé., 1990] 


Ans. (i) a (ity — * , (iti) £ 
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11. The probability that a person will die in the time interval (¢; , ¢2) 1s given 
by 


bo : 
A IF f()dt, 
where A is aconstant and the function f ( ¢ ) determined from long records, is 


ev J t?(100-1)?, O< t< 100 
POs 0, elsewhere 

Find the probability thata person will die between the ages 60 and 70 assuming 
that his age is = 50. [Calcutta Univ. B.A. (Hons.), 1987] 

5:5. Joint Probability Law. Two random-variables X and Y are said to 
be jointly distributed if they are defined on the same probability space. The 
sample points consist of 2-tuples. If the joint probability function is denoted by 
Pyy(x,y) then the probability of a certain event E is given by 

Pyy(x,y)= P((X,Y)e E) w. (5:13) 
(X,Y) is said to belong to £, if inthe 2 dimensional space the 24tuples lic in the 
Borel set B, representing the event E. — 

5-5-1. Joint Probability Mass Function and Marginal and Condi- 
tional Probability Functions. Let X and Y be random variables on a sample 
space § with respective image sets X(S)= { x1 Xe preets Xa} and 
Y(S)= Vt > V20-005Ym We make the product set 

X (S) x Y (S)= { x eS eee Xa bX{ Wey We sees Ym} 
into a probability space by defining the probability of the ordered pair (xi, y;) 
to be P(X=x;, Y= y;)- which we write p(xi,yj). The function p on 
X (S) x Y (S) defined by 
pj= P(X=x0 Y= y,)= p(x,y;) vw. (5°14) 
is called the joint probability function of X and Y and is usually represented in the 
form of the following table : 
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n 


m 
Lr Lf p(x%,y)= 1 
i= 1 j= 1 


Suppose the joint distribution of two random variables X and Y is given,-then 
the probability distribution of X is determined as follows : 
px (x)= P(X =x;)= P[X=x, NY=yi]+ P[X =x; NY =yorl+... 
+P([X =x, ANY=y;)+...+ P(X =x 0 -Ym | 
= Pat Pit ...F Pyt ... + Pim 
m m . 
= L py= 2 p(xi, y;)= pi- . (5:14) 
j=i j=l ' 
and is known as marginal probability function of x. 
n na m 
Also Di.= Pi.t Pa.t ... + DPa.= > Dy Pij = 1 
iz] iz] j=l 
Similarly, we can prove that 


n na 
Pr(y)= P(Y= yj)= Upy= Lp x, yy)= pj .. (5:14 b) 
i=] i=l 
which is the marginal probability function of Y. 
Also 
Pikveal yoyqut = Oss. PGaye Bi 


PLY= yl PY) Bi 
This is known as conditional probability function of X given Y= yj 
Similarly 
P(X,Y) _ Pi 
P(Y= y;| X= x;)= ———= — (5-14-¢ 
yy | ] D(x) Di ( ) 
is the conditional probability function of Y given X = x; 


nr 
Risg: (ee Ci Ee Ps ee ee Pe Pe 


t=] P.j P; Pj 
Similarly 
n 
y he | 
j=l Bi. 
Two random variables X and Y are said to be independent if 
P(X=%,Y= yj)= P(X=x;).P(Y= y;), ... (5:14 d) 


otherwise they are said to be dependent. 

5-5-2, Joint Probability Distribution Function. Let (X , Y ) be a two- 
dimensional random variable then their joint distribution function is denoted by 
Fyy(x,y) and it represents the probability that simultaneously the observation 
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(X, Y) will have the property (X <x and Y¥< y), i.e., 
Fyy (x, y) = P (-0 <X Sx, -wo< Y<y) 


x y 
=| 1 Sxy &, y) dx dy ... (5-15) 
OS (For continuous variables) 
where Fxy &% y)2 0 
And | fxy@, y) dxdy=1 or % Tfiwy)=1 
—oo % —oo x ys 


Properties of Joint Distribution Function 

1. (i) For the real numbers a), 5), az and b 

P (a, <XSby,a,< YS bo) = Fyy (6), b2) + Fyy (aj, a>) 

— Fyy (4), 62) — Fxy (6), 42) 
[For proof, See Example 5-29] 

(ii) Let a; < a2, b, < b>. We have 

(X Sa), Y San) + (€,; <X Sd, YS ay) = (X Sb), YS ay) 
and the events on the L.H.S. are mutually exclusive. Taking probabilities on 
both-sides, we get : 

F (ay, a2) + P (a, <X Sby, YS ay) =F (by, a2) 

=> F (by, a2) — F (ay, a2) = P (a, < X S$ by, YS ay) 

F (by, ay) 2 F(ay, a2) [Since P (a,< X $ by, YS ay) 20] 

Similarly it follows that 

F (ay, by) — F (ay, 42) = P (X Sa), a. < YS bo) 20 

=> F (a), b2) 2 F (aj, a2), 
which shows that F (x, y) is monotonic non-decreasing function. 

2. F (--, y)=O=F (x, - ©), F (+ 0,+0) = |] 

3. Ifthe density function f(x, 'y) is continuous at (x, y) then 

2 
g : 5 =f y) 

5-5-3. Marginal Distribution Functions. From the knowledge of joint 
distribution function Fyy (x, y), it is possible to obtain the individual 
distribution functions, Fy (x) and Fy (y) which are termed as marginal 
distribution function of X and Y respectivély with respect to the joint 
distribution function Fyy (x, y). 


Fy (y= P(X Sx) =P (XS x, ¥<e)= lim Fyy (x y). 
:2= 


= Fyy (x, 0) ... (5°16) 
Similarly, Fy (y)=P (Ys y) = P(X< 0, YSy) 
= lim Fyy (x, y) = Fxy (e, ) 
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Fy(x) is termed ds the marginal distribution function of X corresponding to 
the joint distribution function yy (x,y) and similarly Fy (y) 1s called marginal 
distribution function of the random variable Y corresponding to the joint distribu- 
tion function Fyy (x,y). 

In the case of jointly dierete random variables, the marginal distribution 
functions are given as 

Fy(xyy= ¥ P(X<x,Y=y), 
y 

FyQ)=  PX=x,¥Sy) 
x 

Similarly in the case of jointly continuous random variable , the marginal 
distribution functions are given as 


Fy (x)= ae eae fey (,Y) ay} dx 


Fy(y)= [> J fer») ax dy 


5-5-4 Joint Density Function, Marginal Density Functions. From the 
joint distribution function F xy (x,y) of tWo dimensional contunuous random 
variable we get the joint probabilty density function by differentiation as follows : 


fxy(x, y= OF (x,y)/dxdy 


<X< <Y< 
Hien ox, ys Ys y+ dy) 
dx dy 

Or it may be expressed in the following way also: 

"The probability that the point (x , y ) will lie in the infinitesimal rectangular 
region, of area dx dy is given by: 

P{x-Ldx<XSxtldx,y-Ldys¥syttdy}=dFer(x,y) 
and is denoted by fxr (x , y) dx dy, where the function fxr (x, y) is called the joint 
probability density function of X and Y. 

The marginal probability function of Y and X are given respectively 


frQ)= se fev (x,y) dx (for continuous variables) 
= y; Pxy (x,y) (for discrete variables) 

x 
(5-17) 


and fx (x) = I fxy(x,y) dy (for continuous variables) 
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= > pxy(x,y) (for discrete variables) 
y 
(5-17a) 


The marginal density functions of X and Y can be obtained in the following 
manner also. 


fe (x) = dfx ax). =|. 


fey (x,y) dy 


and f(y) = sin J. fer. y) de 


Important Remark. If we know the joint p.d.f. (p.m-f.) fry (x, y) of two 
random variables X and Y, we can obtain the individual distributions of X and Y 
in the form of their marginal p.d.f.’s (p.m.f?s) fx (x) and fy (y) by using (5-17) and 
(5-17a). However, the converse is not true i.e., from the marginal distributions of 
two jointly distributed random variables, we cannot determine the joint distribu- 
tions of these two random variables. 

To verify this , it will suffice to show that two different joint p.m.f’s (p.d.f.’s) 
have the same marginal distribution for X and the same marginal distribution for 
Y . We give below two juint discrete probability disiribuitions which have the 
same marginal distributions. 

JOINT DISTRIBUTIONS HAVING SAME MARGINALS 


Probability Distribution I Probability Distribution II 


~ 
Hi 


. (5-17 5) 


fx) | 050 0-50 


As an illustration for continuous random variables, let (X , Y). be continuous 
rv..with joint p.d.f. 


fev, yb=xty; OS,y)S1 (5-17 c) 
The marginal p.d.f."s of X and Y are given by: 
1 
0 


5 2 
tx w=] f@,y) dy= J) (x+y) dy= y+ 
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= feQexth ; Osx 
Similarly fr (9) = I fe.yde=yti ; OS y <1 
Consider another continuous joint p.d.f. 


g(x,y) = («+3 \(y+4) - OS (x,y)s1 .. (517 €) 
Then marginal p.d.f.’s of X and Y are. given by : 


$1 (x) = i g(x,y.) dy= [x+ 1} ie [y+ 5 \ay 


| 2 j : 
= 1)J/y,1 
. [x+ | ears 0 
=> gi (x)= x+ : : O<xsl 
o ho (S7f) 
Similarly g2(y)= y+ 5 ; Os ysl ] 


(5-17 d) and (5-17 f) imply that the two joint p.d.f.’s in (5-17 c) and (5-17 e) 
have the same marginal p.d.f.’s (5:17 d) or (5:17). 

Another illustration of continuous r.y.’s is given in Remark to Bivariate 
Normal Distribution, § 10-10-2. 

5-5-5, The Conditional Distribution Function and Conditinal Prob- 
ability Density Function. For two diamensional random variable (X , Y ) , the 
joint distribution function Fx r(x , y) for any real numbers x and.y is given by 

Fyy(x,y)= P(X¥Sx,YSy) 

Now Iet A be the event (Y < y) such that the event A is said to occur when Y 
assumes valucs up to and inclusive of y. 

Using conditional probabilitics we may now write 


Fry(x,y= ]*  P(A|X=x] dFy (0) 5-18) 


The conditional distribution function Fy\x (y|x) denotes the distribution 
function of Y when-X has already assumed the particular valuc «. Hence 
Fryix (ylx)= PLY sy|X=x)= PL A|X=x) 
Using this expression, the joint distribution function Fxy (x, y) may be 
expressed in terms of the conditional distribution funcuon as follows: , 


— 0o 


Fare, W= J Fix(yle) dF) (518.4) 
Similarly . 
FxyQ.syy= JX Friy(xly) dFr(y) (5-18 b) 


a 
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The conditional probability density function of Y given X for two random 
variables X and Y which arc jointly continuously distributed is defined as follows, 


for two real numbers x and y: 
<2 
fix (YIZ)= gy Fix (yl) 


Remarks: 1. fx (x)> 0, i 


frix(y|x)= — 


Proof. We have 
Fare. y=]* Frix(ylxy dFx (0) 


= J* Fnx(ylx) feQae 
Differentiating w.r.t.x, we get 
oO 
5, fer (xsy)= EME ane 


Differentiating w.r.t. y, we get 


- [2 Fry, |= frix (yx) fx @) 
aa fer ®, y= Srx (yb) fe 
rd frix(y x)= AED 


2. If fy (y) >0, then 


fair (aly )= AEE 
3. In terms of the differentials, we have 
P(x<XSx+dx|y<YSyt+dy) 
_ P(x<XSxt+d, y<¥sytdy) 
7 P(y<Ysy+dy) 
_ fxr@,y) dx dy _ 
= FO far (aly) ae 


(519) 


Whence Sxjy (x | y) may be interpreted as the conditional density function of 


X on the assumption Y = y. 


5-5-6. Stochastic Independence. Let us consider two random variables X 
and Y (of discrete or conunuous type) with joint p.d.f. fry (x,y) and marginal 
p.d.f.’s fy (x) and gy (y) respectively. Then by the compound probability theorem 


for Y= LO ala 
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where gy (y |x) 1s the Conditional p.d.f. of Y for given value of X = x. 


If we assume that g (y|x) does not depend.on x, then by the definition of 
marginal p.d.f.’s, we get for continuous F.v.’s 


e(y)= J" fix.y) dx 
= J" fe(xya(ylx) de 


= g(y|x) liga fe(x) & 


[since g (y|x) does not depend on-x | 


= g(yl[x) [- f(.) isp.d.f. of X ] 
Hence 
ely)=a(ylx) 
and fxy(x,y)=fx(X) ar(y) a (*) 


provided g ( y| x ) does not depend on x. This motivates the following definition 
of independent random variables. 


Independent Random variables. Two r.v.’s X and Y with joint p.df. 
fxy (x,y) and marginal p.df.’s fx (x) and gy(y) respectively are said to be 
stochastically independent if and only if 

fxv(4,Y)= fe) ar () ... (5-20) 

Remarks. 1. In terms of the distribution function, we have the following 
definition : 

Two jointly distributed random variables X and Y are stochastically inde- 
pendent if and only if their joint distribution function Fy,y(.,.) is the product of 
their marginal distribution functions.Fx(.) and Gy(-), ie., if for real(x,y) 

Fxy(x,y)= Fx (x). Gr (y) .. (5-20 a) 

2. The variables which are not stochastically independent are said to be 
stochastically dependent. 

Theorem 5-8. Two random variables X and Y with joint p. df. f(x,y) are 
stochastically independent if and only if fxr (x,y) can be pxpressed as the 
product of a non-negative function of x alone and a non- Sa function of y 
alone , i.e., if , 

fry, y= hy (x) ky (y) (5-21) 
where h(.)> 0 and k(.)> 0. 
Proof. If X and Y are independent then by definition, we have 


fxr, y= fx x). gr) 
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where f (x) and g (y) are marginal p.d.f. of X and Y respectively. Thus condition 
(5:21) is satisfied. 

Conversely if (5-21) holds, then we have to prove that X and Y are inde- 
pendent. For continuous random variables X and Y , the marginal p.d.f.’s are given 
by 


f= J” fey) dy= J" he ko) dy | 
=h(@) J° ky) dy= cr h(x), say rn) 
and -ery)= J", fe.) dx= J" hk O) 


= ky) J° he) de= co k(y), say wu (*4) 
where c, and cz are constants independent of x and y. Moreover 


‘es ie f(x,y) dx dy= 1 


= [7 J A@ ko) de dy= 1 
= (J h(x) ax}( J. k (y) dy |= i: 
=> C2 i= 1 woe (#**) 
Finally, we get 
Fay (% sy) = hy (x) ky (Y) = 1 €2 Ax (x) ky) [ using (+**)] 
= (C1 hx @)) (cr kr &)) 
= fx(x) gr () [ from (*) and (**)] 


=> X and Y are stochastically independent. 


Theorem 5-9; If the random variables X and Y aré stochastically inde- 
pendent, then for all possible selections of the corresponding pairs of real numbers 
(a1, bi) , (G2, b2) where a;< 6; for alli = 1,2 and where the values + ~ are 
allqwed, the events (a, <X < b:) and (a2 < Y < bo) are independent, i.e., 

P [(a, <X $b) N(ar<Y¥ Sb) ]= P(a<XSb) P(a<YSbh) 

Proof. Since X and Y are stochastically independent, we have in the usual . 

notations 
fry (%, y= fx) gr (y) »- (#) 


In casé of continuous r.v.’s , we have 


P ((ai<X <1) N(ar< Y¥ <b) |= fe te f(x,y) dx dy 
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(: 2 fx (x) dx 2 &y (Y) ay | [ from (*)) 


5, 
@ 


P(a< XS b)P(a< YS bn) 
as desired. 

Remark. In case of discrete r.v.’s theorems 5-8 and 5-9 can be proved on 
replacing integration by summation over the given range of the variables. 


Example 5.20. For the following bivariate probability distribution of X and 
Y, fina 


@P(X<1,Y=2), @diP(X <1), (iii) P(Y=3), (iV)P(Y <3) and 
(v)P(X<3,YS4) 


(t) P(XS 1, Y=2)= P(X=0, Y= 2)+ P(X= 1, Y= 2) 


oe, et oe 
=a ae ag 
(ii) P(X<s 1)= P(X=0)+ P(X= 1) 
eee ae! (From able bk) 
~ 32° «16° «8 
(iii) PCY= 3)= a ( From above table ) 
(iv) P(Y< 3)= P(Y=1)+ P( Y= 2)+ P(Y= 3) 


3... 3+, bh... 23 


~ 32° 32° © 
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(v) P(X<3, YS 4)=P(X=0,YS 4)+ P(X=1, YS 4) 
+ P(X=2,Y<¢ 4) 
32" 32 16 16 8 8 
Fi ( i ree ores arewe Z|; ee 
32 32" 64° 64 16 
Example $21. The joint probability distribution of two random variables X 
andY is given by: — 
(x,y)= nee e ae x= 
Ep Naae. n(ntl1l)’ y= 1,2,..,x% 
Examine whether X and Y are independent . (Calicut Univ. B.Sc., 1991) 


Solution. The joint probability distribution table ean with the marginal 
distributions of X and Y is given below. 


n(ntl) antl) 
2 
n( nt 1) 


n(nt+1)[n(nt1) - 
2 2 
n(nt1) | a(ant 1) 


_ 2 2x 2 2x 3 ar 
n(ntl) n(nt+1) anCat ly 7 —- 


px{x) 


Note that y= l, 2,...,x 

When x=1I, y=1; whenx= 2, y= 1,2; whenx= 3,y= 1,2,3 and 

soon, ° 
From the above table, we see that 


% 


Pxv(X,Y)# px(x)pr(y) ; Vi xsy 
=> X and Y are not independent. 
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Example 5:22. Given the following bivariate probability distribu- 
tion, obtain (i) marginal distributions of X and Y, (ti) the conditional 
distribution of X given Y = 2. 


( Mysore Univ, B:Sc., Oct. 1987 
Solution. : 


Ys *~s M5 


Ms As Ms 
*s Ys Ms 


(i? Margital distribution ial X. From’ the ae table, we get 
6 2 1 


PRX B~V)= 555 53 P(X= ORT 2 3 P(X= I)= 4 ae 
Marginal distribution of Y : 

eee ee ee a ee ee ic Dd 

P(Y=10)= i) P(¥=1)=—e=4; PUrs 2=T5= 3 


(ii) Conditional distribution of X given Y = 2. We have 
P(X=x0Y=2)= P(Y=2).P(X=2| Y= 2) 
ai Pixss| yaoje = 2o7 


P(t=, 1 
oo “« P(X=-1AY=2) -2/15 2 
P{x= = “11 ¥= aye Pi( Y= 2) ~ 1/3 5 


Example 8 23;,X and Y are.two, andomjpariables having the joint density 
function, f (ory (2x+ -y) , Where x Andean. assume only the integer 


values 0.1 dnd2 F "ind the conditional disiribunonoj oft for. Xex 
[South ujarat Univ. BISc., 1988] 
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Solution. The joint probability function 
f(x,y)= sx 24+ y); x= 0,1,2; y= 0,1,2 
gives the following table of joint probability distribution of X and Y. 


JOINT PROBABILITY DISTRIBUTIONS (x,y) OF ¥ AND Y 


0 1/27 2/27 
270 3/27/27 
4/21 «5/27—s«6/27 


For example f ( 0, 0) = = (0+ 2x-0)= 0 


-fQ, 0) = (0+ 2x = = ; f(2,0= aa (0+ 2 x =— a 
and so on. 
The marginal probability distribution of X 1s given by 


‘ fx (x)= EINE y), 


and is tabulated in last column of above table. 
The conditional distribution of Y for X = x is given by 
YeviX=ex)= — 
and is obtained in the following table. 


CONDITIONAL DISTRIBUTION OF Y FOR X =x 


5/15 


Example §-24. Two discrete random: variables X and Y have the joint 
probability density function : 


Me*“p(i-py? 0.1.2 0.1.2 
p(x,y)= y '(x- y)! » y= 9 Ss pecgX 4 x= 9 So a9 oes 
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where X, p are constants with )>’0 and0< p< 1. 
Find (i) The marginal probability density functions of X and Y. 
(ii) The conditional distribution of Y fora given X and of X fora given Y. 
(Poona Univ. B.Sc., 1986 ; Nagpur Univ. M.Sc., 1989) 
Solution. (i) 


7 = _ * xo-k _ z-y 
mx(x)= ¥ p(xy= y Seed 
y= 0 


ta yi(z-y)! 


Ww sh, Ft ) x l- p)"~” \e Ses . zs 
= 28 y SPA FES Gp (1s py? 


Ot Ee Va x! 


y=0 
x -A 
= Nev = 0217 2, 
x! 
which is the probability function of a Poisson distribution with parameter A. . 
as i Me _ z-y 
= xX; = ; 
rOy= X plew= “Gry 


x=Q x= y 


_ Opye* — Cp? Apr e® parm 
ya: sy. MA y! 


which is the probability function of a Poisson distribution with parameter A p . 
(ii) The conditional distribution of Y for given X is 


in en EE 
PU ee lal? p(l- py 7x! 


px(x) oy S(x-y) Mee 
= ayy PUPP = GPA =pY?. x>y 
The conditional probability distribution of X for given Y is 
Pxiy(xly)= a 
x =A z~ 
_A ; Ae, ee, (cf. Part (i) ) 
= ea” > g=ul-p,x>y 


Example 5-25. The joint p.df. of two random variables X and.Y is 
given by : 


_ 2 91+ x+y) shee 
FO" Tae xy tye? l0< yeu 
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Find the marginal distributions of X and Y, and the conditional distribution of Y 
forX =x. 
Solution. Marginal p.d.f. of X is given by 


fe(xy=|o fey dy 


9 [ (1+ y)tx |, 
~ 21+xf 0 (14+ yy 


= reer. Jo [a+ yy?+ x tyr lay 


0 


20 a 


+ —— 
0  |3atyy 


9 | = | 
2(1+ xy L]/20+yy 


__ 9 , [444 
2(l+x)y 12 3 


Since f(x,y) is symmetric inx and y , the marginal p.d.f. of Y is given by 


w= Io flay) dx 


32. 3+2y , us 
4 Cay OF 


The conditional distribution of Y for X = x is given by 


fr (Y=y | X=x)= ey) 


Sx (x) 
__ 9(ltxt+y) | 41 +2) 
2(l+x) (1+y)? 3(3+2x) 
_  6(1+x+y) _ 


~ (1ty)t (3 42x)” wearer 


Example 5-26. The joint probability density function of a two-dimen- 
sional random variable (X,Y) is given by 


f(x,y)=23; 06 x<1,0< y<x 
= 0, elsewhere 
(i) Find the marginal density functions of X and Y, 


(it) find the conditional density function of Y given X=. x and conditional 
density function of X given Y = y, and 


5:56 Fundamentals of Mathematical Statistics 


(iit) (check for independence of X and Y. 
| (M.S.Baroda Univ. B.Sc., 1987; Karnataka Univ. B.Sc., Oct. 1988} 
Solution. Evidently f(x,y )2 O:and 


Ip If 2aeay=2 J) xara 
(i) The marginal p.d.f.’s of X and Y are given by 
fe(x)= J". far (x,y) dy= JX 2ay= 2x, O<x<! 
= ‘0, elsewhere 
fr y)* J° for (xy) de= J 2dx= 2(1- y), O0<y<! 


='0, elsewhere 
(ii) The conditional density function of Y given X is 


+) - fre,y)_ 21 
frix(ylx)= Klay Dee O< x< 1 
The conditional density function of X given Y is 
Rive jee ee , O<y<l 


fr(y) ~ 2(i-y)) (l-y)’ 
(iti) Since fy(x) fr(y)= 2(2r)(1-y pega y), X and 'Y are not 
independent. 
Example 5-27. A gun is aimed at a eeriain point (origin of the coordinate 
system). Because of the random factors, the actual hit point can be any point (X,Y) 
-in a circle of radius R about thé origin. Assume that the joint density of X and Y is 
constant in this circle given by : 
fa(x y)=k, forx+y< RP 
=Q, otherwise 
(i) Compute k, (ii) show that 


fa(x)= {1-(5) 


= 0, otherwise 


(Calcutta Univ. B.Sc.(Stat. Hons.),1987] 
Solution. (i) The constant k is computed from the consideration that the total 
probability is 1, ie., 


172 
, for -R<xsR 


j J #ex.9) aeay= 1 = ff kdray=1 


—o —o x+y'sR’ 
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> 49) kdedy=1 
i] 


where region / is the first quadrant.of the circle 


xv+ y= R’. 


2 2 
=> A ti Lady) c= 1 


=> 4k Io VR?- ¢ a= 1 


§-S7 


R* ._,(x\|R 
7 a, 1/2 = 
=> 4k XNR-x + — sin (= Io ] 
R* ; l 
= 4k ( as = aeerers 
fa (x,y)= W(nR?) 3) vr +y’s R* 
=Q, otherwise 
R?-x’ 
be l 
(it) fe(x)= J" fl.ydy=—e J | - dy 
nt A sae 
1/2 1/2 
[ because 2+ ys R? = ~(R- 2) < ys (R- x) | 
9) Remix ”) 1/2 
= — 1-dy= R-x 
rR’ aR 


Example 5:28. Given: 


f(x,y)= EP [o,0)(4) - Kony (y) s 


find (i) P(X>1), i) POX < VX << 2Y), (ii) P< X 4+ <2) 


[Delhi Univ. B.Sc. (Maths Hons.), 1987] 


Solution. We are given: 


f(x,y)= EV 5 OS x<00, OS y<o 


=(e*)(e”) 
= fx(x).fr(y) ; 
= xX andy are independent and 


fx(x)= e*; x20 and fr(y)= me y20 


Os x<0o, 0S y<oo 


w- (1) 


(2) 
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P(X>1)=] fr(xyde=f eta 


(i) 
1 1 


ae ee 
~ baa ly e 
i _ PUXK< YO X< 2Y) 
(ii) P(X<Y|X< 2Y)= P(X<2Y) 
_ P(X<Y) 
~ P(X < 2Y¥) ~ Q) 


|ay= ~fer(er- l jay 
oY 
aaa 
0 2 2 
oo 2y ce) 
P(X< 2Y)= | J Hay) deo ~fer(e?-1dy 
oLO 0 
e°? ad 1 2 
=—-|—™ie?’ = ]~-=+- 
~3 3 3 
Substituting in (3), : 
1/2. 3 


P(X<Y|X<2Y)= W374 


(iii) P(1<X+Y< 
. > Y 
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“|| ne 16 a : | [x 1 


l- 
l 2 2—x 
“fe J fe ile J a fe 
0 1-x 1 0 
1. fr 
“flere are [Ele 1a 


iT) 

( 
oen 
®, 
WV 

| 

®& 

t 
Nae” 

o—_—_—._ 

—" 

| 
— Cy 
o—m 

©, 

b> 

1 

3, 

to 
Wea” 


HH 

| 
on 
®& 

| 

® 
a.) 
ya 
Qo = 
| 

®& 

1 
w 
a 
+ 
a) 

( 

" 
mm 


2/e - 3/e’ 

Example 5-29. (i) Let F(x;y). be the df. of X and Y. Show that 
P(a< Xs b,c< YS d)=F(b,d)- F(b,c)—- F(a,d)+ F(a,c) 
where a,b,c,d are real constants a< by c< d. 

Deduce that if: F(x,y)= 1, for x+ 2y2 1 

F(x,y)=0, for x+ 2y< 1, 
then F (x,y) cannot be joint distribution function of variables X and Y. 
(ii) Show that, with usual notation : for all x,y, 
Fy(x)+ Fy(y)- l< Fyy(x,y) < VFx (x) Fr(y) 
* (Delhi Univ. B.Sc. (Maths Hons.), 1985 ] 
Solution. (i) Let us define the events : 


A:{Xsa);B:(Xs 6); C={Y¥<c};D=(YEd); 
fora< b; c<d. y (b,d) 
P(a< Xs bncK<YEa) d A 
= P[(B-A)N(D-Q)] 
=P([BON(D-C)-AN(D-O)} _(#) : (D.C) 


(By distributive property of sets) 
We know thatifEcF>EQF=E, then 0 a b xX 
P(F-E)= P(EQ F)= P(F)- P(EQF)= P(F)- P(EY _ ... (e*) 
Obviously ACB => [AN(D-C)] ec [BN(D-C)] 
Hence using (**), we get from (*) 
P(a< Xs bNc< Ye d)=P[Bn(D- CJ1- P{AN(D- C)] 
= P((BND)-(BONC)j- P[(AND)- (ANC)) 
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= P(BND)= P(BAC)- P(AND)+ P(ANC) ... (#**) 
[ On using (**), sinceeC c D> (BNC)C(BQND)and(ANC)CAND)}) 
We have: 
P(BOD)=P[XS OQ Ys dj=F(b,d). 
Similarly 
P(BOC)= F(b,c) ; P(AND)= F(a,d) and P(ANC)= F(a,c) 
Substituting in (***), we get: 
Pla<xXs bNc<YS d)=F(b,d)-F(b,c)-F(a,d)+F (a,c)... (1) 
We are given F(x,y)=1, for x+ 2y2 1 (2) 
=Q, for x+ 2y< 1 i 
In (1) let us take : @= 0, b= 1/2,3 c= 1/4, d= 3/4 St. a<b and 
c< d. Then using (2) we get: 
F(b,d)=1;F(b,c)=1;3 F(a,d)=1; F(a,c)=0 
Substituting in (1) we get: 
P(a< Xs bnNc< YE d)= 1-1-1+ 0=-1; 
which is not possible since P(.) 2 0. 


Hence F (x,y) defined i in (2) cannot be the distribution function of variates 
X and y, 


ee Oe x};B={Ys y} 

Then P(A)=P(XS x)= Fx(x); P(B)=P(YS y)= Pro) (3) 
and P(AQNB)= P(XS xQNYSy)= Fxy(x,y) = 
(ANB)CA => P(ANB)S P(A) => Fxy(x,y)S Fx(x) 
(ANB)CB = P(AQNB)S P(B) => Fxy(x,y)S Fy(y) 
Multiplying these inequalities we get.> 

Fey y (x,y)S Fx(x)Fr(y) => Prer(x,y)$ VFx (x) Fr(y) vo (4) 
Also P(AUB)< 1 => P(A)+ P(B)- P(AQB)S 1 

=> P(A)+P(B)—1< P(ANB) 

=> Fy (x)+ Fy(y)-1 < Fyy (x,y) (9) 
From (4) and (5) we get: 

Fx(x)+ Fr(y)-1 S$ Frv(x,y) $ Fx (x) Fry , as required. 


Example 5-30. Jf X and Y are two random variables on joint density 
‘function 


f(x,y)= ; (6- ~-y);0<x<2,2< y<4 


= 0, otherwise 
Find (i) P(X <1 AY <3), (ii) P(X+Y <3) and (iii) P(X <1) Y< 3) 
(Madras Univ. B.Sc., Nov. 1986) 
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Solution. We have 


(i) P(X< 1NY< 3) fi. [P. fi.y) ax dy 


00 [Uo 


_ (13 bees _ 
= J, i g (6 x-y) d&dy = 
(ii) The probability that X +Y will be/less than 3 is 


P(X+¥<3)= |} _ *2(6- —-x~y) dedy = 


(iii) The probability that X < 1 when it isknown that Y < 3 is 


P(X<1N¥<3)_ 3/8 
Xe TIES) re eET iii a7 5 5 


[Pcr< 3)=]5 . $(6-x-y) drdy=2 | 
Example 5-31. If the joint distribution function of X and Y is given by: 
F(x,sy)= 1l-e te + €""; x>0, y> 0 
= Q; elsewhere 
(a) Find the marginal densities of X and Y. 
(b) Are X and Y independent ? 
(c)FindP(X< 1QY<s 1) andP(X+ YE 1). (I.C.S., 1989) 
Solution. (a) & (b) The joint p.d.f. of the r.v.’s (X , Y) is given by: 


far(x,y)= PEGI. 9 [er coon] 


ox oy ox 
=@€"%: ¥>0, y20 
= 0; — otherwise wv (i) 
We have 
fev (x,y) = €*.€7= fx(x)fr(y) .» (ti) 
where fkke(xy=e*;x20;) frl(yy=e’; y20 ... (iii) 


(ii) = X and Y aré independent, 
and (iti) gives the marginal p.d.f.’s of X and Y. 


() P(Xstn Ys)! I) p2,y)deay 
= th * de | (Io dy 
= (1+ ety 
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lyvl-x 
p(x+y<i=] J rer=J| [rey 49 | a 


x+ysi- ol o 


0 


Example §-32. Joint distribution of X and Y is given by 


-(8+y'), 
f(% y= 4xy e Y*; x20, y20. 
Test whether X and Y are independent. 


For the above joint distribution , find the conditional densit: of X given 
Y=y. (Calicut Univ. B.Sc., 1986) 


Solution. Joint p.d.f.of X and Y is 


-(2 + 9’) 
f(x,y)=4xye "77; x20, y2 0. 
Marginal density of X is given by 


r- ) =) 2 2 
fxy= J f(xydy= J aay HEY) ay 
0 0 


2 2 
= 4x e* J ye -dy 


0 
=4e* Jer. (Pur y?= ¢ ) 
0 
= Qy- oe oer ad Fi 


2 
=> fi(x)=2xe”*; x20 
Similarly, the marginal p.d.f. of Y is given by 


rg 2 
hiy= | f(x,y) de = 2y e” ; y20 
0 


Since f(x,y)=fi(x) > f2(y), X and are independently distributed. 
The conditional distribution of X for given.Y is given by : 
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- — yy- Ley) 
f(X=x| Y= y) Ay) 


2 
=2xe*: x20. 


EXERCISE S(e) 


1. (a) Two fair dice are tossed simultaneously. Let X denote the number on 
the first die and Y denote the number on the second die. 
(i) Write down the sample space of this experiment. 
(ii) Find the following probabilities : 
(1) P(X+Y=8), (2) P(X+¥28), (3) P(X=YY, 
(4)P(X+ Y=6| Y=4), (5) P(X- Y=2). 
(Sardar Patel Univ, B.Sc., 1991) 
2. (a) Explain the concepts (i) conditional. probability, (ii) random variable, 
(iti) independence of random variables, and (iv) marginal and conditional prob- 
ability distributions. 
(b) Explain the notion of the joint distnibution of two random variables. If 
F(x , y) be the joint distribution function of X and Y , what will be the distribution 
functions for the marginal distribution of X and Y ? 
What is meant by the conditional distribution of Y under the condition that 
X =x? Consider separately the cases where (i) X and Y are both discrete and 
(ii) X and Y are both continuous. 
3. The joint probability distribution of a pair of random variables is given by 
the following table :- 


\ 


Find : 

(i) The marginal distributions. 
0.1 O1 02 (ii) The conditional distribution of X given 
Y=1. 


4. (a) What do you mean by marginal and conditional distributions ? The 
following table represents the joint probability distribution of the discrete random 
variable (X , Y ) 


({) Evaluate marginal distribution of X. 
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(ii) Evaluate the conditional distribution of Y given X = 2, 
(Aligarh Univ. B.Sc., 1992) 
(b) Two discrete random variables X and Y have 


P(X=0,Y= 0)= 2; P(X=0, Y= l= 


1 
9 
P(X=1,Y=0)=1;: P(X=1, Y= l= 7 


Examine whether X and Y are eel 
(Kerala Univ. B.Sc., Oct. 1987) 
5. (a) Let the joint p.m.f. of X; and X2 be 


p(x ,%2)= ar ;m= 1,2,3 3 m= 1,2 


= 0, otherwise 
Show that marginal p.m.f.’s of Xi and X2 are 
2x, +3 6+ : 
pPi(x)= ey mar m= 1,2,3 3; p2(m)= oi m= 1,2 
(b) Let 
f(m,m)= C(mm+ &) 3 0<(1, m<l 
= QQ, elsewhere 
(i) Determine C. 
(ii) Examine whether X, and X2 are stochastically independent. 
(a)=c( Amt € "), 
Ans. (i) C = m4 , (ii) oo 


g (x)= C(in+ e-1) 


Since g(x ).g (x2) # f(a ,X2), X: and X2 are not stochastically inde- 


pendent. 


6. Find k so that f(x,y)= kxy, 1S x< y<2 will be a probability 


density function, \ (Mysore Univ. B.Sc., 1986) 


2 2 
Hint. JJ f(x,y) dedy=1 => falls 6 |tc = 1 => k= 8/9 
1 x . 
f(x,y)= €&"%'” : x20, y2 0 
fA = 0 velseihere 


is the joint probability density function of random variables X and Y , find 


(i)P(X< 1), (ii) P(X> Y), and (iii) P(X+ Y< 1). 
Dp Ae eee 
Ans, (i) 1l- O° (tt) 7 and (iii) 1 - 
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(b) The joint frequency function of (X , Y) is given to be 
f(x,yj=Ae*’; OS x< y, OSy<t+o 
= 0 ; otherwise 
(i) Determine A. 
(ii) Find the marginal density function of X. 
(iii) Find the marginal density function of Y. 
(iv) Examine if X and Y are independent. 
(v) Find the conditional density function of Y given X = 2. 
” [Madras Univ. B.Sc. (Main Stat.), 1992] 
(c) Suppose that the random variables X and Y have the joint p.d_f. 
wy J Re(Ca- Y), OS x< 2, -xX< VK x 
fir.y)= | 0, elsewhere . 
(i) Evaluate the constant k . ) 
(ii) Find the marginal probability density functions of the random “z.iables. 
(South Gujarat Univ. B.Sc., 1988) 
8. (a) Two-dimensional random variable (X , Y ) have the joint density 
f(x,y)= 8xy,0<x< y< l 
= 0, otherwise 
(i) Find. P(X¥< W290 Y< 1/4). 
(ii} Find the marginal and cond)tional distributions. 
diii) AreX and Y independent? Give reasons for your answer. 
) (South Gujarat Univ. B.Sc., 1992) 
fix ly)=2x/y? O<x<y,0<y<l 


fAlxy=4x(1-x’),0< r< 1 
Ans. = Q, otherwise 
“fly)=4y’,0< y< 1  faly |x) =2y/(1-0’) 3 x<y<1,0<x<1 
9. (a) The random variables X and Y have the joint density function : 
f(x,y)=2, ifx+ y<1, x20 and y20 
= 0, otherwise 
Find the conditional distribution of Y , given X =x. 
(Calcutta Univ. B.Sc. (Hons.), 1984) 
(b) The random variables X and Y have.the joint distribution given by the 
probability density function : 
6(1-x- y), forx>0, y>O, x+ y< 1 
fny)= | 0. eee : : 
Find the marginal distributions of X and Y . Hence examine if X and Y are 
independent. [Calcutta Univ. B.Sc. (Hons.), 1986) 
10. If the joint distribution function of X and Y is given by 
F(x,y)=(l-e“*)(l-e7) for x>0, y>0 
= 0, elsewhere 
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Find P(1< X¥<3,1< Y¥<2). (Dethi Univ. M.A.(Econ.), 1988] 


3° 2 
Hint. Reqd. Prob. = feral e dy |= (1- e?1- e) 
1 1 


11. LetX and Y be two random variables with the joint probability density 
function 


_ J) Bxy, O0< xs y< 1 
fez,y)= | 0, otherwise 


Obtain: 

(i) the joint distribution function of X and Y. 

(ii) the marginal probability density function of Y ; and 
(iii) P(XS415< YS 1). 
12. Let X and Y be jointly distributed with p.d.f. 


! 
—(l+xy), |x| <1, <1 
jaye t(1tay), del < 1, yl 
Q , otherwise 
Show that X and Y are not independent but X” and Y* are independent. 


1 
Hint. fr(x)= | f(x,y) dy = < -l<x<l; 
-1 


1 
frly)= J f(x,y) d= 5, -~l<y<l 
-1 


Since f(x,y) #fi(x)fo(y),X and Y are not independent. However, 


Vx 
P(X*<x)=P(|Xis ve)=J] fi(x)de= ve 
— Vx 
P(X<xNYs y)=P((XIS & NO IYIs Yy) 


= P(X’<x)-P(y’< y) 


=> X’ and Y’ are independent. 
13. (a) The joint probability density.function of the two dimensional random 
variable (X , Y ). is given by : 
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33 
~) xy/l6, OS x8 2,05 ys2 
f(x,y) 0 , elsewhere 


Find the marginal densities of X and Y. Also find the cumulative distribution 


functions for X andy. (Annamalai Univ. B.E., 1986) 


3 3 
Ans. fx(x)= 7; O<sx<2; fr(y)= 7; O< y<2 


0 ; X< O 0 ; y< 0 
Fx(x)= 4 4/16; OS x 2| Fr(y)=3 y'/16 ; Os ys 2 
l - x>2 l ; yo? 


(b) The joint probability density function of the two dimensional random 
variable {( X , Y) is given by: 
. xy , L$ xs yS2 
0 , elsewhere 


(i). Find the marginal density functions of X andy, 
(ii) Find the conditional density function of Y given X =x, and conditional 


density funciton of X given Y =y. 


f(x,y)= 


(Madras Univ. B.Sc. (Stat. Main), 1987] 


Ans. (i) piar= ff. y)dy= 2 (4-2); 1s x2 


e 


= : otherwise 


0 
wore [ye de = Fy (y>-1) > ls y<s2 


; Isxsy 


2x 
y-1 
_ f(x,y)_ _2y 
x ; xf ys2 
frix(y| x) = oye fae 
14, The two random variables X and Y have, forX = x and Y= y, the joint 


probability density function : 
f ( x ’ y ) = 
2x y 


Derive the marginal distributions of X and Y . Further obtain the conditional 


distribution of Y for X= x and also that of X given Y= y. 
(Civil Services Main, 1986) 


fxyy (X41) = 


, for 1< x< co and t<y<x 


Hint. fx(x)= | f(x, y)dy= J f(x,y) ay 
y 1/x 


5 68 y Fundamentals of Mathematical Statistics 


friy= | fCx.yax 


x 


= | f(x,y)de; OS ys 1 
1/y 
= | f(x,y)de; 1S yew 


y Oo. | xX 
15. Show that the conditions for the tunction 


f(x,y)= k eap [Ax+ 2 Hxy+ By | » ~o< (x,y) <0 

to be a bivariate p.d.f. are ) 

(QASO, (ii) B SO (iii) AB-H'20. 

Further show that under these conditions 

2 1/2 
k= = (AB~ H) 

Hint. f(x,y) will represent the p.d.f. of a bivariate distriibution if and 

only if 


Je. fe pcx.y) de dy = 1 


~0oo %— 00 
rove) co 2 a 
=> k [2 J° exp [Av+2Hxy+ By lax dy =1 .. (*) 
We have 
Ax+2Hxy+ ay= alse MH ry+ Fy 
H\. AB-H 
a Lae aa 2 +s 
a|(x+4y] + x a u(S®) 
Similarly, we can. write 
: 2 
Ax+2Hxy+ By =B (> 3" + or i ? ue (##*) 


Substituting from (#*) and (##*) in (*) we observe that the double integral on 
the left hand side will converge if and only if 
As 0, B< 0 and AB- H’2> 0, 
as desired. 
Let us ake A=-a ; B=-b ; H=h so that AB—H’= ab—h’, where 
a>0Q,b>0. 
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Substituting in (*), we get 


kJ? JP exp ~ Ky -2(- shad pare 
co ab —h’ 2 J~ 2 
=> ra expi- 7 yl: [°° exp -=(ax-hy) dx | dy 
= ] vee (4#**) 
(By Fubini’s theorem) 
= exp{—1 (arty | ae= J a 
Now I=. exp| 7 a hy )" ¢} dx= J°. exp ae 
(ax— hy= u) 
= 1 Va Vaevt 
a 
a 22 
(fo, ef? ae) 
Lee c 


Hence from (****), we get 


bVE J? exp {- 2 a— Hy yf y= 


OBJECTIVE TYPE QUESTIONS 
I. Which of the following statements are TRUE or FALSE. 
(i) Given a continuous random variable X with probability density function 
f(x), then f(x) cannot exceed unify. 
(ii) A random variable X has the following probability density function : 
f(x)=x,0< x< 1 


= 0, elsewhere 
(iii) The function defined as 
f(x)= |x|, -l<x< 1 
= Q, elsewhere 


is a possible probability density function. 
(iv) The following represents joint probability distribution. 


1/9 1/18 1/18 
11g 29 3/9 
18 1/18 = =:1/18 
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Il. Fill in the blanks : 

(i) \f pi. (x) and p.(y ) be the marginal probability functions of two inde- 
pendent discrete random variablies X and Y , then their joint probability function 

p(x,y)=... 
(ii) The function f(x) defined as 
f(x)=|2z], ~-1< x< 1 
= 0, elsewhere 

is a possible ...... 

5-6. Transformation of One-dimensional Random Variable. Let X bea 
random variable defined on the event space S and let g (-) be a function such that 


Y= g(X ) isalsoat.v. definedon S . In this section we shall deal with the following 
problem : 

"Given the probability density of ar.v. X, to determine the density of a new 
rv. Y=-g(X)." 

It can be proved in general that, if g (-) is any continuous function, then the 
distribution of Y= 9 ( X ) is uniquely determined by that of X . The proof of this 
result is rather difficult and beyond the scope of this book. Here we shall consider 
the following, relatively simple theorem. 


Theorem 5-9. Let X be a continuous r.v. with p.df. fx (x). Lety= g(x) 
be strictly monotonic (increasing or decreasing) function of x. Assume that 


g(x) is differentiable (and hence continuous) for all x. Then the p.df. of the r.v. 
Y is given by 


hy (y )= fx(x) 


where x is expressed in terms of y. 


Proof. Case (i). y=g(x) is strictly increasing function of x (i.e., 
dy/dx > 0. The df. of Y is given by 


Hy(y)= P(YS y)= P[g(X)s yJ= P(XS g'(y)], 
the inverse exists and is unique, since g (-) is strictly increasing. 
Hy(y)= Fx[ ¢'(y)], where F is the d.f. of X 


= Fx(x) [-- y=g(x) => o'(y)=2] 
Differentiating w.r.t. y, we get 


9 


dx 
dy 


d d d 
by (y)= a UFa(x)1= 2 (Fe(2) on 


2 dx . 
= f(x) Gy ; ww. (*) 
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Case (ii). y= g(x) is strictly monotonic decreasing. 
Hy (y)=P(Y¥Sy)=Plg(X)syl=P[X28'(y)] 
=1-P[X<g"(y)]=1-Fxle\y)J=1-Fe(2), 
where x="! (y), the inverse exists and is unique. Differentiating w.r.t. y , we 
get 


hy (y) = 2 [1-Fy(x) f= “fel x) 


_ tt 
= fx (2) A#*) 


Note that the algebraic sign (—ive) obtained in (**) is correct, since y is a 
decreasing function of x = x is a decreasing function of y = dx /dy< 0. 
The results in (+) and (**) can be combined to give 


hy (”) = fx @) 


a 
dy 


Example 5-33. Ifthe cumulative distribution function of X is F (x) , find the 
cumulative distribution function of 

(i) Y= X+ a, (ii)Y= X- 6, (iii) Y= ax, 

(iv) Y= X*., and (vy) Y= X? 

What are the corresponding probability density functions ? 

Solution. Let G (-) be the c.d.f. of Y. Then 
(i) Gxy)= P(Y¥Sx)=P[X+ as x}]=P[X< x- a]= F(x- a) 
(ii) GX) = P(Y¥s x)= P[(X- bs x]= P[X< x+ b)= F(x+ 5) 


(tii) G(x) = P [aX < x)= P| xs |, a> 0 
= (2), ifa>O 
a 
P| xe 2. 1- P| x< | 
a a. 
1~ F(2), ifa< 0 
a 


(iv) GQz)= PLYS x)= P[X?s x]= P[Xs x? ]= F(x”) 
(v) G()= p[ x? x |= [-x< x< 2] 


and G(x) 


i 


= P[Xxs ¥?)- pP[x< - #7] 


§-72 rundamentals of Mathematical Statistics 


= 0, ifx< 0 
= F(Vx)- F(-Vx-0), ifx> 0 


F Ge) 


X-a F(x +a) fix+a) 
aX F(x/a) a> ? (1/a) f(x/a), a>O 
1- F(x/a) ,a< 0 (— 1/a) f (x/a), a< 0 
x? F(Nx + F(- Vx -0) aap VvE +f Vx) ] 
| for x> | Yas for x> 0 
0, otherwise = 0 for x< 0 
i l l 
x? F(x’) zfG) se 


EXERCISE Sf) 

1. (a) Arandom variable X has F (x) as its distribution function [ f(x) is 
the density function ). Find the distribution and the density functions of the 
random variable : 

(i) Y= a+ bX ,a and b are real numbers, (ii) Y= X~', [P (X= 0)= 0], 

(iii) Y= tanX, and (iv) Y= cos X. 

_J wR ,-l<x<l 

(0) LAE G , elsewhere 

be the p.d.f. of the r.v. X. Find the distribution function and the p.d.f. of Y= X’. 
[ Dethi Univ. B.Sc. (Maths Hons.), 1988 } 


Hint. F(x)= P(X< x)= more 5 (x+ 1) 
-1 


wee (*) 
Distribution function G (-) of Y= X? is given by: 
Gy (x)= F(Vx)- F(- ve) 5 x>0 [ c.f. Example 5-33 (v) ] 
l ) l 
= 9(vx+ 1)-9(- vet 1) (Prom (*)] 


=Vvx ; O<x<1 
(As- 1<x<1, Y= xX? wenniag haan 


p.a.f.of Y= X” is g(x)=G’(x)= a - O< x< 1 
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2.LetX be a continuous random variable with p.d.f.f(x).LetY= X *, Show 
that the random variable Y has p.d.f. given by 


l 
- Lf(vy)+f(- W)], y> 0 
sips [Pi Lf) + F(- WI, y> 
0 , ys 0 
3. Find the distribution and densitiy functions for (ij) Y= aX+ b, az 0, 
b real, (ii) Y= e* , assuming that F (x) and Ff (x) , the distribution and the density 
of X are known. 
G(y)= F {(y-6)/al], if a> 0 Mees y-by 
Giy)= 1-F{(y-b)//a], ifa<of ® lal’\ a 
fi Giy)= F (logy), y> 4 g(y)= + f(logy), y> 0 
it) y 
= 0, ys 0 =i):; y <0 
4. (a) The random variable X has an exponential distribution 
f(x)=e"*, 0< x¢ 
Find the density function of the variable (i) Y= 3X + 5,(ii) Y= X°. 
(b) Suppose that X_ has p.d_f., 
f(x)= 2x, 0< x< 1 
= 0, elsewhere 
Find the p.d.f. of Y= 3X+ 1. 
Ans. g(y)= 2(y-1), l<y<4 


5. Let X bea random variable with p.d_f. 


Ans. (i) 


f(x)=S(x4 1) -1<x<2 
= 0, elsewhere 
Find the p.d.f. of U= X°. [ Poona Univ. B.E., 1992 } 


6. Let the p.d.f. of X be 
f@=2, -38 x83 
= 0, elsewhere 


Find the p.d.f. of Y= 2X7- 3. 
7. LetX bearandom variable with the distribution function : 


0, x< 90 
Fy(x)=3x, OS x1 
1, x> 1 
Determine the distribution function F y (y ) of the random variable Y= VX 
and hence compute mean of Y. [ Calcutta Univ. B.A.(Hons.), 1986 ] 


5-7. Transformation of Two-dimensional Random Variable. In this sec- 
tion we shall consider the problem of change of variables in the two-dimensional 
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case. Let the r.v.’s U and V_ by the transformation u = 
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u(x,y),v= y(x,y), 


where u and v are continuously differentiable functions for: which Jacobian of 


transformation 
ox dy 
jo OY). Ou Ou 
d(u,v) |ax dy 
Ov av 


is either >Qor<0 throughout the (x, y ) plane so that the inverse transformation 
is uniquiely given byx= x(u,v), y= y(u,v). 

Theorem 5-10. The joint p.df. guv(u,v) of the transformed variables U 
and V is given by 

guv(u,v)= frr(x,y). II 
where | J lis the modulus value of the Jacobian of transformation and f(x,y) is 
expressed in terms of u and v. 
Proof. P(x< XS x+ dx, y< YS yt dy) 
=P(u< USut du, v< VSv+t dv) 


=> fa (x,y) dx dy= gw (u,v) du dv 
> guv(u,v) du dv= fir(x,y) resey | du dv 
= gw (U,V) = fer (X.Y) oe Sfeteg yw 


Theorem 5-11. /[fX and Y are independent continuous r.v.’s, then the p.df. 
of U= X + Y is given by 


h(u)= ia fx) fr(u-v) dv 


Proof. Let fry (x,y) be the joint p.df. of independent continuous 7.v.’s X 
and Y and let us make the transformation : 


u=X+y, VEX => X=v, y=u-v 
dx dy 
O(x,y)_|dOu dul _ i oo 
~ O¢u,v) [dx ay! |] 7 
dv dv 


Thus the joint p.d.f. of r.v.’s U and V is given by 
gu (u,v)= far (x,y) II 
= fx(x)-fr(y) II 
(Since X and Y are independent) 


= fx(v)sfr(u- v) 
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The marginal density of U is given by 
h(uy= J guvtu,y) av 


= {= fx(v) fr(u-v) dv 


Remark. The function A(-) is given a special name and is said to be the 
convolution of fx (-) and fy (-) and we write 
h()=f) * fr) 
Example 5-34, Let (X,Y) be a two-dimensional non-negative continuous 
r.v. having the joint density : 


2 2 
f(x,y)= ary e (47) ; x20, y2 0 
0 , elsewhere 
Prove that the densitiy function of U= Vx?+ Y? is 
2 
h(u)= we“ , OS u< co 
0 , elsewhere 


[ Meerut Univ. M.Sc., 1986 ] 
Solution. Let us make the transformation : 


u= V4 y> and v=x 


=> v20,u20 anduz>v => £=u20 and O<v<u 
The Jacobian of-transformation J is given by 
| ax ay 
i. (u,v) _ du du cam y 
J a(x,y) |ax ay Vey 
dv av 


The joint p.d.f. of U and V is given by 
g(u,v)= f(x,y) lJ 


2.2 
om 4xy els +y ) 


= 4x Ve 4 y’ eo (e+ y) 


2 
= {ame -u20,0< v<u 
0, othérwise 


_Ne+y 
y 
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(xX, 
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Hence the density function of U= Vx?4 Y? is 


u ~y fu 
h(u)= Jo g(u,v) dv= due“ J vdv 
2 
e Wwe" ,u20 
0, elsewhere 
Example 5-35. Let the probability density function of the random variable 
Y) be 


-2 4 Grty)va ; x,y>0,a>0 


ro 
Q , elsewhere 


Find the distribution of =(X- Y). 


fay=| 


[ Nagpur Univ. B.E., 1988 | 
Solution. Let us make the transformation : 


u= 5 (xe y) and v=y 


=> x= 2u+v and y=v 
The Jacobian of the transformation is : 
ax ax 
_|dau av). |2 ls 
J= ay ayl~ lo (= 
du dv 


Thus, the joint p.d.f. of the random variables (U ,V) is given by : 


| 2 -(Va)(u+ v) 
a 


—oo<u<oo,y>—2u if u<O0 


g(u,v)= v>0 if u20 and a>0 
0 , elsewhere 


The marginal p.d.f. of U is given by 
co 6.2 
(eat a exp{—(%)(u+v)}dv 


Hence 


= (2/a) Val 


gu) = = e — 0 <u <oo 
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Example §-36. Given the joint density function of X and Y as 
f(x,y)= ax e*>;0<x<2,y>0 


= 0, elsewhere 
Find the distribution of X+ Y. 
Solution. Let us make the transformation : 
u=xtyandv=y => y=v,x=u-v 


The Jacobian of transformation J = oY) = 1 and the region 0 <x<2 


and y > Otransformsto0 < u-— v< 2andv> Qas shown ithe following figure. 
u 7 


The joint density function of U and V is given by 
g(u,v)= + (u-v) e”°:;O0<v<u,u>dO0 
To find the density of U= X+ Y, we split the range of U into two parts 


(i) O< us 2 (region) (ii) u>-.2 (region II) (which is suggested by the 
diagram). 


For0< u< 2, (Region I): 
h(u)= by g(u,v) dv= ; hs (u-v)e” dv 


=3 | -e"(u-v)+e” 


u 
= @ (Integration by parts) 


ey rey 
=5 (e +u- 1) 
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For 2 < u< oc, (Region II): 


h(u)= ; es (u-v)e” av 


gd er (iaveajl 
2 v= u- 2 

-l, 

=se (14 e ) (on simplification) 

Hence 
(e+ u-1),0< us 2 
g(u)= ier (Ie 2), 2c uc m 

Q , elsewhere 


] MISCELLANEOUS EXERCISE ON CHAPTER FIVE 


1. 4 coins are tossed. Let X be the number of heads and Y be the number of 
heads minus the number of tails. Find the probability function of X, the probability 
function of Y andP(-2< Y< 4). 

Ans. Probability function of X is 


Values of X, x 0 1 2 3 4 


Pi(x) 6 6 6 6 6 


4 6 4 21 
16 16 16 16 
P(-2<Y<4)= a4 ets. = 


2. Arandom process gives measurements X between 0 and | with a probability 
density function 


f(x)= 12X%- 2127+ 10x, OS xS 1 
= 0, elsewhere 
(i) Find P(X < 4) andP(X> 3) 
(it) Find a number & such that P(X < k)= 
Ans. (i) "M6, “6, (ii) k= 0-452. 


i 
, a 
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3. Show that for the distribution 


— Ix— b| 

d= y.| 1 = ] dx, b—-a<x< bra 
= 0, otherwise, 

Yo = . , mean = b and variance = a’/6 


4. A ray of light is sent in a random direction towards the x-axis from a 
station Q (0, 1) on the y-axis and the ray meets the x-axis at a point P. Find the 
probability density function of the abscissa of P. 

(Calcutta Univ. B.Sc.(Hons.), 1982] 
_5. LetX be acontinuous variate with p.d.f. , 
f(x)=k(x- 2); a<x<b,k>0 
What are the possible values of a and b and whatisk ? 
(Delhi Univ. B.Sc.(Maths Hons.), 1989} 

6. Pareto distribution with parameters r and A is given by the probability 

density function 


e+) ? 


f(x)=rA’ ae for x2A 
x 


=0,x<A,r>0 
Show that it has a finite nth moment if and only if 2 < r. Find the mean and 
variance of the distribution. 
7. For a continuous random variable X, defined in the range (0 < x< ©), 
the probability distribution is such that 


P(X< x)= 1- fae where B>0O 
Find the median of the distribution. Alsoifm, m, and o denote the mean, mode 
and standard deviation respectively of:the distribution, prove that 

2m? - m= 0 and m= mV % 
What is the sign of skewness. of the distribution ? 

8. (a) Two dice are rolled, S= {(a,b) la,b= 1,2....,6}.LetX denote 
the sum of the two faces and Y the absolute value of their difference, i.e., X is 
distributed over the integers 2, 3, ...., 12 and Y over 0, 1,2, ...,5. Assuming the 
dice are fair, find the probabilities that (i) X= SQN Y= 1, (ii) K= TAY2 3, 
(i)X= Y, and(iv)X+ Y=4NX- Y=2.> 

Ans. (i) 4%, (ii) 4%, (iti) 0 and (iv) Ys. 

9. The joint probability density function of the two-dimensional variable 
(X , Y ) is of the form 

f(x,y)= ke @” 0S y<x<we 
= 0, elsewhere 
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(i) Determine the constant k. (éi) Find the conditional probability density 
function f, (x|y) and (iii) Compute P (Y > 3). 
[ Sardar Patel Univ. B.Sc., 1986 ] 
.- (iv) Find the marginal frequency function f, (x) of X. 
(v) Find the marginal frequency function f2 ( y ) of Y. 
(vi) Examine if X, Y are independent. 
(vii) Find the conditional frequency function of Y given X = 2. 


Ans. (i)k= 1, (ii) f(xly) =e", (iii) &°. 


10. Let 
-2 
x -Xx x” 
[Jeri -py <_< 
f(x,y)= y: 
a ; ; x=0,1,2,..3 y= 0,1,2,..5 with y2 x 


0, elsewhere 
Find the marginal density function of X and the marginal density function of Y, 
Also determine whether the random variables X and Y are independent. 
(1.S.1., 1987] 
11. Consider the following function : 
e?’ 
fxly)= pat. dai 
0, otherwise 
(i) Show that f(x|y) is the conditional probability function of X given Y; 
y2 0. 
(ii) If the marginal p.d.f. of Y is 
rAe**, x>0. 
hoyr={4 x50, 2>0 
what is the joint p.d.f. of X and Y ? 
(iii) Obtain the marginal probability function of X. 
[Delhi Univ. M.A.(Econ.), 1989] 
12. The probability density function of (x, , x2) is given as 


x 


os 0, 6,e° 7-22 if X1,%2.>0 
finin)= {|e otherwise . 


Find the density furction of ( y: , y2 ) where 
n= a 1, y2= 3x,+ x2 almost everywhere. 
2 


[Punjab Univ, M.A.(Econ.), 1992] 
13. (a) Let X; , Xx be a random sample of size 2 from a distribution with 
probability density function, 


f(x)=e*,0<x<0 
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= 0, elsewhere 


Show 
¥,=Xi+ X), and Y= —** | 
ee ns a, i 2, ©) 
are independent. [Sardar Patel Univ. B.Sc., Sept. 1986] 


(b) X; , X2, X3 denote random sample of size 3 drawn from the distribution: 
f(x)=e* , 0<x<0 
= 0, elsewhere 


Show that 
_ xX = Xi+ X2 _ 
Y,= Xt % , Y2= Xit Xt % and Y3;= X,+ X2+ X3 
are mutually independent. 


14. If the probability dénsity function of the random varaibles X and Y | X is 
givenby ‘ 


ene @ te0 
f{x)= , elsewhere 
-x iY 
———. > 
and frix(y|x)}= | yt! ° Vee 
0 , elsewhere ' 
respectively, find the probability density function of the random variable Y. 
[Jiwaji Univ. M.Sc., 1987] 
15. (a) The random variable X and Y have a joint p.d.f. f(x,y) given by 
f(x, y)= g(xt+y), x>0, y> 0 
= 0, otherwise. . 
Obtain the distribution function H(z) of Z= X+ Y and hence show that its 
p.d.f. is 
h(z)=z Q(z), z>0 . 
=(0 z< Q. ec 
(b) The joint density function of two random variables is given by 
f(x,y)=e"'™ 3 x>0, y> 0. Show that the p.dkf. of 
X+Y 
2 


U= is g(u)= 4ue? 
[Calicut Univ. B.Sc., 1986] 

16. The time.X taken by a garage to repair a Car is a continuous, random , 

variable with probability density function 
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es ‘aa x), OS xe 2 
1 = 


If, on leaving his car, a motorist goes to keep an engagement lasting for a time 
Y, where Y is a continuous random variable, independent of X, with probability 
function . 


Q , elsewhere 


1 
hove | 27 J 
“| O, elsewhere ; 
‘determine the probability that the car will not be ready on his return. 
[Calcutta Univ. B.A.(Hons.), 1988] 
17. IfX and Y are two independent random variabigs such that 
sf(x)= e" , x20 and g(y)= 36°" _y20; 
find the probability distribution of Z= X/Y. 
{Madurai Uniy. B:Sc., Oct. 1987} 
18.. The random variables X and Y are independent and their probability 
density functions are, respectively-given by 


l 1 : -y/2 
(x)=—. +, IxI< I and ge(y)= ye’’” , y> 0. 
f a eae) a(y)= y y 

Find the joint probability density of Z and W where Z= XY and'W= x. 
Deduce the probability density of Z.. {Calcutta Univ. B.Sc.(Hons.), 1985] 


CHAPTER SIX 
Mathematical Expectation, 
Generating Functions and 
Law of Large Numbers 
6-1. Mathematical Expectation. . Let ¥ be a random variable (r.v.) with 


p.d.f. (p.m.f.) f(x). Then its mathematical expectation, denoted by E(X) is “ 
given by : 


E(X)=[ x f(x)de, (for continuous r.v. ) (6/1) 
25 x f(x), — ( for discrete r.v. ) ...(6°14) 
provided the nghthand ratewnat or series is absolutely convergent, i.e., provided 
f lxsexylar =f bla < & | (6-2) 
or ~ b> |x f(x)| =z |x| f(x) < (6°24) , 


Remarks. 1. Since absolute convergence implies ordinary convergence, if ' 
(6-2) or (6:2a) holds then the integral or scries in (6-1) and (6-1a) also exists, i.e, 
bas a finite valuc and in that case we define E (X ) by (6-1) or (6-14). It should be . 
clearly understood that although X has an expectation only if L.H:S. in (6-2) or. 
(6:2a) exists, i.e., converges to a finite limit, its value is given by (6-1) or (6-14). | 

2. E (X) exists iff E |X| exists. ; 

3. The expectation of a random variable is thought of.as a long-term average. 
[See Remark to Example (6° 2a), page 6-19.]. ). ; 

4. Expected value and variance of an Indicator Variable. Consider the 
indicator variable : X = J, so that 

X=1 if A happens 
0 if A happens 
E(X)= 1 P(X=1) +0. P(X¥=0) 


=> E()= 1 PIlg=1] +0. Pi = 0] 

=. E (4) = P(A) 

Fhis gives us a ‘very uscful'tool to find P (A), rather tham to evaluate E (X). 
Thus P(A)= E(Is)° ...(6°2b) 


For illustration of this result, see Example 6:14, page 6-27. 
E(X*)= 17, P(=1) + o. P(X =0)= P(I,=1)= PYA) 
Var X= E (X?)- [E (X)] = P(A) - [P(A)] 
= P (A) ({1-P (A) ] 
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= P(A) P\A) .«(6-2c) 


[itustrations. If the r.v.X takes the values 0 !, 1 !,2 !, ... with probability 


~“ 
law 
eo! 


P(X=x1!)= = x= 0;1)2,... 


then Tx! PW=x!)eze" =F 1 
xa O -xe 0 
which is a divergent series. In this case E-(X ) does not exist. 
More rigorously, let us consider a random variable X which takes the values 
ue (1)! (i+1)3 i= 1,2,3,.. 
with the probability law 


1 ee oie 
pi= maa TEST ; pe 12 Sy cis 


“Here > Xj P= xi) = y (-1)'*! (=)- = a4 
ie] jel 7 

Using Leibnitz test for ajterimiting series the series on right, hand side is 

-conditionally: convergent since the terms. alternate in sign, are monotonically 

dec asine and converge to cre: By conditional convergence: we mean that 


(yd | nme 
de | 
+ 


although 5 Pix: CONNEIEES | pi X; | does not converge. So, rigorously speak- 


to) 
« 


ing, in the above: example E (X) does not exist, although 2 p;x; is finite, viz, 


fi j=l 
lag, Z. , 
As another example; let us considerthe r.v..X which takes the yalues 
aay ' 
ne SS | Ga Wer ee 


. tae : 
with probabilities Pr= 2 A 


Here also we get 


a & =! > 
k=l is ka] k 
1 1 1 ae 
o- [1-54 37 47 wel = log. 2 
and E |u| pe = 5 Z 
kal kel k 


er ar ets : : : ; , 
which is’ divergent ‘series. Hence in this case also expectation does not exist. 


AS an illustration of a contingous rv. let us-consider the r.v.,X, with p.d.f. 
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1 1 
a= nm 1+x? 
which is p.d.f. of Standard Cauchy distribution. [c.f-s. 8-9]. 


fixtpoaet f Hha-2 fo 


-oO< X¥< © 


9 


=; ax 


1 f 


1+ex° 


( *.’ Integrand is an even function of x) 


= | tog (1 +27) | > © 


Since this integral does not converge to4 finite;limit, E (X) does not exist. 

6:2. Expectation of a Function of a Random Variable. Consider a r.v. X 
with p.d.f. (p.m.f.) » and distribution function F(x). If g (.) is a function such 
that g(X) isar.v. and E [g (X)] exists (“e., is defined), then 


E [gM] = f 2 (x) dF (x)= f g (x) f (x) dx ...(6°3) 
~ - (For continuous r.v.) 
=X g(x) f(x) ...(6°3a) 


(F or discrete r.v.) 
By definition, the expectation of Y= g (X) is 


Elg = E(Y%) = J y.dHy(y)= f yh(y) dy -.:(6°4) 
Or EW) : yh (y)" ...(6°4a) 


where Hy (y) is the distribution function of Y and /i (y) is p.d.f, of Y. 
[The proof of equivalence of (6-3) and (6-4) is beyond the scope of the book. /] 
This result extends into higher, dimensions. If X and Y have a joint p.d f 
f(x, y) and Z=h (x,y) isa random variable for some function /t and if E (2) 
exists, then 


E@)=f f Mxyf(ry)dedy (5) 
or E(Z)= SE h(x,y) f(xy) 2{6°5a) 


Particular Cases. 1. If we take g (X) = X’, r being a positive integer, in 
(6-3) we get: 


E(X’)= f x’. f(x) de ; ...(6°5b) 
which is defined as u,, the rth moment (about origin) of the probability distribu- 
tion. 

Thus p,’ (about origin) = E (X"). In particular 
\ 
yi’ (about origin) = E (X) and wy’ (about origin’) = E (x7) 
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Hence Mean = X= py’ (about origin) = E (X) .-.(6°6) 
and Up = py’ — py? = E(X’)- { E (x) | ...(6°6a) 
2. If g(X)= [X-E(X)] = (X-xY, then fron: (6-3) we get : 

E[X-E(Q) = f kk -E@ lk) de= f &-x)sf@a ..(6°T) 


which is p,, the 7th moment about mean. 
In particular, if r= 2, we get 


wo= E(X<EWP=f (e- x) r(x)ad ..-(6°8) 


Formulae (6°6a) and (6:8) give the variance of thé\probability distribution of 
ar.v..X in terms of expectation. 


3. Taking g (x) = constant = c say in (6:3) we get 


E(c)=foc.f(x)de= cf f(x)de = c (6-9) 
~ eee ...(6-9a) 


Remark. The corresponding results for a discrete r.v. X can be obtained on 
replacing integration by summation( = ) over the given range of the variable X in 
the formulae (6-5) to (6-9). 

In the following sections, we shall establish some more results on Expectation 
‘in the form of theorems, for continuous r.v.’s only. The corresponding results for 
discrete r.v.’s can be obtained similarly on replacing integration by summation 
(= ) over the given range of the variable X and are left as an exercise to the reader. 

6-3. Addition‘ Theorem of Expectation 

Theorem 61. [f X and Y are random variables then 

| E(X+ Y)=E(X)+ E(Y),  ,..(6°10) 

provided all the expectations exist. 

Proof. Let X and Y be continuous r.v.’s with joint p.d.f. fyy (x,y) and 
~ marginal p.d.f ’s fx () and fr (y) respectively. Then by definition : 


E (X)= f x fx(x) dy ...(6-11) 
E(Y)= J y frly) dy ..(6:12) 


EK+Y= f f (x+y) fer (,y) de dy 
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=f f - “xX fry (x x,y) dx, dy 


+f fy fer(xty) de dy 


re - ®& 


f fry (x,y) 14a 


- © 


rf r| J fry (x,y) de | dy 


’ 


“f f(x) de “f yy) dy 


-E (X)+ E(Y) [On using (6-11) and (6- 12)] 

The result in (6: 10):can be extended to 7 variables as given below . 

Theorem 6-1(a). The mathematical expectdtion, of the sum of random 
variables is equal to the sum of their expectations, provided all the expectutions~ 
exist, 1 ) . 

Symbolically, if X1,X2, ....Xnare random variables then 

E(X,+ X24 ...4.X,)= E(X1) + E(Xy)% w+ E(XY (6-13) , 


or E 2 xi) e 2 E (X; ), ...(6-13a) 


if all the expectations exist. 
Proof. Using (6-10), for two r.v.’s X; and X, we get : 
E (X, + X2) = F (%) + E (X)) 


=> (6-13) is true forn = 2. oo(*): 
Let us now suppose that (6°13) is true forn = r (say), so that 
E ( > xi] = y E(X;) .-.(6°14) 
iwi iwi 


, r+ 3 4s r j 
E | 2%] - E | 2 Xi+ con 
ial a 


=E [ 3 EX, | + E (Xe) [Using (6-10)] 


Ni 
M 
a) 
mx 
+ 
Ry 
Ps 


[Using (6-14)] 


i) 
M 
Be 
& 


Hence if G 13) is true Tonnes r, it is also,true forn= r+ 1. But we 
have proved in (*) above that ee 13) is. true for n= 2. Hence it is true for 
n=241=3; n=3+1= 4; . arid soon. Hence by the principle of mathemati- 


6-6 Fundamentals of Mathematical Statistics 


cal [ntroduction (6-13) is true for all-positive integral values of n. 


6-4. Multiplication Theorem of Expectation ' 
Theorem 6-2. If X and Y are independent random variables, then 
E(xXY)= E(X*). Ey) -..(6°15) 


Proof. Proceeding as in Theorem.6:1, we have : 


E(XY)=f f xyfer(x,y) de dy 


=f ff xyfex) fay) de dy 


[Since X ses Y are independent] 


= f xis fy Kore 


Se (X).E-(Y) ,~ [Using (6-11) arid (6-12)) 
provided X and Y are independent. 
Generalisation to:n-variables. 

Theorem 6-2(a). The mathematical expectation of the product of a number 
of independent random variables. is equal‘to the product of their expectations. 
Symbolically, if X,, X2, ...,X, aren independent random variables, then 

E(X,X2...X.)= E (MX, )E (X2)...E Xa) 
hes E{ n Xx, - 1 EQ) at10) 
i= 
provided all the expectations exist. 

Proof. Using (6-15), fortwo .independent random variables X; and X2 , we 

get: : 
E\X,X2.)= E(X,)E(X2) 
=> (6°16) is true forn = 2. ...(*) 


‘ 


Let us now suppose that (6-16).is true form = 7, (say) so’that : 


E| n ae n E (X) 617) 
E( nx) - e Xi Kees] 

E| nx BxaG) [Using (6-15)] 

= n (E X,) E (X41) _ [Using (6-17)] 
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Hence if (6-16) is true for n= r, itis also trué for.n =. 7 +1 . Hence using (+), 
by the principle of mathematical induction we:conclude that (6°16) is true for all 
positive integral values of x. 

Theorem 6-3. /fX ts a random sapiable and a’ is constant, then 


(i) Ela (x) J= a E[V (XY) ] ..-(6°18) 
(ii) E(¥(X)+al= E [‘Y (X) ] +4, »(919) 
where ‘PY (X), a function of X isa r.v. and all ihe expectations exist. 
Proof. — es ‘ 


(i)  Efa¥ X= f €¥@)-fWdceaf Vf) de=aEY 
(ii) ETYO)+a= f [Y ) +0] fe) dx 


= -f W (x) f(y) dit a f f (x) dx 


-E [WICX) ] +a . 
|  f fQ) de = 
Cor. (i) If V(X) = X, then , 
E(dX)= aE (X) and E (X+ a) =E (X)+ a ...(6-20) 
(ii) If V(X) = 1,;thenE(a)= a. ...(621) 
Theroem 6-4. Jf X is a random variable and a and b are constants, then 
E(aX+ b)=a E(X)+ b .(6:22) 


provided all the expectations exist. 


Proof. By definition, we have « 


E(aX+ b)= f ( aa b) fx) dy | 


a fi fQlde +b j f(x) de 
“ ; = aE (X)+ b 
Cor.'. Ifb= 0, then we get 
E(aX)=4-E(X) *..(6-22a) 
Cor. 2. Takinga = 1, b= -X¥=- E(X), we get 
E(X-X)=0 
Remark. If we write, 
g(X)= aX+b ...(6-23), 
then glE]= aE (XY)+b ...(6°23a) 


é 
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Hence from (6-22) and (6-234) we get ; 
Elg@]= gE CO) | (6:24) 
Now (6°23) and (6-24) imply that expectation of a linear function ts the same 


linear function of the expectation. The result, however, is not true if g (-) is not 
linear. For instance ; 


4 on 
E(QU/X) 2 (I/EX) 3; E(X2) = [E@)]% / 
E[log(X)# log([E(X)];  E(X*) «[EQ)]’, 
since all the functions stated, above are non-linear. As an illustration, let us 
consider a random varible X ‘which.assumes only two values +1 and =i each with 
equal probability 5 . Then 
E(X)= 1x gt (-1)x3 80. 
and E(X?)= Px! + (- 1exte 1. 
Thus E(X’) # [E wy 
For a non-linear function g (X), it is difficult to obtain expressions for 
E [g (X)] in terms of g[E (X)], say, for E [log (X)] or E(x’) in terms of 
log [E (X)] or [E (X)-. However, some results in the form of inequalities between 


E |g (X)] and g[E (X)] are available, as discussed in Theorem 6 oe (enson’ S 
Inequality) page 6-15. 


6-5. Expectation of a Linear Combination of Random Variables 


Let Xi, X2, ..., X, be any m random variables and if a, a2, ;!.,@, are any n 
constants, then 


E(E a, Xie 5 a, E(X) | (6-25) 


t=l 


provided all the eepectations exist. / 
Proof. The result is obvious from (6°13) and - (6:20). 
Theorem 6-5 (a). If X20 then E (X) 20. 
Proof. If X is a continuous-random variable s.t. X = 0 then 


E(X)= f x.p(x)de= fx. plx)dx > 0, 
ae 0 


[-IfX20, p(x)=0 ee < 0] 


provided the expectation exists. - a 
Theorem 6-5 (b). Let X and Y be two random variables such that Y s Xx then 
E(Yy) s E(X), 


provided the expectations exist. 
Proof. Since Ys X, we have the r.v. 


| Y-X¥ s0 »= —X-¥ 2 0 
Hence E(X-Y) 2 0 => E(X)- E(Y) 2 0 
=> E(X) 2 E(Y) = E(Y) s E(), 
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as desired. 
Theorem 6-6. |E (X)| s E |X|, ...(6°26) 
provided the expectations exist. 


Proof. Since ¥ s | X |, we have by Theorem 6-5(b) 
E(X) s E|X|- a(t) 
Againsince -X s | X |, we have by Theorem 6-5(b) 
E(-X) s E|-X | 
=> -E(X) s E| X | w0(#*) 


From (*) and (**), we get the desired result | E (X)| s E|X|. 


Theorem 6-7. /f ,’ exists, then p,' exists foralll< s.r. 
Mathematically, if E (X") exists, then E (X’) exists forall 1s ss r,ie.,, 
AY See ee) ee Voisssr ...(6:27) 


Proof. flav oF (x) = f |.x |° dF (x) 


wl 
+f | |x |’ dF(X) ” 
x|{>l 


Ifs< r,then| x |’ <|x |’ for |x| > 1. 
en 1 


a f jx |’ dF) sf [xl aF@) +f | x |" dF (x) 
1 x|[>l 


1 
s J aF@)+f | | x |’ dF (x), 
-1 x}>1 
since for -1<x<1,|[x|°< 1. 
f |x|’ dF(x)s1l+E|X|’'<& 
=> E(c) exists 1sssr [-- E(X’) exists] 
Remark. The above theorem states that if the moments of a specified order 
exist, then aJl the lower ordermoments automatically exist. However, the converse 
is not true, t.e., we may have distnbutions for which al] the moments of a specified 
order exist but no higher order moments exist. For example, for the r.v. with p.d.f. 
p(x)=#2/x? ; x21 
= 0 ; «<1 
we have : 


E(X)=f x p@)de =2f x’ ds = 2 


ral 


1 
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rae] ¥ p(x)de = 2f hdr = ¢0 


Thus for the above distribution, 1st order moment (mean) exists but 2nd order 
moment (variance) does| not exist. 
As another illustration, consider a r.v. X with p.d.f. 


| r+1 
jie -x¥20.; a>O 
x+a) 
a vA r = : t srl x! 
win EW = (48) a"! faye a 
Put x =ay and pe Beta integral : 
ry ‘™ 
f (f+xy7 = B(mn), 
0 
we shall get on simplification : 4 
Me= (nt l)a-B(rt+lijed 
However, 
o r+l 
#5 r+) _ r+) Xx a 
Ure1 = E(X =(r+1)a f (x+ay™ 0, 


0 
as the intergal is not puiNieicens Hence in this case only the moments up to sth 
rder exist and highcr order moments do not exist. 


Theorem.6°8. IfX ts a random variable, then 
V(aX +b) = a V(X), ...(6-28) 

where a and b are constants. 

Proof. Let WY =aX+b 

Then E(Y) = aE (X)+b 

Y- E(¥) = a{X-E(X)} 
Squaring and taking expectation of both sides, we get 
E({Y-E(){.= @ E{X-E() 

=> V(YYy=@VX) => V(axX+b) =a V(X), 
where V(X ) is written fpr variance of X. 


Cor. (i) Ifb= 0, then V(aX) = a’ V(X) ...(6°28a) 
‘=> Variance is not independent of change of scale . 
(uw) Ifa= 0, then V(b) = 0 ...(6°285) 
=> Variance of a constant in zero. 
(ui) Ifa=1,then V¥+b) = V(X) ° .-.(6°28¢) 


=> Variancc is independent of change of ofigin. 
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6°6. Covariance: If X and Y are two random variables, then covariance 
between them is defined as 
Cov (X,Y) = EHX- EWHY- EM}] (6-29) 
= E[XY- XE(¥)~ YE(@)+ EQE()] 
= E(XY)- E(YE(X)- E(QX)E(Y) + EQ)E(Y) 
= E(x Y)- E(X)E(Y) .-(6°29a) 
If X and Y are independent then E (XY) = E(X)E(Y) and hence in this 
case 


Cov (X, Y) = E (X) E(Y)-E (X)E(Y) =0 ...(6:29b) 
Remarks. 1. Cov (aX, bY) =,E[{ aX - E (aX)}{ bY - E (bY) } | 
=E[a(X¥-E(X)}b{Y-E(Y)}] 


=abE[ {X-E(]{Y-E(%)}] 
ab Cov (Xx, Y) ...(6°30) 
Cov (X, Y) ..:(6°30a) 


2. Cov(X +a, Y+b) 


X-X Y-Y 1! 
ne »( 4 Oy : Oy ¥-f). Oy Oy Cov (X, Y) ; ...(6°305) 
4. Similarly, we shall get : 
Cov (aX +b, cY +d) = ac Cov(X, Y) ,«(6°30C) 
Cov (X + Y, Z) = Cov.(X, Z) + Cov (Y,Z) ..(6°30d) 


Cov (aX + bY ,cX + dY) = acoy’ + bdoy’? + (ad + bc) Cov (X,Y) _ ...(6-30e) 


IfX and Y are independent, Cov (X,Y) = 0. [c.f (6-29b)] 

However, the converse, is not true. 

(For details see Theorem 10-2) 

6-6-1. Correlation Coefficient. Thé correlation coefficient ( pyy) , between 
the variables X and Y is defined as : 
Cov (X,Y) 
For detailed discussion on.correlation coefficient, see Chapter 10. 
6-7. Variance of a Linear Combination of Random Variables 
Theorem 6-9. Let X,,X2, ...,X, ben random variables then 


pxy = Correlation Coéfficient (X,Y) = 


v| zai xi}= = Sa?V(X;)+2 5 z a:a; Cov(X,X;) (631) 


teal izl 


ne . . 
Proof. Let U= a,X,+ a.X2+ ... + 4,X, 
LEU) = a, E (Xi) a E (X14 20+ a E(X,) 


U -E(U) = aj [X; - E (Xi) + 42 [Xa - E (X2)]'+ «4 a [Xn - E (Xn) 


..(6-30f) 
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Squaring and taking expectation of both sides, we get 
E[U- E(U)[P = a E[X,- EQ] + a’ E[%)- EX] + ... 
+ a, E[X,- E(X)/ 
+22 J aja; E[{X;- E(X)} [X)- E(%)}] 


iel fj 
i<j 


=> V(U)= a,?2 V(X) + ay V(X) + ... + a,’ V(X,) 
+22. £ a;d& Cov (X%,X) 


iol jn 
i<J 


=> V z Qa; x;|- > a? V(X) + 22 = a; a; Cov (X;, X;) 
je is, is pol 
: : tf 
Remarks. 1. [fa;=1; 1=1,2,...,.” then 
V(X + X2.+ «0+ Xa) = V(X) + V(X) + ... + VM) 


° +2 2 2 Cov (X;, X;) ...(6°3 1a) 
i<j | 
2. I6X, ,X2,...,X,_ are independent (pairwise) then Cov (X;, X}) = 0, (i= j). 
Thus from (6-31) and (6:31a), we get 
V(a1X, + a2X2 +... + €nXn) = a1°V(X}) + €2’V (X2) +... + ii (6-316) 
and V(X, + X2't... +X) = V(X1) + VX) +... + V(X,,) > 
3. Ifa, =1=a, and a;=a,=.,.= a, = 0, then from (6-31), we get 
VX nt X2)= V(X1)+ V(X2)+ 2 Cov (X1,X2) 
Againifa; #1, a2=-1anda3;=444...=a,50, then 
V(X, = X.2)= V(X,)+ V(xX7)- 2 Cov (X1.,X2) 
Thus we have 
V(Xi1+-X2)= V(X1)+ V(X2)* 2 Cov (X1,X2) __ «..(631c) 
If X, and X> are independent, then Cov (X,,X2) = O and we get 
V(X1+ X2)= V(X) + VAX2)- , ...(6:31d) 


Theorem 6:10. JfX and Y are independent random variables then 
E [h(X) .k(Y)] = E [hk (X)] Ek] s —--e(6°32) 
where h(.) is a function of X alone and k(.) is a function of Y alone, provided 
expectations on both sides exist. 
Proof. ket fy (x-)- and. gy (y') be the marginal p.d.f.’s of X and Y respectively. 
Since X and ¥ are independent, their joint p.d.f. fry (x, y) is given by 


frv(x,y)= fe(x) f(y) {*) 
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By definition, for continuous.r.v.’s 


E(nay-k(]= f f k(x) k(y) Sl, y) de dy 


=~ © - 
a oe 


=f f h(x) RO) f(x) B(y) de dy 


[From (*)] 

Since E (h(X) k(Y)] exists, .the integral on the right hand side is absolutely 

convergent and hence by Fubini’s theorem for integrable functions we can change 
the order of integration to get 


Eth ny) d= | f w(x} Je) | f k(y) aly) wy 


«=m cof 


= E(h(X)).E(k(Y)], 


as desired. 
Remark. The result can be proved for discrete random variables XY and Y on 
replacing integration by summation over the given range of X and Y. 


Theorem 6-11. Cauchy-Schwartz Inequality. Jf X and Y are random 
variables taking real values, then 


[E(XY)[ s E(X’).E(Y’) ...(6.33) 


Proof. Let us considera real valuéd function of the real variable t, defined 

by 
Z(t)h= E(X+tYy 

‘which is always non-negative, since (X +1 Y) = 0, forall real X, Y and ¢. 

Thus ZQ@)=E(X+tYv20 Vt. 

=> Z()= E[X?+ 2XY+ 0?Y?] 

= E(X?)+ 2t.E(XY)+ CE(Y’) 2 0, forallt. (+) 
Obviously, Z (¢ ) is a quadratic expression in ‘t’. 
We know that the quadratic expression of the form : 


wt Y(t) 


——_—_ 
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W()=Al+ Btt C z= O forall ¢, implics that thé graph of the function 
Y (¢) either touches the ¢ -axis at only one point or not at all, as exhibited in the 
diagrams. 


~This is equivalent to saying that the. discriminant of the function V (¢) , viz., 
B? ~ 4ACs 0, since the condition B’-- 4AC > 0 implies that the function 
W (¢) has two distinct real roots which is a contradiction to the fact that V (¢ ) meets 
the ¢ -axis either at only one point or not at all. Using this result, we get from (+), 


4.-E(XY)P- 4.E(®).E(Y’) < 0 
=> [E(XY)) s E(x’). E(Y) 
Remarks. 1. The sign of equality holds in (6-33) or (*) iff 
E(X#tYyry=0 Vit => P{(X+t¥) =0]-1, 
—>P[X+1Y=0]= 1 an Pye -*le1 
=>P[Y=AX]=1 5° (A= -1/t) :..(6°33b) 


2. If the rv. xX takes the real values Xi, X2, ... Xn and’ iv. Y ‘takes the real 
Values yi, V2, +) Yr then Cauchy-Schwartz inequality implies : 


2 
2 oa5) etl .o alll ee) 
& Bayle(s zx) -(5 2) 
na 2 a A 
=> 2 Xi yi s 2 x] | 2 yi?) 
the sign of equality holding if and only if: * : 


pall constant = k, (say) for all i= | ey 


Le. iff 
y2 


yi 
3. Replacing X by | X - ae | X- w.| andtaking Y= lin (6-33; 
we get 


podem X2 e x 
— an =e ok, Sa 
ve ( ly). 


[E | x- pel P= E| X- »,|*. E(1) 
=> [ Mean Deviation about mean |* Variance (X ) 
=> MD. s SD.'>*° SD. = MOD. ~~" ...(633a) 


*6°7. Jenson’s Inequality -: 
Continuous Convex Function. (Definition). A continuous function g(x) 
on the interval / » convex if foreveryx, and x2, (41+ x 2)/2 € 1, we have 


“rN 1 + X2 
g bere < 58(%1)+ 3 8(%2) 
Remarks. 1. If x,, 42 & J, then(x;,+ x7)/2 El. 
2. Sometimes (6:34) is.replaced by the i condition : 


Forx,, x2 € /, ; - 
gl[Axi+ (l- A)rits hg (vi)+ i. Pueactie As 1 ...(6°35) 


(6°34) 
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é : ne 
(6-34) and (6°35) agree at A = >: 


3. If we do not assume the continuity of g( x ) , then (6-35) is required to define 
convexity. There are certain ‘no1-measurable’ non-constant functions g ( .) satis- 
fying (6°34) but not (6:35). If g(--) ‘is measurable, then (6:34) and (6°35) are 
equivalent. 

4. A function satisfying (6-35) is continuous except possibly at the end points 
of the interval / (if it has end points). 

5. If g is twicé differentiable, i.c., g”(x) exists for X € [interior of / ], and 

g(x) 2 0 forsuchx, then g is convex on the interior points. 

6. For any point x, interior to /, J a straighit line y= ax + b, which passes 
through (x.,, & (x.)) and satisfies g(x )2= ax+ b,forallxE /. 

Theorem 6-12. (Jenson’s Inequality). If gis continuous and convex function 
on the interval l, and X is arandom variable whose values are in I with probability 
1, then 

E [g(x )J= gE (X) |; ...(6:36) 
provided the expectations exist. 


Proof. First of all we shali show that E (X) € /. 
The various possible cases for / are : 
l= (-20,%);J/=(a,e); l={a, eo); J=(-&,b); 
I= (-%,b], 1 = (a,b) and variations of this. 
If E (X) exists, then,-0 < E(X¥)< &. 
If X= a almost surely (a.s.), t.e., with probability 1, then E (X )2 a. 
IfXs b,as.thenE (Y)s b. 
Thus E (X) El. NowE(X) canbe either a left end or a right end point (if 
end points exist) of I or an interior point of I. 
Suppose I has a left end point ‘a’, ie, X2a and E(X)+= a. Then 
X-a20as.andE (X- a)= 0 5 


Thus P(X=a)=1 or P((X- a)=0)J=1. 
Elg(X)]= Elg(a)] [- g(x)= g(a) as.) 
= g(a) (:.' g(a) is a constant ) 
= gE(x)]. \ 
The result can be established similarly if / bas a right end point ‘b’ and 
E(X)= b. 9 (x) Q(x) ° 


Thus we are now required .to establish 
(6:36) when E (X) = xo, is an interior point 
of 1. 

Let ax+ 5 pass through the point 


OX+ 
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(x,, g(x.)) and let it be below g 
[c.f. Remark 6 above]. 
Elg()]2 E@X+ b)=aE(X)+ b 
=axotb 
= g(x.)= g[E(X)] 


= ElgX)]= g{(x)]. - 
Continuous Concave Function. (Definition). A continuous function g is 


concave onan interval / if(- g) is convex. 


Corollary to Theorem 6:12. Jf g is a continuous and concave function on 
the interval I and X is a r.v. whose values are in I with probability 1, then 
Ele X)]s g[E%)] (6°37) 
provided the expectations exist. 
Kemarks. 1. Equality holds in Theorem (6-12) and corollary (6-37) if and 
only if 
P[g (X) = aX +b] =1, 
for some a and b, 


Z. Jenson’s inequality extends to random vectors. If / is a convex set in 
n-dimensional Euclidean space, i.e., the interval / in theorem 6-12 is transferred to 
convex set, g is conotinuous on /, (6°34) holds whenever .X; and X; are any arbitrary 
vectors in I. The condition g’’ (x) 2 0 for x interior to / implies 


dg (x) 
is non-negative definite for all x interior to /. 
SOME ILLUSTRATIONS OF JENSON’S INEQUALITY 


1. If E (X’) exists, then 
E(X*) 2 (EQ), ..-(6°38) 
since g (X) = X’ is convex function of X as g (xX) =2>0. 


2. IfX>0 as.i.e.,X assumes only positive values and E (X) and E (1/X) 
exist then 


1 1 
Ely] “ EQ)’ ...(638a) 


because g (X) = : is a convex function of X since 


g” (x)= = >0, for X >. 


3. (f£¥>0,a.s. then 
E(X"*) s |E(X)]”, , ,(6°38b) 
since g (X) =X i Xk >0 is a concave function 
ase" (X= -sx* <0, for¥>0. 


4. UV > 0, acs. then 
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E | log (X)] s log [E (X) ], ...(6°38¢) 


provided the expectations exist, because log X is a concave function of X . 


5. Since g (X)=E (e™) is a convex function of X for all ¢ and all_X, if 
E(e =) and E (X) exist then 


E(e™) = efF™ ...(6°38d) 
If E (X) = 0, then 
My(t)=E(e") = 1, forall. 

Thus if My (£) exists, then it has a lower bound 1, provided E (X) = 0. Further, 
this bound is attained at ¢= 0. Thus My (¢) has a minimum att = 0. 
67:1 ANOTHER USEFUL INEQUALITY. Let f and g be monotone functions 
on some subset of the real line and X be a r.v. whose range is in the subset almost 
surely (a.S.) If the expectations exist, then 


EI/MeM]2 E(f®] Elg@} (6:39) 
E{s®e®)) s E(f®]-E(g@) (6390) 


or 
according as f and g are monotone in the same or in the opposite directions. 
Proof. Let us consider the case when both the functions f and g are monotone 
in the same direction. Let x and y lie in the domain of f and g respectively. 
If fand g are both monotonically increasing, then 
yex => fW)ef@) and gly) = g(x) 
=> fY)-f@)=0 and gliy)-g) 2.0 
= Uy) -£@)] -[g &) - g @)] = 0 -(*) 
If fand g are both monotonically decreasing then fory 2 x, we have 
fy) s f) and &(y) s g(x) 
=> fy) -f@) s 0 and g(y)-g(%) s 0 
= UO) -£&)] - [a Ge ~ g(x)] = 0 --(#*) 
Hence if f and g are both monotonic in the same direction, then from (*) and 
(**), we get the same resull, viz., 


if (y) -f@)] . lg &) - g &)] 2 0. 


Let us now consider independently and identically distributed (i.1.d.) random 
variables X and Y. Then from above, we get 


E\(f(%) - £0) (g () -g (X)] 2 0. 
= EU™M)-.sMI-EUVY).g@M)-E Xe) 
+Elf(X).g@]20 © ...6-40) 
Since X and Y are u1.d. r.v.’s, we have 
EVs] = EL Ms]: 
ELM] = ELMIEle Ol = EL OE[s OM) 


(-.X and Yare independent) (°." X:and Y are identical) 
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and EU (X)g (Yj = EY X-E le MEY OE OO 


Substituting in (6°40) we get 
2 EUf&).g(xs)]-2 Eff®].E[g@] 20, 

= Eif(X).e@] = Ef@]-E le) 
which establishes the result in (6°39). 

Similarly, (6-39a) can be established, if f and .g are monotonic in opposite 
directions, t.e., if fis monotonically increasing (decreasing) and g is monotonically 
decreasing (increasing). The proof.is left as an exercise to the reader. 

SOME ILLUSTRATIONS OF INEQUALITY (6-39). 

1. IfX is a r.v. which takes only non-negative values, Le. if. X = 0 a:s. then 
for a > 0, B > 0, f(X) =X% and g (X) =X? are monotonic in the same direction, 
Hence if the expectations exist, 

E(X* .X°) = E(X").E(X*) 
=> E(X°*®) = E(X*)y E(X®); a>0, B>0 ..(6-41) 
In particular, taking o = B = 1, we get 
E(X’) 2 [E(®/, 
a result already Obtained in (6-38). 

2. If X = 0, as. and E (X“%) and E.(X ~") exist, then for a > 0, we get from 
(6-39a) 

E(x* XX) s E(X").E(X"") 


a 1 a-1),\, 
at E(x" £( 5] > E(X*"!): a>0. Jie 
In particular with a = 1, we get 
E (xX) El +} > 1, 
a result already obtained in (6:38a) 
Taking a = 2 in (6-42), we get 
E (X?) E(x) > E(X) 
E.(X*) > tt) > as : 
E(x E(L 
x] X ...(6°43) 
on using (6°38a). 


Taking o.= 2 and B = - 2.in'f (X) =X°, g (X) =X°® and using (6-39a), we get 
E(X’) . E(X"?) = E(X’.X7’) = 1. | 
=> E(x 2 


1 
E (x~’) ...(6°434) 


Which is a Weaker inequality than (6;43).. 
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3. 1f Mx (t) = E (e™) exists for all ¢ and for some r.v_X, then 
Myiss) =E le eav) xy =E (eo, ew 
2 E(e™) . E(e™) 
= Mx (u) . M x(v) 
. My(u+v) => Mx(u).Mx(v), for u,v 2 0. 
Example 6-1. Let X be a random variable with the following probability 
distribution : 
Xx : -3 6 9 
P, (X =x) : 1/6 1/2 1/3 


Find E(X) and E(X’) and using the laws of expectation, evaluate 
E (2X +1)’. 
(Gauhati Univ. 5.Sc., 1992) 
Solution. E(X) = 2x. p(x) 


= (-3)xi4+6x5+9x i 


1 
3 


E(X’) = Xx? p(x) 
1 1 1 93 
= 9x Z+36x>5 +81 = 
E (2X +1) = E[4X’+4X+1] = 4E(X’)+4EQX) +1 
= 4xeaxite = 209 


Example 6-2. (a) Find the expectation of the number on a die when thrown. 

(b) Two unbiased dice are thrown. Find the expected values of the sum of 
numbers of points on them. 

Solution. (a) LetX be the random variable representing the number on a die 
when thrown. Then X can take any one of the values 1,2,3,..., 6 each with-équal 
probability 4%. Hence 


I 1 1 I 
E (xX) gx ltex2tex3t... +7 x6 


I 1 6x7 7 
ot 9 + — ed _—_=  =S = 
g(l+2+3+... +6) ee > A(t) 


Remark. This does not mean that in a random throw of a dice, the player will 
get the number (7/2) = 3-5. In fact, one can never get this (fractional) number ina 
throw of a dice. Rather, this implies that if the player tosses the dice for a “long" 
period, then on the average toss he will get (7/2) =3°5. 

(6) The probability function of X (the sum of numbers obtained on two dice), 


Sie ate le 2 ll 
vis | %6 | 346 | 46 | 546 | 96 | | 445 | 6 


e 


S 
Value of X : x 
Probability 
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E (x) > pix 


= at ik Sila Ma Mia +7* 36 


+8x2 gt Ox a 36 = +10x 3-4 11 xo + 12x J 


= L246 4124 206304 424.40 436 + 30-622 + 12) 
ge = 
= 3g % 252 


Aliter. Let_.X; be the number obtained on the ith dice (i = 1, 2) when thrown. 
Then the sum of the number of points on two dice is given by 
S = X; +X; 


ae E(S) = E(X,)¢4F (X2) = i+ =7 [On using (*)] 


Remark. This result can be generalised to the sum of points obtained in a 

random throw of n dice. Then 
E(S)= DT EK)= 2 (7H= 
ial ial 

Example 6:3. A box contains 2” tickets among which "C; tickets bear the 
number t ; i=-0, 1, 2.....,n. A group of m tickets ts drawn. What ts the expectation 
of the sum of their numbers ? 

Solution. Let X ; ; 1 = 1, 2, ..., m be the variable representing the number on 
the ith ticket drawn. Then the sum ‘S’ of the numbers on the tickets drawn is given 
: by 


S =X +X7+...+Xn = LX; 
E(S) = 2 E(Xi) 
i=l 
Now X; is a random variable which can take any one of the possible values 0, 
1, 2, ..., 2 With respective probabilities. 


"Co/2", "C,/2", "C2/2', bs eC ./2 . 

= [ 1."C, a 2.°C3 > 3.°C3 + i... + n.C, 

n(n- n(n-1)) 3 n(n-1)(n-2) 
2! 3! 


E (Xj) 


= | tn + 2A +...¢7.3 


1+ (ray MEDD gs 


[""*Co+" "Cy 7 Saas OL Tre aie OH 


.(1+1)""' = 


Nils Vs Vls le 


ML 
2 
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Hence E(S) = 5 (n/2) = ae 
tal 


Example 6-4. /n four tosses of a coin, let X be the number of heads. Tabulate 
the 16 possible outcomes with the corresponding values of X. By simple counting, 
derive the distribution of X and hence calculate the expected value of X. 

Solution. Let H represent a head, 7 a tail and_X, the random vanable denoting 
the number of heads. 


S. No. Outcomes No. of Heads | S. No. Outcomes No. of Heads 
(X) (X) 


1 HHHH 4 9 H THT 2 
2 HHHT é 10 THTH 2 
3 HHTH 3 11 THHT 2, 
4 HTHH 3 12 HTTT\ 1 
5 THHH 3 13 THTT 1 
6 HHTT 2 14 TTHT 1 
7 HTTH 2 15 TTTH 1 
8 TTHH 2 TTTT 0 


The random variable X- takes the values 0, 1, 2,3 and 4. Since, from the above 
table, we find that the number of cases favourable to the coming of 0, 1, 2, 3 and 
4 heads are 1, 4, 6, 4 and 1 respectively, we have 


1 4 1 6 3 
P(X=0)= 55, PX=D =F =7, PX&= 2) = 16 ~ 8’ 
Ps3)= SF and P=4) =. 
Thus the probability distribution of Y can be summarised as follows : 
ar 0 1 2 3 4 
ie oe ee re a 
as 16 4 8 4 16 
4 > 
1 3 1 1 
E(X) = bea ad ee ee ae at4-F6 
= 1 + 3 a 3 + 1 = 2 
4 4°4 4 
Example 6-8. A coin ts tossed until a head. appears. Whai ts the expectation 
of the number of tosses required ? {Delhi Univ. B.Sc., Oct. 1989] 


Solution. Let XY denote the number of tosses required to get the first head. 
Then X can materialise in the following ways :, 


E(x) = z x p (x) 
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Probability p (x) 


1/2 
1/2 x 1/2 = 1/4 
1/2 x 1/2 x 1/2 = 1/8 


TH 


1 L 1 1 
Sik, ex Q to +4x— +... 


8 16 .+(*) 
This is an arithmetic-geometric series with ratio of GP being r = 1/2. 


Let Sa l.pt2.gt3.gt4.cet.. 
1 1 1 1 
Then 75 = 4*2-3t3 167 
1 1.1 #41 
‘ _1 ee sca es hee 
(1-3)S yatta tae: 
1 72. | 
~ 2> = T-(17y 7! 
[Since the.sum of an infinite G.P. with first term a and common nmatio'r (< 1) 
is a/(1 -r) ] 
=> S=2 
Hence, substituting in (+), we get 
E(X) =2 


Example 6-6. What ts the expectation of the number of failures preceding 
the first success in an infinite series of independent trials with constant probability 
p of success in each trial ? (Delhi Univ. B.Sc., Oct. 1991] 


Solution. Let the random variable X. denote the number of failures preceding 
the first success. Then X can take the values 0, 1, 2, ..., 0. We have 


p(x) = P(X =x) = P{[x failures precede the first success ] = q‘ p 
where q = 1 -p 1s the probability of failure ina trial. Then by def. 


E(X) = = xp(x) = 2 x.g'p=pq = xq"' 
x=0 xe0 
= pq{1+2q+3q + 4g’ +...] 
Now 1 + 2q + 3q° + 4q° +... is'an.infinite arithmetic-geometric series. 
Let S = 14+29+3q'+4q'+... 
qS = q+2q°+3q'+... 


(l-qg)S=1+q+q+q +... = = 
m _ 1 1 » 
(1-qy 
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1 
1+29+3q+4q°+... = ——, 
(1-4) 


Hence EG) -—4.. 21 


Example 6-7. A bex contains ‘a’ white and ‘b’ black balls. ‘c’ balls are drawn. 
Find the expected value of the number of white balls drawn. 
[Allahabad Univ. B.Sc., 1989; Indian Forest Service 1987] 
Solution. Leta variable X;, associated with ith draw, be defined as follows: 
X; = 1, if ith ball drawn is white 
and X; = 0; if th ball drawn is black 
Then the number ‘S’ of the white balls among ‘c’ balls drawn is given by 


S=X,+X.+...4X.= IX; => ES) = = E(x) 


rel te] 


Now  P(X;=1) = P (of drawing a white ball) = aaa 
and P(X; +0) = P (of drawing a black ball) = —— 
a 
E(X) = 1.P(X;=1)+0.P(X;=0) = ea 
a ca 
Hence E(S) = a eed are 


tel 
Example 6-8. Let variate X have the distribution 
P(X=0) = P(X=2) = p; P(X¥=1) = 1-2p, for Ospst. 


For what p is the Var (X) a.maximum ? 
[Delhi Univ. B.Sc. (Maths Hons.) 1987, 85] 
Solution, Here, the r.v. X takes the values 0, 1 and 2 with respective 
probabilities p, 1 - 2p and p,Osps;. 
E(X) = 0x p+1x(1-2p)+2xp=1 

E(X*) = Oxp+1’x(1-2p)+2?xp = 1+42p 

Var (X) = E(X*)-(E(X)]' = 2p; Ospsi 
Obviously Vat(X) is maximum whenp = >, and : 

[Var (X) Jax = 2x4 = ] 


Example 69. Var(X) =0. => p[x =E(X)]=1.-Comment. 
Solution. Var (X) = E[X-E(X)]’ = 0 

=> | {X¥- E (X) [ = 0, with'probability 1 

=> [X - E (X)] = 0, with probability 1 

at PIXE] 51 
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Example 6-10. Explain by means of an example that a probability distribu- 

tion is not uniquely determined by its moments. 
Solution. Consider a r.v. X with p.d.f. [c.f. Log-Normal distribution & 8-2-15 
f(x) = ee exp [- 3 (osx) | ;x>0 


= 0; otherwise 


Consider, another r. v. Y with p.d.f. 


g(y) = [1+ asin (2 logy) | f(y) = ge(y) (say), y>0 ...(##) 


which, for - 1< a <1, represents a family of probability distributions. 


E(Y")= f y’ (1+asin(2 x log y)| f(y) dy 
= fy fO)dy+a.f y".sin (2 log y) f(y) dy. 


ae 1 
= EX tance fy SINE Ya one 


= (log yy lay 


= EX’+ 


= f ew-2n «sin (2 m2) dz 


| log y =z => yee] 


@ 1 ; 2 
= EX'+ x J e727 F-9 . Sin (2 7 2) dz 


.—) 
<< 
N 


= EX’ +o i J ert, sin (2 my) dy 


ai r=y => am ea) SU Ree, 
r being a positive integer |. 


= EX’, 
the value of the integral being zero, since the integra id i is 4n odd function of y. 
= E(Y") is independent of ‘a’ in (**), 


Hence, { g(y) = g.(y); -1sas1}, represents a family of distributions, 
each different from the other, but having'the same moments. This explains that the 
moments may not determine a distribution uniquely. | 

Example 6-11. Starting from the origin, unit steps dre taken to the right wuli 
probability p and to the left with probability q (= 1 - p). Assuming independent 

‘movements, find the mean and variance of the distance moved from origin after n 
steps (Random Walk Problem). 

Solution. Let us associate a variable X; with the ith step defined as follows : 

X; = +1, if the ith step is towards the nght, 
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= -1, if the ith step is towards the left. 
Then S =X, +X2+...+X, = IX;, represents the random distance 
moved from origin after 7 steps. 
E (Xi) = 1xp+(-1)xq = p-q 
E(X;?) = ?xp+(-1)xq = p+q=1 
Var (Xi) = E(Xi)-[E()) = (q+p)-(p- 4) = 474 


E(S,) = 3 EK) = n(p-4) 
V(S) = E V(X) = 4npq 


[ *.. Movements of steps are. independent }. 
Example 6-12. Let r.v. X have a density function f (.), cumulative distribu- 
tion function F (.), mean p and variance o’. Define Y = a+ BX, wherea and 
B are constants satisfying -~<a< and B>0. 
(a) Select a and 8 so that Y has mean 0 and variance 1. 
(b) What is the ‘correlation coefficient Pyy between X and Y ? 
(c) Find the cumulative distribution function of Ym terms of-a ,B and 
F (-). 
(d) If X ts symmetrically distributed about -p, us Y necessarily symmetrically 
distributed about its mean? 
Solution. (a) E (X) = wu, Var (X) = 07. We wanta and Bs.t. 


E(Y) = E(a+ 8X) = a+By = 0 .-(1) 
Var(Y) = Var(a + BX) = 8’.0° = 1 ...(2) 
Solving (1) and (2) we get : 
8 = 1/a, (>0) and a = -p/o ..-(3) 
(b) Cov (X,Y) = EWY)-E(Q)E(Y) = E[X(a+8X)] 
[ E(Y)=0) 


= a.E(X)+B.E(X’) s+ apt Bf{o'+p’] 
5 Cov(X,Y) _aputBlo+p’] ; 
xi = &£®* 


Oy Oy o.l1 ( ‘Oy= 1) 


sa [-wto ee] a1 
- [On using (3) ] 
(c) Distribution function Gy (.) of Y is given by : 
Gy(y) = P(Y sy) = P[a+BXsy] 
= P(X s(y-a)/B) 
=> Gy(y) = Fr( 5%) ; 


(d) Wehave: Yeats = (x -) = 6 (X- jp) [On using (3) ] 
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Since X is given to be symmetrically distributed about mean p, (X — 2) and 
- (X — 4) have the same distribution. 

Hence Y=f(X- wp) and -Y = -~B(X-4p) have the same distribution. 
Since E (Y) = 0, we conclude that Y is symmetrically distributed about its mean. 

Example 6-13. Let X be a r.v. with mean and variance. o’. Show that 
E (X - b)’, as a function of b, is minimised when b = 

Solution. E(X- bY = E[(X-p)+(p-5)] 

= E(X-p)'+(u- by +2(n- dE (X-y) 

Var (X) + (u- b) [ E(X-p) =0) 

=> E (X-b) 2 Var(X), .o(*) 
since (u - b)’, being the square of a teal: quantity is always non-negative. 

The sign of equality a in (*) iff 

(u-b’ =0 => ped. 
Hence E(x - - by’ is ih ee when wt = 6 and its minimum value is 


E(X~p)’ = ox’. 
Remark. This result states that the sum of squares of deviations i is minimum 


when taken about mean. 

[ Also see § 2-4, Property 3 of Arithmetic Mean] 

Example 6°14. Lef a, a2,...,a, be arbitrary real numbers and A,, A, 
wy An De events. Prove that 


> ee 3 a; a; P (A;A;) = 0 


iol seat 
[Delhi Univ. B.A. (Spl. Course ~ Stat. Hons.), 1986] 
Solution. Let us define the indicator variable : 
Xi2h, = 1 if Aj occurs 
= 0 if A; occurs. 
Thefi using (6°2b) : 


E(X) = P(A); (i=1,2,..,n) (i) 
Also X;X; = Lana, 
=> E (XX) = P (A;A)) -..(it) 


- 2 
Consider, for rea] numbers @;, a2, ..., dn, the expression ( z 4; X; | , which 
fel 
is always non-negative. 
; 2 
=> ( = a; Xi] = 0 
iel 
=> [ zax,]( Bax) 20 
isl j=l 
=> 'S £ aa; XX 20, Ms 
fet jel ‘ mm (1) 
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for all a;’s and a;’s. 
Since expected value of a non-negative quantity is always non-negative, on 
taking expectations of both sides in (ili) and using (i) and (ti) we get : 
a s a; a; E (x; X;) 20 > ££ 2f Q; a; P (A;A)) 2 0. 


fel jel vel jel 


Example 6-15. Ina sequence of Bernoulli trials, let X be the length of the 
run of either successes or failures starting with the first trial. Find E (X) and 
V ). 

Solution. Let ‘p” denote the probability of success. Then q = 1 -p is the 
probability of failuure. X ="1 means that we can have any one of the possibilities 
SF and FS with respective probabilities pq and qp. 
<P P(X=1) = P(SF)+P(FS) = pq+qp = 2pq 
Similarly 

P(X =2) = P (SSF) + P(FFS) = p'q+qp 
In general 
P(X=r) = P [SSS. SF] + P (FFF. .FS| = p’.q+q.p 


E (Xx) = = rP(X=r) = 2 r(p'.q+q .p) 
= pq Erp + Erg) 
= pq (+29 +30". ee ae yy 


= pq {(1-py*+(1-4)*) = pq lq? +p’) 
(See Remark to pes 6:17) 


“ es se iSe'l|| oe ape 
plate i|- ae 
E (X’) -{E (X)|? = E(X (X= 1)} +E (X)-(E &)}? 


V(X) 


Now 


EX - 1) 


5 r(r-1)P(X=nr) = Er (r- 1) (p'q+ 4p) 
read so? 


Zr(r-1)pq+ Ir(r-l)dp 
re2 rea2 


pq z r(r- 1) p'"?+qp Sr(r-yq 
= 2p’q = 

re2 
= 2p’'q(1-p) > +2qp(1-qy° 


2 2 
= 2 P_ + 4 


r(r-1) ,- _2rr-l) ,- 
HD pag agts $TORD gs 
2 a2 2 


6°28 Fundamentals of Mathematical Statistics 


ata [Es 7) eae rae 


-(2- 4) eaed 
q Pp q P 

Aliter. Proceed as in Example 6-17. 

Example 6-16. A deck of n numbered cards is thoroughly shuffled and the 
cards are inserted i into n numbered cells one by one. If the card number ‘v’ falls in 
the cell ‘i’, we count it as a match, otherwise not. Find the mean and variance of 
total aihiber of such matches. [Delhi Univ. B.Sc., (Stat. Hons.), 1988] 

Solution. Let us associate a random variable, XY; with the ith draw defined as 
follows : 

Y= 1, if the th card dealt has the number ‘i’ on it 
‘710, otherwise 


Then the total number of matches ‘S’ is given by 


S 2 X,+X1+...4+X%, = 2X; 


tel 


E(S) = z E (Xi) 
Now E(X) =1P(X;=1)40.P(X;=0) = P(X;=1) -- 
Hence E(S) = r(S)en- 1 
V(S) = Vim +X2+.. - +X,) 
= SV(K)+2 bes Cov (X;,X;) a) 
Now V(X) = E(X;’)-{E x} 
z P.PKi=1)+0 Pi =0)-( | 
-i_1 a-l 
now n .06(2) 
Cov (X,X) = E(X:X) - E (X) E (X) 3) 
E(X;X;) = 1,P (X;X)=11) + 0.P (X;X;=0) 
_(m-2)! 1 
ont n(n -1)’ 


since X;X; = 1 if and only if both card numbers i and j are in their respective 
matching places and there are (m - 2) ! arrangements of the remaining cards that 
correspond to this event. 

Substituting in (3), we get 


Mathematical Expectaions 6°29 


poe a 
n(n- 1) n’ (n-1) (4) 
Substituting from (2) and (4) in (1), we have 


1 1 
Cov (X;, X}) = oR” 


n n-l n n 1 
V(S) = 2 2 2. | = 
(S) Z| nv fa jar | ae aa 
n-1\ 2, 1 n-1 1 
= | n | +2 Sa 7 n tn”! 


Example 6-17. /f ¢ ts any. positive real number, show that the function 
defined by 
pix) = e"(1-ety | (#) 
can represent a probability function of a random variable X assuming the values 
1, 2, 3, ... Find the E (X) and Var (X) of the distribution. 
(Nagpur Univ. B.Sc., 1988] 
Solution. We have 
e>1,V t>0 => e'<1 => 1 =-e's0 


Also etetsoyv t>0Q 
e 


-1 
Hence p(x) = e'(i-e'). 20 V ¢t>0,x=1,2,3,... 


Also > p(x) ev! >» (l2e')! eget r a'; 7 
rel xe'l rel [a=l-e'] 


-¢t Z 2 ¥ 3 =f e 1 ay 
xe (l+tat+at+at+..) =e =a) 
ee‘fl-(l-e)yizsee'(ey' =1 
Hence p(x) defined in (*) represents the probability function of a r.v. X. 


E (X) = Xx. p (x) = e' >» x(l-e'y' 
xeal 


=e@' © x.a"'; [a=1-e'] 


= @'(1+2a+3a?+4a°+...) = e'(1-a)y? -oo(*), 
- e' (e'y? <a é 
Tx p(x) =e! Lx .a"! 


re] 


= &' [1+ 4a+9a’+ 16a’ +...] 

= e‘(1+a)(1-a)’ = e'(2-e')e* 
Hence Var(X) = E(X’)-[E@ J = e'(2-e')e%—e 

= e[(2-e')-1] = (1-e') 

= &(é -1) 


E (X’) 


Remark. 
() Consider S = 1+2a+ 3a” + 4a’ + ...(Arithmetico-gcometric serics) 
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=> aS = a+2a*+3a°+... 
=> (l-a)S = l+dt+a?+a+... = Yi-e) => S=(1-a)y’ 
Exa*'=14+2a+3a+4a +... = (1-a)y? »-(*) 
rel 
(it) Consider 
S=1+2.a+3?.a@4+4.a+5 .a'+... 
=> S = 1+ 4a+ 9a’ + 160° + 25a‘ +... 
-~3aS = -3a- 12a’ - 27a’ - 48a‘ -... 
+3a’S = + 3a’ + 12a°+27a'+... 
-a’S = Be av Ag Sih: 
Adding the above equations we get : 
(l-a/S21+a => S=(1+a)(1-a)? ..(**) 


VY a! = 14404 9a* + 16a°+... = (1+a)(1-a)? 


xyel 


The results in (*) and (**) are quite useful for numerical problems and should 
be committed to memory. 

Example 6-18. A man with n keys wants to open his door and tries the keys 
independently and at random. Find the mean and variance of the number of trials 
required to open the door (i). if unsuccessful keys are not eluminated from further 
selection, -and (tt) tf they are. [Rajasthan Univ. B.Sc.(Hons.), 1992] 

Solution. (i) Suppose the man gets the first success at the xth trial, 1.é., he is 
unable to open the door in the first (x - 1) trials. If unsuccessful keys are not 
eliminated then X is a random variable which can take the values 1, 2, 3,.... ad 
infinity. _ 

Probability of succcss at the first trial = Yn 


Probability of failure at the first trial = 1 - (1%) 


If unsuccessful keys are not eliminated then the probability of success and 
consequently of. failure is constant for each trial. 


Hence | p(x) = Probability of 1st success at the xth trial 
(, vy 1 
= ted _ 
n n 
© r 1 r-1} 1 
E(X) = 3 xp(x)= & x{1-] = 
xe] xe] n n 
= : 2 x At) where le ee 
N yal n 


E (X) = ~ [1 +24 +34? + 44? + ..J = = (1-Ay? 


" [See (*), Example (6°17)] 
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ty o e rm) ; 1 xo! 1 
E(X?) = es z x°(1- . 
= 1 >> x At! 
nN xe 
= S142 A$ 3? AP A? ta] 
: = (1 +A)(1-Ay? [See (##), Example (6-17)] 
; 1 2'3 
1 1 1 
al t+(4-a)|[ 1-(4-3) f 
= (2n-i1)n 
Hence V(X) =E (X- {EX} = Qn-i)n- =? -ne=n (n= 1) 


(ui) If unsuccessful keys are eliminated from further selection, then the 
random variable XY will take the values from 1 toy. In this case, we have 
Probability of success.at the firststrial = Yn 


Probability of success at the 2nd trial] = (a -1) 


Probability of success at the 3rd trial = (pn - 2) 
and so on. : 


Hence probability of 1st succes at 2nd trial = ( 1- "| a 


Probability of first‘success at the third trial 
1). 1 \ 1 1 
= (1-2) (1 ec ef n’ 


p (x) = Probability of first success at the xth trial = “ 


and so on. In general, we have 


E (xX) = 2 xp (x) -2 E x = mt 
E (X?) = E x?p(x) = 4 gyi e “ED Ons) 
xel ‘ xel 
eure ee: 
Hence V(X) = E(X?)- [EQ = SAP ISD _ (AE) 


n+1 
12 


bo aa : n-1 
2 (2n +1) = 3 (n-+ 1) = 7 oe 


6°32 oo oe Fundamentals of Mathematical Statistics 


Example 6-19. Jn a lottery m tickets are drawn at a time out of n tickets 
numbered 1 ton. Find the expectation and the variance of the sum S of the numbers 
on the tickets drawn. [Delhi Univ. B.Sc. (Maths Hons.), 1987] 

Solution. Let X; denote the score on the ith ticket drawn. 


Then S=X,+X.+...+Xn = =X, 

ial 

is the total score on, ine uChete can: 
E(S) = z E (Xi) 


Now each X; is a fandom variable which assumes the values 1, 2, 3, . 


each 
with equal probability 1/n. 
; E(X) = (142434... +n) +t) 
Henée E(S) = | 
fet. 2 2 
V(S) = V&itXat .. + Xn) 
= : V(X) + 2 as Cov (X;,X)) 
tel 
E (X;) = Ltesteste nen! 
a i. n(n +1) (2n +1) 7 (n+ 1) (2n +1) 
no 6 6 
V(X) = E(X%;’)- (E(x) 7 
_ (2+1) n+ 1) - n+1 * wel 
6 2 12 


Also Cov (X;,X)) = E(X;X) —E (X) E (X) 


To find E (X;X;) we note that the variables X; and X; can take tlie valucs as 
shown below: . 


Xx; X; 

1 235, voy M 

2 1,3,...,7 

n 1, 2, ..., (2-1) 


é' ° 
Thus the variable X;X; can take n(n-—1) possible values and 


P(X; = IN X= = ea k=l. Hence 
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E (x; X,) = 


Hence 


between them and O otherwise. If X,,X2, 


D213 4 esis: +1.n 
1 +2.142.3 + 00... 4 2. 
n(n-1) WP cdc dedeveeaasee sss caveussveucssoxe 
+n.l+n2+.. +n(n-*): 
1(i+243 4.0... +n)-1') 
1 +2(14+24+34......¢n)—-27 
= n(n-1) W veeessenesans's vacksessinssuseeasusesacencasepeeses 
+n(14+2+...... n-1+n) 
“ao 7 [(1+24+3+.. tn - (427+... +7°)| 
| 
_ a awed _n(n-+ 1){2n +1) 
7 _ n(n-1) } 6 
poor 
12 (n ~ 1) 
.. (n+ 1) Bn? -n-2) n+1\ 
Gov (XM) = ea (7S ] 
(n + 1) 2 2 
ey 13" -n-2 3(n 1) | 
- _@+1) 
7 12 
~ (n-1 (n + 1) 
V(S) = 3 (5 Je2 PT ee 


oe m(m-—1) { -("+1) 
7 2 *"* 2! 12 ‘ 
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[since there are "C, covariance terms in Cov Ci, X))] 
vis) = BGs) [w- ae | - mins 1) - _in(n+ 1){n- m) 


Example 6: 20. A die is ae (n +2) tunes. After each jee a ‘+’ is 
recorded for 4,5 or 6 und ‘— for 1,2 or 3, the signs forming an ordered sequence. 
Vy each, except the first and the last sign, is attached a characteristic random 
variable which takes the value 1 ifboth the neighbouring signs differ from the one 


ables, find the mean and Variance'of' X = z Xj. 


Solution. 


tel 


Y= 5X => E(X) = 5 E(X) 


wey Xn Are Characteristic random vari- 
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Now E (Xj) = 1P (X; = 1) + OP (X; = 0) ed P (X; = 1) 
For X; = 1, there are the following two mutually exclusive possibilities : 
(i) - + -, (i) + - + 
and since the probability of each sign is; , we have by addition probability theorem: 
3 3 
;=1) =P fi) = [i 1) si 
P(X;=1) = P(i)+ Pi) = (5) +(4] 


2 4 


E(X) = 
Hence EW) = 3(4)- - 5, 
V(X) = V(Xi + Vat... +X) 
= z V(X) +2 EE Cov (x, X) a) 
Now = E(X;”) = PP dian s0P Hie = P(X=1) = 4 
VX) = EQ)- [EXP = 5-2 


Now E(X;X;) = 1.P(X;=10X;=1) + OP (X%;=0NX; = 0) 
4+0.P(X;=10X;=0) + 0P(X%;=0NX;=1) 


= P(X¥;=10X;=1) 
Since there are the following two mutuaily exclusive possibilities for the event 
: (X; = 1 NX; = 1), 


@- + - + 
(ii)+ -—- + ~,wehave 
‘ay 1) o1 
P(X;=1NXj=1) = P(@)+P(u) = 3] (3) = 3 
Cov (X;,X) = E(X,X) - EX) E (X) 
ee ae 
es a 4°4° 16 
Hence V(X) = (3/16) +2 x= Cov (X,, X}) [From (*)] 
iel i<j] 
3n 


cat 2 [Cov (%1, X2) + Cov (X2, X3) + 


1 
\ ee + Cov (X,=1, X,)| ¢ 
: 3n 1 -5n-2 
76 t2@-D-ag = 5 


Example 6:21. From a point on the circumference of a ciréle of radius ‘a’ ,a 
chord is drawn in a random direction, (all directions are equally likely). Show that 
the expected value of the length of the chord is 4a/n and that the variance of the 


? 


Mathematical Expectations 635° 


length is 2a? (1-8/ x’). Also show that the chance ts 1/3 that the length of the chord 
will exceed the length of the side of an equilateral triangle inscribed in the circle. 

Solution.. Let P be any point on the circumference of a circle of radius ‘a’ and 
centre ‘O’. Let PQ be any chord drawn at randam and let ZOPQ = 8. Obviously, 
6 ranges from - 1/2 to.x/2. Since all the direc- 
tions are equally likely, the probability differen- 
tial of 8 is given by the rectangular disuribution 
(c.f. Chapter 8) : 


dF (8) = f (8) d (8) a as 
a. es <0<2 
= = ¥) 2 


Now, since 2PQR is 4 right angle, (angle in 
a semi-circle), we have 


PO =cos§ => PO= ='PR cos 6 = 2a cos 0 


PR 
2 2 
E (PQ) =f, (PQ)f@)d(@) = =f, cos odo 
-= sin 0 w/2 _ 4a 
-m/2 4 
= 
E (PO) | = f a 7 y POP S@d0 =f", costode 
2 
ad ie 2cos’6d0 = 4a? oe (1 + cos 26) d@ 
xvQ 7 


(Since cos’ hy is an even function of 0) 
4d’ , sin 20 - 24a’ x 


1 
V(PQ) = E[(PQ)*]-[E(PO)f = 22° 
We know that the length of the side-of an equilateral triangle inscribed in a 


circle of radius ‘a’ is a’ V 3. Hence 


P(PQ>aV3) = P(2acos @>aV 3) = P(cos o>) 


7 -X 1 
= P(|a|<2] = P| 6 <0<F) | 

1/6 n/6 1x 1 

* Jog fO® Sf /6 ena 3 
Example 6-22. A chord of a circle of radius ‘a’ is'drawn parallel to a given 
Straight line, all distances from the centre of the circle being equally likely. Show 
that the expected value of the length of the chord is na/2 and that the variance of 
the length is a’ (32 - 3x”)/12. Also show that the chance is 1/2 that the length of 
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the chord will exceed the length of the side of an equilateral triangle inscribed in 
the circle. 

Solution. Let PQ’be the chord of'a circle with centre O and radius ‘a’ drawn 
at random parallel to the given straight line AB. Draw OM 1 PQ. Let OM=y. 


Obviously ¥ ranges from — a to a, Since all distances from the centre are equally 
‘ 


p a q A B 
likely, the probability that a random value of x will lie in the small-interval ‘dx’ is 
given by the rectangular distribution [c.f. Chapter 8]: 
dF(x)'= f(x)dx = ———~ = —,-asxsa 
Length of the chord is 
PQ = 2PM = 2Va’?-x 
a 2a ; 

\ Hence E (PQ) ies PQ dF (x) = ls Va’ -x’ dx 
e f o Va? - x dx, 
a? 0 


(since integrand is an even function of x). 


= 2 [aici lal . 
a}2 2 a} 19 
_2-@ » xa 
a’'2°2 2 


E((Poe}=f% , (Poy Fe) = = f%  @-x)de 


4 7a... 44, x (4 
= Sg @ -X)dr = — | a*x - 3 (0 
_ 4 2° _ 8a" 
a 3 3 
Hence 
8a n'a’ 
Var (iength of chord)= E [(PQ)°]-[E(PQ)]} = Se 


a’ 
5 (32 - 377) 
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The length of the chord is greater than the side of the equilateral triangle 
inscribed in the circle if 
2Va?-x > av3 => 4(a@’-2’) > 3a’ 
ie, v<a/4 = |x| <a/2 
Hence the required probability. is 
a a a/2 
P(|x|<a/2) = P(-5 <X<5] = fen dF (x) 


1 pa/2 1 
~ 36) ~a/2 1 73 


Example 6:23. Let X,,X2,....X, be a sequence of mutually independent 
random variables with common distribution. Suppose X, assumes only positive 
iz.egral values and E (X;) =.a, exists; k= 1,2,...,n.LetS, = X,+X2+ ...4+Xn. 


(i) Show that E ( =") =, rt for lsmsn 


(ii) Show that E (S,~' | exists and 


E(= =1+(m-n)aE(S,;'), forisnsm 
(iii) Verify-and use the inequality x + x°' = 2, (x >0) to show'that 
E(= 2 m for m,n21 
Ae n ; (Delhi Univ. M.Sc. Stat.), 1988] 
Solution. (i) We have 
Xi+X2+...+X, | _ 7 
| Xi +X2.+...+X, = E (i) = : 
a E Xx +Xo+...+X, ae 4 
Sn 
M1) ae ( #2 Xn 
7 e(S}+8(S}+ ee (F] = 


Since X;’s, (i= 1, 2,...,) are identically distributed random variables, 
(X:/S,), (¢ = 1, 2, ..., 1) are also identically distributed random variables. 


X; 
nE(si} = 1 
X; 1 
=> E(5'] = e ; t'= 1,2, nN ..(*) 
Now 
Ae X, +X2+ ...+ Xm M1  X2 Xm 
e(=| E| i f ls, *S, S, 
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~E(S)+e(S)+-.+2(3] 


n 


1 1 : se 

Bette [(m times) ] [Using (*)] 
m 
n 


»(m<n) 


(ii) Since X;’s assume only positive integral values, we have 
ns X1+X2+ et X, < @ 


~ ~za>0 + 0<5S;'s 


— Sf 


Since S, ' lies between two finite quantities 0 and nrwe get 
0< E(S;') < ; 
Hence E (S, ') exists. 


*(s) ~E 


X, +X2+ 0.0. Xn t+ Xnap t oss a 
S, : 


Knot, es 
“  S§ 


oe) ise 


= 1 +E(“s4}4 


vill) 


S;, 
Since Xn+1;Xn+2 ---,Xm are independent of S, = Xi +X2+ ... +X,, they are 
independent of S, ‘ also. 
E a 1+E (X,.,:)E(S,') +... + E-(X,,) E (Sz') 
14+[E Xna1) +... +E (Xn)] E (Si') 
1+(m-n)aE(S,'), lsnsm ° 


[wEQ)eaV i] 


(iii) Verification of x +3 29 (ed), 

x + ; 22 
=> x +12 2x (multipliationvalid only if'x > 0) 
=> (x-1) 20 


which is always true for x > 0. 
If 1.<.m <n, result follows from (*). 
If1<n<~m, then using (+*), we have to prove that 


1+(m-n)aE(S,;')2 = 
=> (m-n)aE (S; . > —— 
-]1 all 
7 Es Jaz an +-.(##*) 


In (*#), taking x = > 0, we get 
E(x)+E(x')2 2 


-1 


=> E| > +E | 2 22 
an an 
1 a ae 
=> 7a E (S,) + an E (S,°) 2, 2. 
=> 1 ans anE(Sz') = 2 
an 

= an E (S,') 21 
2h Ae 

=> E(S, ) = an’ 


which was to be proved in (***). 


Example 6-24. Let X be ar.v. for which B, and B, exist. Then for any'real k, 
prove that : 
5. 2 Bi-(2k+k’) .a(*) 
Deduce that (i) B:, = By (ii) B. = 1. When is B2 = 1? f 
Solution. Without any loss of generality we can take E(X) = 0. [If 
E (X) = 0, then wé may start with the random variable Y = X - E (X) so that 
E(Y) = 0.] 
Consider the real valued function of the real variable ¢ defined by : 
Z(t) = E{(X?+tX+kwf20V ¢t, 
where up = EX’, .. {i) 
is the th moment of X about mean. 
Z(t) = E[X* 40° X72 4+ ys UX? + Wp KX? +A py tX] 
Matt po +k pz + 2t ps + 2k ps 


[Using (1) and E (X) = 0] 
t? p+ tps t+ pst Kus +2kps & 0: for all ¢. ...(ii) 
Since Z(t) is a quadratic form ing, Z(¢)20 for alle iff its discriminant is 
s 0, 2.e., 
iff (2 us)? = 4 wo [pat Kp? + 2k py? ] s 0 


=> Ps Berk? + 2k] 5 0 
M2 


[Dividing by 4? > 0] 


6-40 Fundamentals of Mathematical Statistic, 


=> Bi —- Bo ~ (2k +k’) s 0 

=> Bo = Bi -(2k +k’) 
Deductions. (i) Taking k=0 in (+) we get B2 = B (#8) 
(it) Taking k = - 1 in (*) we get: B2.2B,+1 »+o(##4) 


a result, which is established differently in Example 6-26. 
(iii) Since B; = ps/p2 is always non-negative, we get from (#**) : 
B2.21 oa +o (# #4) 
Remark. The sign of equality holds in (#*+#+*), Le, B2 = 1 iff 


Bo=tt=1 > wa w 


Mz 

2 
= E|X-E(®' = [EX -E()] 
= E(Y?)-[E@pP=0, © (Y = [X-E (X)]?) 
=> Var(Y) = 0 
=> P{Y=E(Y)] = 1 [See Example 69°} 
= P((X-py =EX-p)y 4 1 
= 3 P[X-py=0'] = 1 
= P[(X-p)=20] = 1 
=> P[X=pso] = 1 


Thus X takes only two values 4 +0 and w.-0 with respective probabilities 

p and q, (say). 

. E(X) = p(ut+o)+q(u-o) = 4 

=> p+q=i and (p-q)o =0 
But since o 2 0, (since in this case B2 is defined) we have : ; 

p+q=1 and p-qg=0 > p=q-=h. | 

Hence B2 = 1 iff the r.v. X assumes only two values, each with equa! 

probability 1/2. 


Example 6-25. Let X and Y be two variates having finite means. 
Prove or disprove : 
(a) E [Min (X, Y)] s Min [E (X), E (¥)] 
(b) E [Max (X, ¥)] 2 Max [E (x), E (Y)] 
(c) E [Min (X, Y) + Max (X, Y)] = E(x) + E(Y) 
[Delhi Univ. B.A. (Stat. Hons.), Spl. Course, 1989] 
Solution. We know that 


Min (X,Y) = 5(X¥+Y¥)-|X-Y| (i) 
and Max (Xx, Y) = 5(X+Y)+|X-Y| w(t) 
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(a E [Min (X, Y)] = 3E (X+Y)-E|X-Y| (iii) 
Wehave: |E(X-Y)|s E|X- AN 
=> -'E(X- YJ 2-E|X-Y 
= E|X-¥|2-|E(X-¥)|--[E()-ECY) 


Substituting in (izi) we get : 
E [Min (X, Y)] s SE(X+Y)-E|X-Y| 


s S[EQ)+E(Y)-|EQ@)-E()| [From (*)] 


=> E [Min (X, Y)] s Min [E a E (Y)] 
(b) Similarly from (ti) we get : 


E [Max (X, Y)] = LE + yy eE|X 48) 
s SE(X+Y)+|E(X+¥)| 
(-“|E(X+¥)| s E|X+Y]) 
= + (EQ) +E I+|EM+E)| 


Max [E (X), E(¥) ] 
ic, E[Max(X,Y)] 2 Max[E (X), E(Y) ] 


(c) [Min (xX, Y) + Max (X, Y)] = [X+ Y] 
=> E [Min (X, Y) + Max (X, Y)] = E(¥+Y) | 
= E(X)+E(Y), ~ 


as required. 

Hence all the results in (a), (b) and (c) are true. 

Example 6-26. Use the relation E (AX* + BX°+CX‘)Y 2 0, X being a 
random vartable with E (X) = 0, E denoting the mathematical expectation, to 
show that 
24 Ma+b Wasc, 
Ma+d Lm» Mose 


La +¢ Lo +c M2 
U,, denoting the nth moment about mean. 


2 0, .(*) 


Hence or otherwise show that Pearson Beta- coefficients satisfy the ine- 
quality 

B.-B,-1 2 0. 
Also deduce that B, 2. 1. 
Solution. Since E (X) = 0, we get 

E(X’) =p, ' ..(#*) 
Weare given that 

E (AX7+BX’+CX‘y 20 
=> ElAX"*+ BX” +CX~™ +24BX"*? + 2ACX"** + 2BCX”**] 2 0 
=> A’, + B’py + C’uo, + 2ABb a op + 2ACHesc + 2BCUy,. 2 0 
[From (##)] ...(###) 
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for all values of A, B, C. 
We know from matrix theory that the conditions for the quadratic form 
a'x’ + b'y’ + c'2 + 2f'yz + 2g'zx + 2h'xy, 
to be non-negative for all values of x, y and z are 
a’ h' g' 
h' b’ f' 
g | ie c’ 


a20, = (i) Ay # 2 0,and = (ti) = 0 
Comparing with (+*+*), we have 
a’ = Ura, b' = ry», c' = Mac, f' = Wo+c, 8 = Mare,’ = Ware 


Substituting these values in condition (iii) above, we get the required result. 
Taking a =0,b=1andc=2 in (*) and noting that uo = 1 and p, =0, we 
get 


1 0 [2 
0 M2 ps = 0 
M2 3s pa 

= M2 Ha — Ws + Ho (- pz) 2 0 


Dividing throughout by 3 (assuming that is finite, for otherwise B2 will 
come infinite), we get 


Es -_& -120 
M2 M2 
=> B2- Bi - 1,2 0 
= B2 = 6, +1 


Further since B; 2 0, we get B2 211. 
Example 6-27. Let X be a non-negative random variable with distribution 
function F. Show that 


E(X) = f (1-FQ@)jad. ; (i) 
0 
Conjecture a corresponding expression for E (X”). 


[Delhi Univ. M.Sc.(Stat). 1988] 
Solution. Since X 2 0, we have : 


RHS.= f [1-P(Xsx)] dx = f 1 ~ ff (u) du || ax 
where f() is the p.d.f. of rv. X. | 


R.HS. = J | S f (u) du |4x 


ooe (it) 
From the integral in bracket (1), we have, u 2x and since x ranges from 0 to 
0, u also range from 0 to™. Furtheru = x => x s u andsince x is non-negative, 
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we have 0 s x su. [See Region R,.in Remark 2 below]. Hence changing the 
order of integration in (ii), [by Fubini’s theorem for non-negative functions], we 
get 


RHS.= f | ft.de |fw) du = fu. f(w)du 


= E (x) [Since f (-) is p.d.f. of X | 
Conjecture for E (X*). Consider the integral: 


f 2x[1-FQ)] a = f | fred) a 


= f f 2x dx | f (u) du, 
(By Fubini’s theorem for non-negative functions). 
= f u’.f(u).du = E(X’) 
Remarks. 1. If X isa iomcnegntive fv. then 
Va,X = EX*-[E QO)’ = f 2x[1-F @)]de- pe .» (iii) 
0 


2. If we do not restrict ourselves to non-negative random variables only, we 
have the following more generalised result. 
If F denotes the distribution function of the random variable X then : 
ry » 0- 


E(X) = f [1-F@)] de - f F(x) dr, ...(iv) 


provided the integrals exist finitely. 
Proof of (iv). The first integral has already been evaluated in the above 
example, i.e., 


f [1 — F (x)] de = f u.f(u)-du (Vv) 


Consider : 


0 x 


0 Q 
J FQ@)d =f P&sx)d& = f | f(u) du |a 


mY 


ree 
=f J 1- dx | f(u) du 


‘Changing, the order of integration in the Region R, whereu s x]. 
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' 0 
= f ur f(a) de (vi) 


Subtracting (vi) from (v), we get : 

2 0 o 0 

f 1 -F@Jdx - f Fe@)de = fuf(durf uf) du 
0 -o 0 -0 


= f uf (u) du 


= E(x), (Since f (.) is p.d.f. o/ X) 
as desired, 
In this generalised case, 


Var (X) = f 2x[1-Fr(x) +Fx(-x) Jae - (EY 


3. The corresponding analogue of the above result fordiscrete random variable 
is given in the next Example 6-28. 
Example 6-28. If the possible valites of a variate X are 0, 1, 2, 3, .... then 


E(X) = = P(X>n) 
naQ 
[Delhi, Univ. B.Sc, (Maths Hons.), 1987] 


Solution. Let P(X =n) = p,, n=0, 1, 2, 3, ... my (7) 
If E (X) exists, then by definition : 
E(X) = 5 n.P(X=n) = 5 n.Dn (ii) 
Consider : = = 
5 P(X>n) = PX>0)+P(X>1)+PX>2)+.., 


nad 


P(X21)+P(X22)¢P(X23)+... 


= (pi + p2 +p3 + pat......) 
+ (prt ps + pa t.....) 
+ (D3 + Pa + Ps + ....,) 
oH csdarccsgeayieaseet 
= pit+2p2 +3p3 + sooees 
= £ NPp 
nel 
= E (X) [From (ti) | 


As an illustration of this result, see Problem 24 in Exercise 6(a). 


Aliter. R.H.S.= 2 P(X>n) = = P(X2n) 
naz=0 n=) 
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: 2 (220) 


Since the series is convergent and p.(x)20 W_ x, by Fubini’s theorem, chang- 
ing the order of summation we Bet: 


R.HLS. = > ( > pe) = 5 z | Pw 3 z= 7 


xa] neal 


sincex2n = NSX and xX assumes ony ositive integral values : 
R.HS. = x x p (x) = z xp (x) = E(X) 
Example 6-29. F oF ee pile igs Ya and Y, show that 
(E(w +YP}” s [eo + (£079) _.(*) 


Solution. Squaring both sides in (*)-we weve to prove : 


E(X+YY s (eu?) +({E(Y ay) a 
=> E(X?)+E(Y?) + 2B (XY) s E(X?)+E(Y2)+2VE®2E(Y3) 


=> E (XY) s VE(X2)E(Y?) 
s [E (XY)P s E(X’).E(Y’) 


This is nothing ‘but Cauchy-Schwartz inequality. [For proof see Theorem 
6:11 page 6-13.] 

Example 6-30. Let X and Y be independent non-degenerate variates. Prove 
that 

Var (XY) = Var (X). Var (Y) 
iff E(X) = 0, E(Y) = 
[Delhi Uniy. B.Sc. (Maths Hons.), 1989] 
Solution. We have 


Var (XY) = E (XY? -[EQY)P = E(X°Y’) -[EQYP 


= E(X’)E(Y’)-(EQO/P (EM), --(#) 
since X and Y are independent. 
If E(X) = 0 = E(Y) 
then Var (X) = E (X”) and Var (Y) = E(Y’) } 0(#*) 
Substituting from (**) in (*), we get 
Var (XY) = Var (X). Var (Y), 
as desired.. | 
Only If. We have to prove that if | 
Var-(XY) = Var (X). Var a »+(###) 
then E (X) = Oand E(Y) = 


Now (*##) gives, [on using (* 


E(X?).E(Y?) - (EQOP [EMP = (EW?) - (EQ? | x EC”) - (ECF | 
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= E(X FE (Y’)- E(x *\[E (YP [E ny E(Y*)+ [E (X)} [E (YP 
=> E(X*){E()) - (EQ (EM) + (EQOY Lr’ ) - [EC] rd =q 


> EMP {E@)-~(E MP} +E OP EW) - MP} = 


=> [E (Y)}° Var (X) + [E (X)P Var (Y) = 0 sol #448) 

Since each of the quantities [E (XO), {E (YJ, Var (X) and Var (Y) is non- 
negative and since X and Y are.given to be non-degenerate random variables such 
that Var (X) > 0 and Var (Y) > 0, (x) holds only if we have E (X) = 0 =E (Y), 
as required. 


EXERCISE 6(a) 

1. (a) Define a random variable and its mathematical expectation. 

(b) Show that the mathematical expectation of the sum of two random variables 
is the sum of their individual expectations and if two Variables are independent, the 
mathematical expectation of their product :. the:product of their expectations. 

Is the condition of independence necessary for the fatter ? If not, what is the 
necessary condition ? 

(c) If.X is a random variable, prove that | E (X)| s E(|X]). 

(d) If X and Y are two random variables such thatX <s Y, prove that 

E(X) s E(Y). 

(e) Prove that E [(X ~c)’ | = [Var (X)] + [E (X) - c]’, where c is 4 constant. 

2. Prove that : 

(a) E (aX + bY) = a E(X) +b E(Y). 
where a and b are any constants. 

(6) E (a) = a, a being.a constant. 

(¢) E [ag (X)] = aE [g(X)} 

(d) E [gi(X)+ gX) + ... + 8(X)] RE [91%] + E [92%] + ... + E [aX)] 

(Ele s Elle. 

(f) If g(X) = 0, everywhere then E [g (X)] 2 0. 

(g) If g (X) = Oeverywhere and E [g (X)] = 0,.then g (X) = 0, ie., the random 
variable g (X) has a one point distribution at X = 0. 

3. Show that if X is non-negative random variable such that both E (X) and 
E (1/X) exist, then 

© (1/X) = 1/E (Xx). ” 
. IfX, Y are independent random variables with E (X) = a EX *) = 6, and 
bate a 1, 2,3,4, find E (XY + Y¥?). | 

S. (a) IX and Y are two independent ‘random variables, show that 

Var (aX + bY) = a’ Var (X) +b Var (¥). * 
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(b) With usual notations, show that 


Cov (a X + bY, cX + dY) =ac Var (X) + bd Var (Y) + (ad + bc) Cov (X, Y) 
(c) Show that 


Cov] Vax. ¥ bx |= ¥ ¥ ab, Cov (X%.X,) 
r=) r= j=l c=) 
6.(a) Define the indicator function /4 (x) and show that E (/4 (X)) = P(A). 
(6) Prove that the probability function P (X € A) for set A and the 


distribution function Fy (x), (- 2 < x < oo), can be regarded as expectations of 
some random variable. 


Hint. Define the indicator functions : 
I(x) = lifxeA ly (x) 
=QOifx¢eA 
Then we shall get : 
E (la (X)} = P(X € A) and E (4 (X)) = P(X Sy) = Fx(y) 
7. (a) Let X be acontinuous random variable with median m. Minimise’ 
E.\X - bl, asan function of b. 


Ans, £E 1X — b\| is minimum when b = m = Median. This states that 
absolute sum of deviations of a given set of observations is minimum when taken 
about median. [See Example 5-19.} 


(b) Let X be a random variable such that E |X| < o. Show that 
E.\X — Clis minimised if we choose C equal to the median of the distribution. 


{Delhi Univ. B.Sc. (Maths Hons.), 1988] 
8. IfX and Y are symmetric, show that 
ae 
+ 
dead 
X 


{Xx 
7 F + A ae? 

Hint. l 

xX+F¥ (:. X and Y are symmetric. ) 

9. (a) If ar.v. X has a symmetric density about the point a and if 

E(X ) exists, then 
Mean (X) = Median (X) =a 
Hint. ety ent ~- ) Re +x); f(x) pon: of X. prove that 


lifxsy 
Oifx>y 


i 
try 
— 
</> 
+|+ 
<I 
L__J 
i 
ar | 
>< 
+ | 
~ 


=> | = 2E 


bite neateiee tax anyteddes [(s- —a)f(x)dx=0 


— - © 


(b) If X and Y are two random vanables with fe icvanances then show 
that 


E- (XY) $ E(X°) . E (Y’) A*) 
When does the equality sign hold in (*) ? [Indian Civil Service, 1987} 
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10. Let X be a non-negative arbitrary r.v. Mh distributidn function F. 
show that * 


E(X) = 1 - FyQ@)] de - | Fy (x) dx, 


= 


in the sense that, if either ies so does the other and the two are equal. 
[Delhi Univ. B.Sc. (Maths Hons.), 1992} 
11. Show that if Y and Z are independent random values of a variable 
X, the expected value‘of (Y — Z)’ is twice the variance of the distribution of X. 


[Allahab ad Univ. B.Sc., 1989] 
Hint. E(Y)=£(Z)=E(X) =, (say) ; 0; =02=0r=0" , (say). ...(*) 
E(Y¥ - Z) = E(Y) + E(Z) - 2E(NE@) 
(-.. Y,Z are independent ) 
(Oy + py) + (OZ + M2) - 2W? 
= 26° = 207 [On using (*)] 
12. Given the following table : 


i" 


‘Compute (i) E (X), (ii) E(2X+3), (iii) E(4X+5), (iv) E(X”) 
(v) V(X), and §=(v): V (2X £3). 

13. (a) A and B throw with one die for a stake of Rs..44 which is to be 
won by the playr who first throws a 6..1f A has the first throw, what are their 
respective expectations? 

Ans. Rs. 24, Rs. 20. 

(b) Acontractor has to choose between two jobs. The first promises a profit 
of Rs. 1,20,000 with a probability of 34 or a loss of Rs. 30,000 due to delays with 
a probability of 4 ; the second promises a profit of Rs. 1,80,000 with a probability 

of 4% ora loss of-Rs. 45,000 with a probability of 2. Which job should the 
vontractor choose so as to maximise his expected profit? 

(c) A‘tandom variable X can assume any positive integral value n witha 
probability porportional to 1/3". Find the expectation of X. 

[Delhi Univ. B.Sc., Oct. 1987] 

14. Three tickets afe chosen at random without. replacement from 100 
tickets numbered 1, 2, ..., 100. Find the mathematical expectation of the sum of the 
numbers on the tickets drawn. 

15. (a) Three urns contain respectavily 3 green and 2 white balls, 5 green 
and 6 white balls ahd 2 green and 4 white balls. One ball is drawn from each urn. 
I ‘tnd the expected number of white balls drawn out. 

Hint. —_ Let us define the r.v. 
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X;=1, if the ball drawn from ith urn is white 
= (0; otherwise 
Then the number of white balls drawn is S =X, + X2 + X3. 


E (S)=E (X,) + E (X2) + E (Xs) = 1x 241 x S41 x2a Re, 

(b) UmA contains 5 cards numbered from 1 to 5 and um B contains 4 cards 
numbered from 1 to 4. One card is drawn from each of these urns. Find the 
probability function of the number which appears on the cards drawn and its 
mathematical expéctation. 

Ans. 11/4. 

16. (a) Thirtcen cards are drawn from its pack simultaneously. If the values 
of aces are 1, face cards 10 and others according to denomination, find the 
expectation of the total score in all the 13 cards. 

[Madurai Univ. B.Sc., Oct. 1990] 

(b) Let X be a random variable with p.d.f. as given below : 

x: 0 1 2 3 
p(x): 1/3 1/2 1/24 = 1/8 
Find the expected value of Y = (X - 1)’. [Aligarh Univ. B.Sc. (Hons.), 1992] 

17. A player tosses 3 fair coins. He wins Rs. 8, if three heads occur; Rs. 3, if 
2 heads occur and Re. 4, if one head occurs. If the game is to be fair, how much 
should he lose, if no heads occur? (Punjab Univ. M.A. (Econ.),. 1987] 

Hint. X : Player’s prize‘in Rs. . 

E(X)=Ixp (x) =%(8+9+3 +4) 
For the game to be fair, we have : 
E (X%)=0 => 20+a=0 = a=-20 


Hence the player Joses Rs. 20, if 
no heads come up. 


18. (a) A coin is tossed until a tail appears. What is the expectation of the 
number of tosses ? 


Ans. 2. 

(b) Find the expectation of (i) the sum, and (ii) the product, of number of points 
on n dice when thrown. 

"Ans. (i) 7n/2, (ii) (7/2)" 

19. (a) Two cards are drawn at random from ten cards numbered 1 to 10. Find 
the expectation of the sum of points on two cards. 

(b) Anurn contains 7 cards marked from 1 to n. Two cards are drawn at a. time. 
Find the mathematical expectation of the product of the numbers on the cards. _ 

(Mysore Univ. B:Sc., 1991] 
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(c) Ina lottery m tickets are drawn out of n tickets numbered from 1 to n. What 
is the expectation of the sum of the squares of numbers drawn ? 

(d) A bag contains n white and 2 black balls. Balls are drawn one by one 
without replacement until a black is drawn. If 0, 1, 2, 3, ... white balls are drawn 
before the first black, a man is to receive 0’, 17, 2”, 32, ... rupees respectively. Find 
his expectation. [Rajasthan Univ. B.Sc., 1992) 

(e) Find the expectation and variance of the number of successes ina series of 
independent trials, the probability of success in the ih trial being p; (: = 1, 2, 
wee, MN). [Nagarjuna Univ. B.Sc., 1991} 

20. Balls are taken one by one out of an urn containing w white and b black 
‘balls untii the first white ball is drawn. Prove that the expectation of the number of 
black balls preceding the first white ball is b/(w + 1). 

[Allahabad Univ. B.Sc. (Hons.), 1992} 

21. (a) X and Y are independent variables with means 10 and 20, and variances 
2 and 3 respectively. Find the variance of 3X + 4Y. 

Ans. 66. 

(b) Obtain the variance of Y = 2X, + 3X2 + 4X3 where X;, X2 and X3 are three 
random variables with means given by 3, 4, 5 respectively, variances by 10, 20, 30 
respectively, and co-variances by Gx, x, = 0, dx,x, = 0, Ox, x, = 5, where ox x, stands 
for the co-variance of X, and X, . 

22. (a) Suppose that X is a random variable for which E (X) = 10 and 
Var (X) = 25. Find the positive values of a and b such that Y=aX¥-b, has 
expectation 0 and variance 1. 

Ans. a@=1/5, b=2 

(b) Let X, and X2 be two stochastic random variables having variances k and 
2 respectively. If the variance of Y = 3X, -X, is 25, find k 

(Poona Univ. B.Sc., 1990) 

Ans, k =7. 

23. A bag contains 2m counters, of which half are marked with odd. numbers 
and half with even numbers, the sum of all the numbers being S. A man is to draw 
two counters. If the sum of the numbers drawn is odd, he is to receive that number 
of rupees, if even he is to pay that number of rupees. Show that his expectation is 
S/{'n (2n - 1) ] rupeés. | (LF.S., 1989) 

24. A jar has n chips numbred 1, 2, ..., n. A person draws a chip, returns it, 
. draws another, returns it, and so on, until a chip is drawn that has been drawn 
before. Let ¥ be the number of drawings. Find E (X). 

Delhi Univ. B.A. (Stat. Hons.), Spl. Course, 1986] 

‘Hint. Obviously P (X > 1) = 1, because we must have at least two draws to 
get the chip which has been drawn before. 

P (X>r) = P [Distinct number on ith draw; i = 1, 2, ..; r ] 
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n 
n 


x 
x 
= | 
x 
x 


n n 
P(X>ny=| 1-4 |f 1-2 |... 1H 0.2.3, 
n n n i(™) 


Hence, using the result in Example 6:28 


E(X) = = P(X>7n 


= P(X > 0) + P(X> 1) + P(X > 2) +... 
ee ee ee ae bee ites | 
n n n 
1 2 r-1 
+ LS 1 at 1 - , {+... (Using (*)] 


25. Acoin is tossed four times. Let X denote the number of times a head 
is followed immediately by a tail. Find the distribution, mean and variance of X . 


Hint, S={H,T7} x {Ht} x {H,7T} x {HT} 
= { HHHH, HHHT, HHTH, HTHH, HTHT.,....., TITT} 


DG 0, 1, l, 1, De or 0 
x 0 I 2 E (X) = %, E(X’?) =% 
pt) “ne %e Yo VarX =%-%6 = Vo. 


26. Anurncontains balls numbered 1, 2, 3. First a ball is drawn from the 
urn and then a fair coin is tossed the number of times as the number shown on the 
drawn ball. Find the expected number of heads. 


[Delhi Univ. B.Sc. (Maths Hons.), 1984] 
Hint. B; : Event of drawing the ball numbered j . 
P(B) = 43; 7 = 1,2,3. 
x No. of heads shown. X 3 — taking the values Q, 1, 2, and 3. 


P(X=x)= = PB). P(X= xIB)=5 P(e xB) 
(x= =1{ P(x=01B) +P (K=018) +P (X=01B)) 
Ee AP a 
eslatdtel oa 


P(X=1)=4[ P(X=11Bi) +P (X= 1B) +P (X=! | Bs)} 


i {i 2 . 3 
“3/2747 8 
e.g. P(X=0| B2)=P[ No head when two coins are tossed ] = % 


P (X= 1 
as cw Sk 1,3 2 
Similarly paxan=3(o+4+5}= 24 


B3) = P[ 1 head when three coins are tossed ] = % 
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ee! 1\_ 1 
PX=3)=3 (04045) =, 
A AO 3 
E(X) = Sx P X= N= 54+ 4 +547! 


27. An urn ane PN white and @N black balls, the total number of balis 
being N, p + q = I. Balls are drawn one by one (without being returned to the 
urn) until a certain number n of balls is reached. 


Let X; = 1, 1f the ith ball drawn is white. 
= 0, if the ith ball drawn is black. 
(i) Show that E (Xi) = p, Var (X;) = pq. 


(it) Show that the ¢o-variance between X; and X; is — SG # k) 


(iti) From (7) and (ti), obtain the variance of S, =X, + 7 +... +X, 
28. Two similar decks of nm distinct cards each are put into random order and 
are matched against each other. Prove, that the probability of having exactly ; 


matches is given by 
neor 
2 Cl P20. 1,2,. 
r ie So kA 


Prove further that the expected number of matches and its variance are equal 
and are independent of n. 
29. (a) if. X and Y are two independent random variables, such that E (X) = 


My, V(X)= of and E (Y) =A., V(Y) = 5%, then prove that 


V(XY) = Oj o3 + Mi o3 + rt: of [Gorakhpur Univ. B.Sc., 1992] 


(b) If X and Y are two independent random variables, show that 


V(X 
(E ONE LE pe = SCF + C+ Cr 


where Gea era 


are the so-called coefficients of variation of X and Y ?[Patna Univ. B.Sc., 1991] 
30..A point P is taken at random.in a line AB of length 2a, all positions of 
the point being equally likely. Show that the expected value of the area of the 


rectangle AP. PB is 2a?/3 and the probability of the area exceeding 1/2a? is 


V2. [Delhi Univ. B.Sc. (Maths Hons.), 1986] 
31. If X is a random variable with E (X) = w satisfying P (X <0) =0 
show that P (X > 2) $ 1/2. [Delhi Univ. B.Sc. (Maths Hons.), 1992] 


OBJECTIVE TYPE QUESTIONS 
1. Fill in the blanks : 
(i) Expected value of a random variable X exists if ......... 
(ii) If E (X* exists then E (X°) also exists for ......... 
(iii) When X is a random variable, expectation of (X—constant)? is mini- 
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mum when the constant is ... 
(iv) E|X -A| is minimum whenA = .... 
(v) Var (c) =..., where c is a constant 
(vi) Var (X¥ +c) =..., where c is a constant 
(vit) Var (aX + b) =..., where a and b are constants. 
(vit) IfX isa r.V. with mean p and variance o’ then 


E ew Sas var(“ Lu — 
(@} Oo 


(ix) [ E (XY) )°.... E(X’).E(Y”). 
(x) V(aX + bY) =... 
where a and b are constants. 
II. Mark the correct answer in the following : 
(i) For two random variables X and Y, the relation 


E (XY) =E (X) E (Y) 


holds good 
(a) if X and ¥ are statistically independent, 
(b) for all X and Y, 
(c) if X and Y are identical. 
(ii) Var (2X + 3) is 
(a) 5 (b) 13 (c}4, if Var X = 1. 
(iii) E (X - k) is minimum when 
(QQkK<E(X), (b)k>E(X), (ce) k=E (X). 
III. Comment on the, following : 
If X and Yare mutually independent variables, then 
(i) E(X¥+Y+1)-EW+1)E(Y)=0 
(iu) X and Y are independent if and only if 
Cov (X, Y) =0 
(itt) For every univariable distribution : 


(a) V(cX)=0V(X) (b+) E(c/X) = c/E (X) 


(tv) Expected value of a r.v. always exists. 


IV. Mark true or fulse with reasons for your answers : ad 


(a) Cov(X,Y=0 => X and Y are independent. 
(b) If Var (X) > Var (Y), then. X + Y and X - Y are dependent. 


(c) If Var (X) = Var (Y) and if 2X¥ + Y and X — Y are independent, whens and 
Y are dependent. 
(d) If Cov (aX + bY, bX +.aY) = ab Var (X + Y), then X and Y are dependent. 
6-8. Moments of Bivariate Probability Distributions. The mathematical 
expectation of a function g (x, y) of two-dimensional random variable (X, Y) with 
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“p.d.f. f(x, y) is given by 


Ele N]=fo of". s@ySfeyardy (6-43) 


(If X and Y are continuous variables) 
= TE xy PX=xuNY=y), (6-43 a) 
i) 


(If X and Y are discrete variables) 
provided the expectation exists. 


In particular, the rth and sth product moment about ongin of the random 
variables X and Y respectively is defined as 


$ oo r $ 
un =E(X'Y)= fo of x’ y fey) dedy 
Or My = EE xi’ yj? P(X=x NY ey) ...(6°44) 
ij 
The joint rth centra) moment of X and sth central moment of Y is given-by 
wn =E|{x-E@} {y-E0} | 


= E[(X- px)! Y-w)'], [EO =o, EY) = wy] ...(6-45) 
In particular 
Hoo =1= poo, io = 0 = por 
pio =E(X) , po =E(Y) 
p20 = Ox? » Bo = Oy? and pit = Cov (X,Y). 
6-9. Conditional Expectation and Conditional Variance. 
Discrete Case. The conditional expectation or mean value of a continuous 
function g (X, Y) given that Y = v,, is defined by 


E{g(X,¥)|Y=y;} = = g (xi, yj) PX =xi|Y=y;) 


z g (xi, yj) P(X =x, Y=y)) 


7 P(Y<=y;) .«.(6°46) 
ie, E[g(X,Y)|Y=y;] is nothing but the expectation of the function 
g ({X, y;) of X w.r.t. the conditional] distribution of X when Y = yj. 
In particular, the conditional expectation of a discrete random variable X given 
Y=y is 
E(X|Y=y)= ra P(X=x;|Y= yj) (6-47) 
The conditional variance of X given Y'= y; is likewise given by 
U 


V(X|Yay) =E[{X-EQX|Y=y) P|Y=y;] ..(6-47a) 
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The conditional expectation of e(X, Y) and the conditional variance of Y given 
X =x; may also be defined in an exactly similar manner. 
Continuous Case. The conditional expectation of g(X, Y) on the pen 


—- 


yy is defined by < 
E{g(X,Y)'Y=y}= f°. g@y) fairly) de 


fr BOWS ey) de 


(6°48) 
fr) 
-In particular, the conditional mean of X given Y = y is defined by 
oo AS y) de 
E(x|Yezy)= 
ee ie) 
Similarly, we define 
foo) 
So Visy)ay : 
== - 6°48 a) 
E Y X =x)= OO val 


The conditj-»nal' variance of X may be defined as 
: 
V(X|Yey)=E [x-Eq! Y=y)| \¥=-y] 
Similarly, we define 
2 
V(y|X=x)-E[|Y-EW|X=2)] |x=x] (6-49) 


Theorem 6-13. The expected value of X is equal to the expectation of the 
conditional expectation of X given Y. Symbolically, 


E (X) +E [E (X|.Y)} : ...(6°50) 
[Calicut Univ. B.Sc. (Main Stat.), 1980] 
Proof. E([E (xX|Y)j=E[ Ex PWexlY=y)] 


P(X=x,0Y=y) 
ELD Bey P(Y=y)) | 


“>| 3{ xi Pen OY} P(Y=y)) 


=z x, P(X=x,NYV=y,) 
> [ x (Z P(X=xi0Y*y)} | 
= Sx, P(¥ =x) = EX). 
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Theorem 6°14. The variance of X can be regarded as consisting of two parts, 
the expectation of the conditional variance and the variance of the conditional 
expectation. Symbolically, 

V(X)=E(VAX|Y]+V ERY) (6°51) 

Proof. E [V(X | Y)]+V[E (X|Y)] 


-E[E@?|/)- (EQIP | 

+E ({EX|Y)}]-(E (EX YH 
=E[E (X*|Y))-E[{E X|Y)}*} 

+ E({E X|Y)}) -(£ {E&Y} 
=E[E(X?|Y)] -(E@Y (c.f. Theorem 6-13) 
= E[Zx;'P(X=x,|Y¥=y)]-[E OY 


-E| x? P(X=xiNY=y)) ~(E (Xx) 


P(Y=y;) 
-5 ») x2 PW eunl«y) P(y=y)|- [ECF 
= z x°E PW=x,NY=y))|-[E(%)/ 


P(Y=yj) 
= Ex;? P(X=x)-[E WP 
= E (X’) - [E (X)’ = Var (X) 


Hence the theorem. . 
Remarks The proofs of Theorems 6-13 and 6-14 for continous r.v.’s X and 
Y are left as an exercise to the reader. 


Theorem 6°15. Let A and B be two mutually exclusive events, then 


E(X|AUB)=-_ WE ari i el?) ...(6°52) 
where by def., 


E W1A)= 5a = x; P(X = xi) 


Proof. E (X|A U\B) = Lx; P(X =x) 


1 
P(A U B) x,ECAUB 
Since A and B are mutually exclusive events, 
Xx P(X=x)= © x,P(X=x)+ © x; P(X =x;) 


xEAUB mEA rE B 


E(X|AUB)= 57 pH IPADEMA)+P BEB) 


Cor. E (X)=P(A)E(X|A)+ P(A)E(X|A) (6°53) 
The corollary follows by putting B = A in the above Theorem. 
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Example 6-31. Two ideal dice are thrown. Let X; be the score on the first die 
and X2 the score on the second-die. Let 'Y denote the maximum of X, and X2, 1.e., 
y= max (X,, X2). 
(i) Write down the joint distribution of Y and X,, 
(tt) Find the mean and variance of Y and co-variance (Y, X:). 
Solution. Each of the random variables X; and X2 can take six values 1, 2, 3, 
4, 5, 6 each with probability 1/6, ie., 
P(X, =)=P(X.=)=% ; t=1,2,3,4,5,6 (1) 
Y = Max (X, X2). 
Obviously 
P(X,=1,Y=j)=0, if j<i=1,2,...,6 


P(X,=i,¥=i)=P(X,=i,Xsi)= 3 P(X, =i,X)=)) 
jel 


= P(X, =i) P(X) =)) 
jel 


"Xi, Xz are independent.) 
2 a oe 
= > reer - £=1,2,..., 6. 
P(X =1,Ysjs)=PM=1,X2.=)) 5 jr 
= P(X, = i) P(Xe= j= 52 feted: Diaac6. 
The joint probability table of X, and Y is given as follows: 


3 5 ] 9 11 
E(Y) = 1. 5g tae t3 td tS +650 


-4f +6 415428 +45 +66] == “ 


3 5 7 Z 11 791 

y? 2 2 2 2 2 2 Tat 
Ae “155+ 2°36 +736 +4 36 *° 36 36 *° 36 * 36 
ee -(1)'. _ 2555 


V(Y=E(Y’) -[(EMy =— 36} 1296 
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EX) == [1 +2434445+6] 2 
E(X,Y)=4 ie 2a 13g04 4+ 5. a t6e 
424644 8. ann 12 
+95 +12so+ 153- + 18.52 36 
+16 + 20. te 
+25. 2 +3055 + 36% 


=x [21 +44 +72 +108 +155 +216] =; ox 616 


Cov, NEN -EQTEN) 
_616 21 161 _ 3696 - 3381 315 


—————= ——— 


~ 36 6°36 £4216 + 216° 
Example 6-32. Let X and Y be two random variables each taking three values 
-=1, 0 and 1, and having the joint probability distribution : 
(i) Show that X and Y have different expectations. 


(it) Prove that X and Y are uncorrelated. 
(itt) Find Var X and Var Y. 
(iv) Given that Y = 0, what is the conditional probability distribution of X ? 
(v) Find V(Y |X = -1). 
Solution. (i) E (Y) = 2 pj y; = - 1(-2) + 0(-6) + 1(:2) = 0 
E (X) = p; x; = - 1(-2) + 0(-4) + 1(-4) = -2 
ke E(X)#E(Y) 
(ii) E (XY) = py yi x; 
= (— 1)(- 1)(0) + O(- 1)(-1) + 1-1-1) 
+ O(— 1)(-2) + 0(0)(-2) + 0(1)(-2) 
+ 1(— 1)(0) + 1(0)(-1) + 1(4)(-1) 
=-—0:1+01=0 


Cov (X, Y) = E (XY) --E (X)E (Y) =0 
=> Xand Y are uncorrelated (cf $ 10:5) 
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(iii) E(Y?) = (- 1)°(-2) + 0(-6) + 1°(-2) =-4 
V(Y)=E(Y*)-[E(Y) = 4 

E (X?) = (- 1)°(-2) + 0(-4) + 1°(-4) = -24+-4=6 
V(X)=6- 04= -56 
e te ta = 
fx) Pe ~1]¥=0)-P FS ANY=) 

P(X=0NY=0). 2 

P(Y=0) 


P(¥=1 | Y=0)=————-- + -=-.- 


P(X=0| Y=0)= 


~W 


(v) V(¥|X=~-1)sE(¥|X=-1)-{E(¥|X=-1)} 

E(Y|X =-1)=ZyP(Y=y|X=- 1) =(— 1)0 + 0(-2) + 100) = 0 
y 

E(Y|X=-1)=Zy’P(Yey|X=- 1) = 1(0) + 0(-2) +10) =0 
V(Y|X¥=-1)=0. 

Example 6°33. Two tetrahedra with sides numbered i to 4 are tossed. Let X 
denote the number on the downturned face of the first tetrahedron and Y denote 
the larger of the downturned numbers. Investigate the following : 

(a) Joint density function of X, Y and marginals fy and fy , 

(b) P {Xs 2,Ys 3}, (c)p (X,Y), (aE(Y|X=2), 

(e) Construct joint density different from that in part (a) but possessing same 
marginals fy and fy . [Delhi Univ. B.A. (Stat. Hons.), Spl. Course, 1985] 

Hint. The sample space is S = {1, 2,3, 4} x {1, 2,3, 4} and each of the 16 
sample points (outcomes) has probability p = Vie of occurrence. 

LetX : Number on the first dice and Y: Larger of the numbers on the two dice. 
Then the above 16 sample points, in that order, give the following distribution of 
Xand Y. 

Sample Point : (1,1) (1,2) (1,3) (1,4) (2,1) (2,2) (2,3) (2,4) 
: 1 1 1 1 2 2. 2 2 


x ‘ 

Y > 1 2 3 4 #2 2 3 4 
Sample Point : (3,1) (3,2) (3,3) (3,4) (4,1) (4,2) (4,3) (4,4) 

X > 3 3 3 3 4 4 4 4 

Y > 3 3 3 4 4 4 4 4 


Since each sample point has probability p = 16, the joirit density functions of 
X and Y and the marginal} densities fy and fy are given on page 6°61. 

Here p = 6. 

(b) P(Xs2,Ys3)=<p+pt2p+pt+p=6p=%. 


() Var X) @ EX? - (EWP =F -2=5 (Try it) 
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Var (Y) = EY?-[E (Y= al = 3 (Try it) 


Cov (X, ¥) = E (XY) - — ee or Cry it) 
p (xX, Y)- 4 
V4 x 5564 “Ta 
(d) E(Y|X=2)=3y.fy|x=2)=3y LEa20y 
=4.2yf (2, y)=4 [0+ 4p+3p+ 4p) = 44p = + 
(e) Let 0 < € <p. The joint density of X and Y given in (e) above is different 
from that in (a) but has the same marginals as in (a). 


Example 6-34. (a) Given two variates X, and X, with joint density function 
f (41, X2), prove that conditional mean of X, (givenX,) coincides with (uncondition- 
al) mean only if X, and X2 are independent (stochastically). 

(b) Let f(x, x2) = 2ixi{ x2, 0<x, <x, <1, and zero elsewhere be the taint 
p.af. of X; and X,. Find the conditional mean and variance of X, given X2 = x2, 
0<x,< 1. [Delhi Univ. M.A. (Eco.), 1986] 

Solution. (a) Conditional] mean of X2 given X; is given by : 

E (X;|X, ='x,) = f Xo f (x2 | x1) de 
a ..(*) 
where f (x2 | x1) is conditional p.d.f. of X2 given X, = x1. 
But thé joint p.d.f. of X, and X, is given by, 
f (1, X2) = fi. Or) Ff (2,| 41) 


= fea}x) =D 


where f,(.) is marginal p.d.f..of X1. 
Substituting in (*), we get 
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yey X2f 1X2) | gy alae 
Elkan S| EGS | (**) 
Unconditional mean of X2 is given by 

E (X2) = fix fr 2) dra, 11(#**) 


x2 


where f,(-) is marginal p.d.f. of X2. 
From (**) and (***), we conclude that the conditional mean of X, (given X) 
will coincide with unconditional] mean of X2 only if 


(x1, 42 
Cee «fr (x) 
=> f (1, X2) = fi G1) - fa 2) 
i.e., if X; and X; are (stochastically) independent. 
(b) f (1, x2) = 21 x1’ x2’; 0 <x, <x,<1 
= 0, otherwise 
Marginal] p.d.f. of an is given by 


fr 2) = “f feu) = 21 x, fx dx, 


Ly 


—w 


_ 3 
=21 x2 3 


ae: 0<x,<1 


‘, Conditional p.d.f. of X1 aes X2) is given by 


fi (x1 jy “3253 0<x1<%23;0<x2< 1 


Conditional mean of X;, is 
x2 x2 


sitio teailanentes dx 
2 0 
s 2 3x2 


x 
4 pe re 3 O<x <1 


x2 
Now 


*2 ®2 
3 
E (X1"|X2=22) = f x’ f (x1 | x2) des = 5 fx des 
0 “2 0 


2 
Var (X, | X2 =xX,)=E (X,? |X; =X,)- [E (X,-| X2 = X2) 
3 : 
= 5 x," - 16 x = 30 
Example 6-35. Two random variables X and Y.have the followinig joint 
probability density function : 


—yx" 3; 0<x, <1. 
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f@,y)=2-x-y; Osxs1, Osysl 
=0, otherwise 
Find (i) Marginal probability density functions of X and Y. 
(tt) Conditional density functions. 
(itt) Var (X) and Var (Y). 
(iv) Co-variance between X and Y. 
(Dibrugarh Univ. B.Sc. (Hons.), 199}] 


Solution. (i) fy (x) = f oe f (x, y) dy 
=f Q-x-y)dy=3- 


f(x) => —», O<x<1 
* otherwise 
Similarly f(y) = -y, O<y<1 


=(), otherwise 


(i) frre ]y) =) 2 =*—) og yet 


fry) M&-y) ’ 
foxy |x) = ao) ere 0<(x,y)<1 
EM =f) rh @denfy x(5-x]ares 
E(Yy)= “So yhoa=for(5- yes 
f 4)1 
(i) EX)=fye(5 ee Gees 
V(X) = Pia [EWP 4 -ett 
Similarly V(Y)= i aa 
(iv) E(XY) =f of 9 2 (2-x-y) de dy 
af! - 2.2 i 
if? HY ay 
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yy itl 

3 6/)0 6 

Cov (X, Y) = E (XY) -E(X)E (Y) = 5-: 12 ~~ 144° 


Example 6:36. Let f (x, y) = 8xy,0<x<y<1;f(x, y) =0, elsewhere. Find 
(a) E(¥|X =x), (6) E (XY |X =x), (c) Var (¥ |X =x). [Calcutta Univ. 
B.Sc. (Maths Hons.), 1988; Delhi Univ. B.Sc. (Maths Hons.), 1990}: 


Solution. fr(x)=f_., f(x, y) dy 
-3xf' y dy 
= 4x (1-x’),0<x<1 
frv= fo f(x y) dx 


= By fg xde 
= 4y’,0<y<1 
ferw Oly) = Fed? » Shi) =5 — Oxxcyst. 
oo eetxnnnfty(Ba)on3{ ES) 34 
(b)  EQY|X=x)=xE(Y|X= yo} Ei tesx) 
2 cornea ards 


Var (Y|X =x) =E (Y?|X=x)-(E(¥|X=x)] 


14x 4 (lexex'y 


a 


2 9 (14x 


EXERCISE 6(b) 
1. The joint probability distribution of X and Y is given by the following tablé:. 


178 1/24 = 1/12 


14 1/44 0 
1/8 1/24 1/12 


(t) Find the marginal probability distribtition of Y. 
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(ii) Find the conditional distribution of Y given that X = 2, 
(iit) Find the covariance of X and Y, and 
(iv) Are X and Y independent ? 
2. A fair coin is tossed four times. LetX denote the number of heads occurring 
and let Y denote the longest string of heads occurring. 
(i) Determine the joint distribution of X and Y, and (ii) Find Cov (X, Y). 


Hint. 


2/16 1/16 


(it) Cov (X, Y) = 0°85. 
3. X and Y are jointly discrete random variables with probability function 
p (x, y) = 1/4 at (x, y) = (-3, - 5), (- 1, - 1), 1, 1), G, 5) 
=0, otherwise 

Compute E (X); E (Y), E (XY) and E (X| Y). Are X and Y independent ? 

4, X, and X2,have a bivariate distribution given by 

P(X 2x, NX2 =x) i , where (x1, X2) = (1, 1), (1, 2), (2, 1), (2, 2) 

Find the conditional mean and variance of X, given X2 = 2. 

5. Two random variablesX and Y have the following joint probability density 
function : 

f@, y)=k(4-x-y); Osx52; Osys2 
= Q, otherwise 

Find (i) the constant k, 

(it) marginal] density functions of X and Y,, 

(ut) conditional density functions, and 

(iv) Var (X), Var (Y) and Cov (X,Y). (Poona Univ. B.Sc., Oct. 1991) 
6. Let the joint probability density function of the random variables X and Y 
i : 

fii y)=2(x+y-3xy); O<x<1,0<y<1 
=(Q, otherwise 
(i) Find the marginal distributions of X and Y. 


(ii) Is E (XY) = E (XY) E(Y)? 
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(iii) Find E(X+/Y) and E(X-Y). [Calicut Univ. B.Sc., Oct. 1990] 

7. (a) LetX and Y have the joint probability density function 

f(x,y) F2,0<x<y<l 

= 0, otherwise 

Show that the conditional mean and variance of X given Y¥ = y are y/2 and 
y?/12 respectively. 

(b) If fx, y)=2; O<x<y,0<y<] 

Find () E(YIX), (ME(XIY). 

8. Ge an example to show that E (Y) may not exist though E (XY) and 
E(Y'X) may bothexist? [Dethi Univ. B.A. (Stat. Hons.) Spl. Course, 1985] 

Hint. Consider the joint p.d.f. : 

fix ypex. e799. y 20,720. 
= 0, otherwise. 
Then we shall get : 


Sx (x) = J fayayeetsx > 0 
0 


r i 
fr) = J fon») deo y20 


fQlx) = 7 a hi "sy 20 


E(¥) = yfQ)=% =>  E(Y) does not exist, 


E(XY) = ils Ste y)de dy 
00 


E(Y|X=x) = y.fOlxdy=4 


= Both E (XY) nae (E(YI\X= x) exist, though E (Y) does not exist. 

9. Three coins are tossed. Let X denote the number of heads on the first 
two coins, Y denote the number of tails on the last two and Z denote the number 
of heads on the last two. ; 

(a) Find the joint distribution of (i) X and Y, (ii) X and Z. 

(6) Find the conditional distribution of Y given X = 1. 

(c) FindE(ZIX=1). 

(d) Find px,y and px,z. 

(e) Give a joint distribution that is not the joint distribution of X and Z in 

(a) and yet has the same marginals as f. (x, 2) has in part (a). 
[Delhi Univ. B.Sc. (Maths Hons,.), 1989]. 

Hint. The sample space is S ’ ={H, T} x | H, T}x | H, T} 
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= {H, T) x (HH, HT, TH, TT} 


and each of the 8 sample: points (outcomes) has the probability p = 1/8 of 
occurrence. 


X : Number of heads on the Ist two coins. 
Y : Number of tails on the last two coins. 
Z “Number of heads on the last two coins. 


Then the distribution of X, Y and Z is given below : 


Sample Point:. HHH HHT HTH HTT THH THT TTH TTT 
Probability p p p 


P P P p P 
X 2 2 l ] I l 0 0 
Y 0 l 1 2 0 | I 2 
Z 2 | l 0 Z.7 1 | 0 


Joint Distribution of X and Z 


1/8 1/8 


1/8 2/8 


P(¥=0,X=1)_18_. 
(bt) P(¥s01X= 12h WEOX= 1) ve! 


P(X=1)  I2~4 
] 
Similarly, P(Y=1/X= I) =78=5 5 P (Y= X= N=R=4 
1/8 2/8 1/8 
(c) E(Z|X=1)=22.P(Z1X=1)=O0x +1 x +2 He 
(d) dy =o KY ald 
xy = — = — 
OxSy Vinxain 22> 
Cov (X, Z) -1/4 J 


XZ = SS 


(e) Let 0 < € $ 1/8. The joint probability distribution of (X, Z) given belc 
has the same marginals as in part (a). 


1/4 
UB+E 1/8 - 1/2. 
es € 1/8 + a oF 
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10. Let fy (x, y)=e"'”: O<x<%, 0<y<u 
Find : 
(a) P(X >1) (d) m sothatP (¥+Y<m)=\% 
(b) P(1<X+Y<2) (e) P(0<X¥<1|Y=2) 
(c) P(X <Y|X < 2Y) (f) Pxy 
Ans. fy (x) =e"; x20; fr(y)=e"; y20 
(a) 1/e (b) Hint. X+Yisa Gamma variabte with parameter n = 2. 
[See Chapter 8] (2/e - 3/e’). 
(ROSY Nek) = P(X¥<2Y)  P(¥<2Y) % 4 
(d) Use hint in (b). e"(1+m)=%;  (e) (e- 1)/e 
() fa (x, y=fr@) fr(y) = Xand Yare independent => pywv=0. 
11. The joint p.d.f. of X and Y is given by : 
f@%, y)=3Q@+y); Osx51,0sys1;, Osxtysi 


Find: (a) Marginal density of X. (by P(X¥+Y<%) 
(c) E(¥|X =x) (d) Cov (X,Y) 5 
Ans. (a) f(x) => (1-22) - Osx. 
0-S OS-~x 1 
()P(X+¥<M=f | f 3 (x +y) dy de= = 
0 0 
l-x 
(1 -x) (x + 2) oe : mS 
Cerro a @ean-f| fx £0,y)dy de = 
1 3 3 13 
Cov (x, y) = EY) - EQ) E (VY) =55- oxo =~ 355: 


6:10. Moment Generating Function. The moment generating, functioga 
(m.g.f.), of a. random variable X (about -origin) having the probability function 
f(x) is given by 
M(t) =E(e)=f, &® f(x) dr, 

(for continuous probability distribution) 


ay ¥ e* f (x), 


(for discrete probability distribution) 
the integration or Summation being extended to the entire range of x, ¢ being the 
teal parameterand it is being assumed that the right-hand side of (6°54) is absolutely 
convergent for some positive number / such that = h < ¢<h. Thus 


x’, Cx” 
Mx (the E(e)=E 7 tT Hoe 


2 r 
=1+tE(X) + E(X?) Ht SEX) Pav 


(6:54)... 
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r 


cor ‘eer 
=1+fp, +77 be Ft Uy, +... .-(6°55) 


where w=E(X%)= f x" f (x) dx, for continuous distribution 
= Lx" p(x), for discrete distribution, 


? 


‘ nee : ft. 
is the rth moment of X about origin. Thus we see that the coefficient of 7 in 


Mx(t) gives p, (above origin). Since My (t) generates moments, it is known as 
moment generating function. 


Differentiating (6-55) w.r.t. ¢ and then putting ¢ = 0, we get 
2 


. to 
Fr { Mx (‘) } ; rn 


«a0 


=> (6°56) 


tu 
In general, the moment ‘sites function of X about the point X =a is 
defined as | 


My (2) (about X = a) = = Ele ef Xa) 
-E|141X- ber “(X= a)?+ et Way t 


=z1+tp,' +t m+ on (6°57) 
where 1,’ = E {.(¥ - a)'}, is the rth moment-about the point X = a. 

610-1. Some Limitations of Moment Generating Functions. 
Moment generating function suffers from some drawbacks which have restricted 
its use in Statistics. We give below some of the deficiencies of m.g.f.’s with 
illustrative examples. 

1. A random variable X may have no moments although its m.g.f. exists. For 
example, iet us consider a wena r.v. with probability function 


f@)s= Pres tae 
= 0, otherwise 
1 
| Here EQ)= = Xx f(x) = 2 GD 
: ee 
Sd Lidetes oa 
es 
xe] X 


Since < : is a divergent series, E (X) does not exist and consequently no 


xel 


moment of X exists. However, the mgt. of X is given by 
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a a oe e& 
Mx) 2 2 f(x) a x(x +1) 
@ z* 
= ete)’ G2) (*) 
Pe Se An SA SE 
12 °23°34°45 °°" 
etal leglt 1 est 14 
=z] 1 3 |+7(5 3|*713 ; +24 5|+ 
eloazeree,. |.4_2_2_2 
2 3 4 23 4 °5 
if7 2 2“ 
= ~ log (1-2) -— 23 at 
= - log (1-2) +147 log (1 - z), |z| <1 
-1+[7-1 log (1 -z), |z|<1 


=1+(e'-1)log(1-e), «<0 
[-o|z|<1 => |e|<1 => 1<0] 
And My(t)=1, fort=0, [From (*)] 
while fort > 0, My (t) does not exist. 
2. A random variable.X can have m.g.f. and some (or all) moments, yet the 
m.g.f. does not generate the moments. For example, consider a discrete r.v. with 
probability function 


-1 
P(Xs 7 ree x=0,1,2,. 


Here E(X')=3 Q2*)'P(X¥=2%)nets 2? 
E=0 E=0 xX 


ae! .exp (2) =exp (2"-1) 
Hence all the moments of X exist. 
The m.g.f. of X, if it exists is given by 


r) -1 oo 
Mx(t)= = exp ¢.2)[] 20" 2 expt. 2) 
x=0 x. x=0 Xe 


By D’Alembert’s ratio test, the series on the R.H.S. converges for ts 0 arid 
diverges for t>0. Hence M,y(t) cannot be differentiated at.¢=0 and has no 
Maclaurin’s expansion and consequently it does not.generate moments. 

3. Ar.v.X can have all or some moments; but m.g.f. does ne exist except 
perhaps at one point. 

For example, let_X be a r.v. with probability function 
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~1 
P(X=22')=—_; y =0, 1, 2,. 


= 0, sicwae 
Since the distribution is symmetric about the line _X = 0, all moments of odd order 
about origin vanish, /.e., 
E(X**)= . = Porst =0 
a 2 1 a (27 y 
E(X*%)= 5 (#2) (sect) -s 


x20 - € a0 v! 


-- . exp (2”) = exp (2” - 1) 
Thus all the moments of X exist. The m. : f. of X, if it exists, is given by 
Mx (t) = : a (el +e" ) 


sige . | sast2) 


tT 


x20 
which converges only for ¢ = 0. 
As an illustration of a continuous probability: cistnbunon, consider Pareto 
distribution with o.d.f. 
a® 
PCy ani ; x2a;0>1 


: ‘ -0 lo 
\ 6 w79-! ~ 6 Peg 
E(X}=0.a° f de=6.a°.)-—5 | 
which is finite iff r - @ <0 => @>r and then 
7e-8 ae 
8 a 6.a 
E(X’)=69a 0-"~> Q-r? @>r 


However, the m.g.f. is giv n by : 


= te 


Mx(t)=0.a° f ari dx , 
which does not exist, since ¢” dominates x°*' and (e/ x°*') > 0 as ¥ —>- and 
hence the integral is not convergent. 

For more illustrations see Student’s t-distnbution and Snedecor’s F-distribu- 
tions, for which m.g.f.’s do not exist, though the moments of all orders exist. (c.f. 
Chapter 14, § 14-2-4 and 145-2, ]. 

Remark. The reas6n that m.g.f. is a poor tool in comparison with characteristic 

function (c.f. § 6°12) 18 that the domain of the dummy parameter ‘t’ of the m.g.f. 
depends on the distribution of the r.v. under consideration, while ‘characteristic 
function exists for all real t, (~ © <1< oo), If m.g.f. is valid for ¢ lying in an interval 
containing zero, then m.g.f. can be expanded with perhaps some additional restnic- 
tions, 
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6:10-2. Theorems on Moment Generating Functions. 
Theorem 6:17. Mey (t) = Mx (ct), c being a constant. ...(6°58) 
Proof. By def., 
L.H.S. = Mex (t) = E (e™) 
R.HLS. = My (ct) = E (e%*) = LHS. 
Theorem 6-18. The moment generating function of the sum of a number of 


independent random variables is equal to the product of their respective moment 
generating functions. 


Symbolically, if X1,.X2,-:.,A, are independent random variables, then the 
moment generating function of theif sum X, +X) + ... +.X,is given by 
_  Myxexye +X, (t) = My, (1) My, (t) ... My, (¢) .-(6°59) 
Prvof. By definition, 
My. er (t) =E | e! (X, + X24....4X,) 
=E| e* of® e'*| 
= E (e'™") E (e'*) ... E (e'™) 


( °.- X1, Xo, ..., X, are independent) 
8 My, (2). My, (¢) ... My (t) 


- 


Hence the theorem. 


Theorem 6-19. Effect of change of origin and scale on M.G.F. Let us 
transform X to the new variable U by changing both the origin and scale in X as 
follows : 


X-a 
U= ae where @ and /: are constants 


M.G.F. of U (about origin) is given by 
Mu (t) = E (e%) = E[ exp{ t(x-a)/h }]), 
=E| olkihi a ah set EF ( et’) 
= Ea) = eo My (th) ...(6°60) 
where My (t) is the m.g.f. of X about origin. 


In particular, if we take a = E (X) = p (say) and h = ox = 0 (say),:then 


= Z (say), 
p, ¢ 
is known as a Standard variate. Thus the n.g.f. of a standard variate Z is given by 
Mz (t) =e ””° My (t/o) ...(6°61) 
Remark. E (Z) -£ (=) --E(X- 7) 


1 1 
=5 EF @)-pac-p)=0 
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avi tct\tyy_y 

and V@ v( a a er! [c,f. Cor. (i) Theorem 6-8} 
sib =+ 
=GVM)a5o=1 Ic f, Cor. (iii) Theorem 6-8} 


- E (Z)=90 and V be ig., the mean and variance of a standard 
variate are Q and 1 respectively. 


610-3. Uniqueness Theorem of Moment Generating Function. The 
moment generating function of a distribution, if it exists, uniquelly determines the 
distribution. This implies that corresponding to a given probability distribution, 
there is only one m.g.f. (provided it éxists) and corresponding to a given m.g.f., 
there is only one probability distribution. Hence My (f)= My (t) => X and Y are 
identically distributed. [For detailed discussion, see Uniqueness Theorem of Char- 
acteristic Functions — Theorem 6-27, page 6-90] 

6:11. Cumulants. Cumulants generating function K (¢) is defined as 

Kx (O= log. My (6), ..(6-62) 
provided the right-hand side can be expanded as a convergent series in powers of 
t. Thus 


Ky (j= aK emt. see KTS + oo = 10g Mx (0 
I | 1+ rb 
= log wraMe Tew ST we +H, rite .-.(6°62a) 
where x,= coefficient of “in Kx (@ 18 called the rth cumulant. Hence 


2 3 4 
eT a OB 


7° 
=|’ b+ pe’ Sawt x + | 
°° 
Hw tid oy +a at } 
t? wd if TT a | 
Alu (+b sit oe Hu t+ [bo S| +... 


Comparing the coefficients of like powers of ‘¢ on: both sides, we get the 
relationship between the moments and cumulants. Hence, we have 


air eee 


2 
Ki = par’ = Mean, 57 = 5S => Ke =p py) =p 


Ms) 1 2p ps ay? 


ull 
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Also 
Ka pt | Une , 2's |, V3 yu? ye pr” 
4! 4 2\ 4 3! 3 2 4 
=> Ke = fa’ — 3 pte? — 4 py’ pt’ + 12 py” pr’ ~ 6 py 
= (pa = Spey’ py’ + 6pte” pa? — 3p) — 3¢ pe — Qpta” a? + pr) 
— = Png 2 = y os 2 
= fy — 3C Ha’ — pa)" = fa — 3p = fy — 3K (+. [lp = Xe) 
=> [ls = Ka + 3 Ky” 
Hence we have obtained: 
. Mean= x, 
[2 = K, = Variance 
os | ...(6-626) 
bls = Ky + 3 Ka" 


Remarks, 1. These results are of fundamental importance and should be 
committed to memory. 


2. If we differentiate both sides of (6-62a) w.r.t ¢ ‘r’ times and then put 
‘= 0, we get 


co| Sok o] . (6-626) 


611-1. Additive Property of Camulants. The rth cumulant of the sum of 
independent random variables is equa! to the sum of the rth cumulants of the 
individual variables. Symbolically, 


Ke (X, # X34... + Xu) = Ky (Xi) + K, (X2) +... + K(X, ), ...(6°63) 
where X;; i= 1,2, ..., 2 are independent random variables. 
Proof. We have, since X; ’s are independent. 
My, X24... +Xq (0) = My, () Mx, (2) eee My, (4) 
Taking logarithm of both sides, we get 
Kx, +x2+...+%» ()= Kx, + Kx, © + ... + Kx, © 
Differentiating both sides w.r.t. ¢ ‘r’ times and putting ¢ = 0, we get 


d’ d’ 
Fe dt’ ~ Kx, +x. .+Xq (a) | - ae’ — Kx, (t) 7 
+} ky aK o| 
dt 4 2 - Xa 


=> K(X + X24... +X) HK Ki) + K (Xd) +.. ake 
which establishes the result. 


6-11-2. Effect of Change of Origin and Scale on Cumulants. If we take 
=* ~4 ~at/H)Mz (t/h) 
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Ku()=- + Kx@/h) 


; 1? mr i at 
Ky 4p-Ky! + Ka ate .t+K, ao BS K, (th) 


+x, LL +...¢, a) +... 


where k,’ and «, are the rth cumulants of U and X respectively. 

Comparing coefficients, we get 

,_ Ein 4a Fa 
Ky = h and ere -r=2,3,... ...(6-63a) 

Thus we see that except the first cumulant, all cumulants are independent of 
change of origin. But the cumulants are not invariant of change of scale as the rth 
cumulant of U is (1/h’) times the rth cumulant of the distribiution of X. 

Example 6:37. Let the random variable X assume the value ‘r' with the 


probability law : 
P(X=r)=9'"'p; r=1,2,3.,... 
Find the m.g f. of X and hence its mean and variance. 
(Calicut Univ. B.Sc., Oct. 1992] 


Solution. My (t) = E (e”) 
xe q’h pat (ae y 


r=1 


tard 


r= 


= eget > (ety! = pet [1 + get + (ge? +...] 
1 


= (7) 
1 - ge' 
If dash (’) denotes the differentiation w.r.t. t; then we have 
d (1 + ge’) 
Mx’ (t) = —"—— ,, Mx” (0 = pe! : 
11,’ (about origin) = Mx’ (0)=—2 = + 
(1-9) p 
[2’ (about origin) = Mx” (0) = oP C4). teal 
(l-qy p 
Hence mean = jt)’ (about origin) = 


: ; ; + 
and variance = plo = pz — ph ant 


Example 6-38. The probability density igen of t 
foltows the following probability law : 


om variable X 
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itt ette Ol): creeg ees 


Find M.GF. of X. Hence or otherwise find E (X) and V (X). 


{Punjab Univ. M.A. (Eco. ), 1991] 
Sclution. The moment generating function of X is 


Mx (t)= ~¢~ | 35 75 (- = ae hs 


0 
-fi vo( 19521) a 
; ] ) 
For xe (— 9, 8), x-98<0 => O0-x>0 


.. |x-9|=O0—x VxeE (— 0%, 09) 
Similarly, i ates V xe (0,00) 


My (1) = 20 J exp E [045 )|a+36 J exp [-2(5-+J]@ 


-1 


- oye 0G a] 


a ales 
+76 ° i_, exp| 04 ‘| 
] 
_ e”. ‘ & _ e* 
~2(0t+i) 21-6) 1-0%7 
=e" (1-071 i 
07? y ay | 4.4 
alae | i (14+ 04+ 0% +...] 
367? 
=1+O0s+ v +... 


E (X) = w’ = Coefficient ae in Mx (O= Q 


bla’ = Coefficient ome aT in — ()= 36° 


Hence Var (X) =) —'?=3 6-67 =26" 
Example 6-39. If the moments of a variate X-are definéd by 
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E(X')=0-6: r=1,2.3.... 
show that P(X =0) =0-4, P{X = 1) = 0-6, P (X22) =0. 


[Delhi Univ. B.Sc. (Maths Hons -), 1985] 
Solution. The m.g.f. of variate X is : 


.-) t’ : c--) ra 
Mx (t) = = “Walt = 1 (0°6) 
=0-44+06 — = 04406 é (i) 
720 ° 
But My ()=E(e*)= = e* P(X=x) 
xrs0 


=P(X=0)+e.P(X=1)+ = e%.P(X=x) ..(ii) 
re2 


From (i) and (ii), we get: 
P(X =0)=0-4 ; P(X¥=1)=06 ; P(X¥22)<=0. 
Remark. In fact (i) is the m.g.f. of Bernoulli variate X with P (X =0) = 
q = 0-4 and P (X = 1) = p = 066 [See § 7-1-2] and P (X 2 2) = 0. 


Example 6-40. Find the moment generating function of the random variable 
whose moments are 
=(r+1)!2’ 
Solution. The: m. gf f. is given by 


tf 


MENS = 6 is = ee z an (r+1)!2’ 


e > (r+ 1) (20)’ 
Mx (t) = T4200) 53 (20744 (2) +. 


=(1-2t)? 
Aliter. The R.HS. is an arithmetic-geometric series with ratio r = (20) 
Let S=14#2r +3744 +... 
Then rS = r+2r4+3r 4+... 
1 
= 2 SSF!" 
(1-r)S=l¢+rere+... G-n 
1 : 3 
=> § =—— —=(1-ry’ =(1-2/°° 
(i-r) 


Remark. This is the m.g.f. of Chi-square (’) distribution with parameter 
(degrees of freedom) n = 2 [c.f. Chapter 13]. 
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Example 6°41. [f 1,’ is the rth moment about origin, prove that 
é 
, r-] 
LH, = 2 ( | ) a, Kj, 
where K; ts the jth cumulant. 


Solution. Differentiating both sides of (6-62a) in § 6-11, page 6-72 w.r.t. 


t, we get 

(2 -! 
Ky t Ke? K3n tea Re a ee 
Ba ae? re "(r-1)! 


4 4 4 ?2 + Uh - 
rE a 
+ ply f + fio 2 { + eee + pl, + eee 


2 r-| 
=> Geaetee ate en ce 


ft? rv 
x[1 + yo t+ Ha a + tn Se..,| 
Pigott ,t ieee sew 
= My + Hy f+ Ms Dt ee + Me (F—1) pte. 


t 


r-] 
Gat on both sides, we get 


Comparing the coefficient of 
, , , r—! , 
y= Ky Wey + (r= 1) Kp a + ( 2 Jig What tk, 
("3 | , ‘(r- 1 , r— | , 
= 0 ) mea + ( | H,_2 Ko + 9) ) wees ks 
r-l\ , 
se (‘ - 1) BS K, 


j=l 
which is the required result. 
, es @ ° ° 
6°12. Characteristic Function. In some cases m.g.f. does not exist, since 


oo 


the integral | e!*- f(x) dx or the series & e* p (x) does not converge absolutely 
—0o x 


for real values of t for some distributions. For example, for the continuous 
probability distribution 


dF (x) ‘= ! 


C a 
(1 + a ad 
the m.g.f. does not exist, since the integral 


dx;m>1,-°2%<x<0,. 


| an 
“ + x2" dx, 


My (6) -c| é 
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does not converge absolutely for finite positive values of m because the function 
e™ dominates the function x” so that e*/x"" — 0 as x —) ©, 
Again, for the discrete probability distribution 


SG) =—35 3x0 1,2,3,... | 


= (0), elsewhere 
ss ~ ( io 
My () = Eerfa=S > S| 
z - xe2l 
The series is not convergent (oy D’Alembert’s Ratio Test) for ¢ > 0. Thus there 
does not exist a positive number A such that Mx (¢) exists for—h<t<h. Hence 
Mx (t) does not exist in this case also. 


A more serviceable function than the m.g.f. is what is known as characteristic 
function and is'defined as 

gx (= Ee) =f el F(x) dx 

(for continuous probability distributions) 

= Le f(x) 

ae (for discrete probability distributions) 

...(6-64) 

If Fy (x) is the distsribution function of a continuous random variable X, then 


ox()= J" oi oF (x) A6-64a) 


Obviously 6 (¢) is a complex valued function of real variable ¢. It may be noted 
that 


lo@ I= 


| £00) de < [ie If) dx=[ f(x) de= l, 


sin 2 |=| cos tx+isin |’? = (cos? wx + sin’ ox)! = 1 
Since 4 > (t)|s 1 cei function x(t) always exists,- 
Yet another advanta ge of characteristic funcuon lies intne fact that it uniquel deter- 


mines the distribution function, é.e., if the characteristic function of a distribution 

is given, the distribution can be uniquely determined by the theorem, known as the 

Uniqueness Theorem of Characteristic Functions (c.f. Theorem 6-27 page 6-90}. 
6-12:1. Properties of Characteristic Functions. For all real'‘¢’, we have 


(i) 6@)=J°_ dF (@)=1 (6-646) 


(ii) 19@Mi<s1=9(0) ..(6-64AC) 
(iti) > (¢) is continuous everywhere, i.e., > (¢) is\a continuous function of tin 
(— ©0, oo), Rather © (‘) is uniformly continues in ‘¢’ 


Proof. For h #0, | $x (t+ 4) — $x (0) | = ae [eosin ct] arog 
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s| 
The last integral does not depend on ¢. If it tends to zero as > 0 then 
$y (4) is uniformly. continuous in *?’. 


Hence by Dominated Convergence Theorem (D.C.T.), taking the limit 
inside the integral sign in.(*), we get 


eltx (efx a2! 1) | dF (x) 


efhx _ | | dF (x) ii(®) 


Now ex) 4+1=2 


ethx macil | < 


efhx — | | aF (x) <2 | dF (x) = 2. 


ry 


oo 


tin [oreo e000] [By 
=> jim, by (tth=oy OV. : 


Hence $x (#) is uniformly continuous in ‘t’. 


efx — | | dF (x) =0 


(iv) by (2) and 0, (t) are conjugate functions, ie,, dy (-f) = oy (4), where a 
is the complex conjugate of a. 
Proof. by (t) = E (eX) = E [cos tX + isin tX] 
=> dy, (t) =E [cos tX —i sin tx] 
= E [cos (-t).X + i sin (-t) X] 
= E (e“™) =, (-?). 
6°12:2. Theorems on Characteristic Function. 


Theorem 6°20. /f the distribution function of a r.v. X is symmetrical about 
zero, i.e., 
I-F@)=FCxX) = f)afO), 
then dy (t) ts real valued and even function of t. 
Proof. By definition we have 


ox @ = | ee | es eos, (x=~y) 
= [ ef Ody Lf) =f0)) 
= dy (-t) st) 

=> dy (f) is an even function of t. 
Also by (=o) ~ (c.f. Property (iv) § 6-12-1] 
bx (t) = dy (-t) = ox (From *) 


Hence dy (f) is‘a real valued function of ¢. 


Theorem 6°21. If X is some random variable with characteristic function 
"Ox (t), and if p1,’ = E (X’) exists, then 
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=(-i) |p 79 o| 
=0 .-.(6°65) 
Proof. o()= f eg x F(x) de 


Differentiating (under the integral sign) ‘r’ times w.r.t. f, we get 


wo (XY eo fx) dx = (i f c 2 x e™ f (x) de 


te0 


2 oh = (ir [J 7, 8 FO) ae] 


ar Jr. x'f(x)dx=i’ EX) =i' p’ 


vee) Fa hada 
t=0 i=0 


The theorems, viz., 6-17,6-18 and 6-19 on m.g.f. can be easily extended to the 
characteristic functions as given below. 
Theorem 6:22. oc (0) = x (ct), c, being a constant. 
Theorem 6-23. /[fX, and X2 are independent random variables, then 
Oxi =ox ( ox, -{*) 
More, generally for independent random variables X, ; i= 1,2, ..,n, we have 
Oxi + x24. Xe O= Om O bx) .--Oxn (OO 
Important Remark, Converse of (*) is not true, ie., 
ox, +x2 (0) = $x, () ox. (0) does not imply that X, and X2 are independent. 
For cxample, let X bea oe Cauchy variate with p.d.f. 


f@)= eee — 0 <xX<o0o 
Then dn (Dee (c.f. Chapter 8) 
Let X2=X, ie., P (X; = X2) = }.. wo **) 


Then ox, () =e" 
Now = $x) +x2 (¢) = 2x; (0) = ox, (20) = & 


= bx, (2) ox, (0) 
i.e, (#) is satisfied but obviously X, and X2 are not independent, because of (**). 


In fact, (*) will hold even if we take X, = aX and X2= 6X, a and b being real 
numbers so that X, and X; are connected by the relation : 
“ =X= Xa => ax, = bX le 
As another example let us consider the joint p.d.f. of two random variables X 
and Y given by 


f@y= zal +3 ?- Y)1; IxIl<a, lyls<a, a>0 


= Fy clean 


24e| 
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Then the margina! p.d.f.’s of X and Y are given by 
ae a 
= =—: < 
8 (x) jo. fy dy=57 3 Ix1Sa 


(on simplification) 
hon=J% fa naeS—; lyse 
Then 
dx (1) = = e! g(a) dr=5- hae ei ar 
tee at sin at 
-  Qait sat 
Similarly by (= Sint 
sin ae 
ox r= (sin J & 


The p.d.f. & (z) of the random an Z=X+Y is given by the convolution 
of p.d.f.’s of X° and Y, viz., 


k@= f(u,2—u) du 
l 2 2 
=73) [1+ u(z—u) {u —(z—ux) 1 du 
=a) (1 + 327u? — 22u’ = 2u) du, 


the limits of integration for u being in terms of z and are given by (leftas an exercise 
to the reader) 


~asusz+a;u<sd0 
z—-asusa;u>O 


and 
Thus 
= ey er eee i ~2aszs0 
*k (2) = re (14324 ~ 2eu? ~ au) du = "2? 50.< 2520 
0, elsewhere 
Now 


2 ! = 
Over () = 62 () = Jf ne o* k (z) dz 


(). 2a +z +Z oits a3 z gil 
= : —_——_ d 


N 
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=fo (e+e) [Aa | a 


[Changing z to -z in the first integral] 


_1 (2a 

=F (2a — z) cos tz dz 

_ 2-208 2at _ 1 —cos 2at 

4a‘t 2a’? (on simplification) 

_f sin at 

| at 

= dy (0). or (0) [From (*)} 
But g(x). h(y) #f (x, y) 
= X and Y are not independent. 


However 
xi. x2 (ta, 2) = E (e879) = ox, (0) 0 
implies that X, and X2 are independent. 
| (For proof. see Theorem 6-28) 

Theorem 6:24. Effect of Change of Origin and Scale on Characteristic 

Function, if U == 7: 
bu (1) = eo" bx (¢/h) 

In particular if, we take a = E (X) = pt (say) and k= 0, =6 then the characteristic 
function of the standard Variate 


,aand h being constants, then 


: ye AS Sd 
Ox ‘Co 
is given by oz (t)=e’? ox (t/o) (6-66) 


Definition. A random variable X is said to be a Lattice variable or be lattice 
distributed, if for some h>0, 
| X isan imeger |=1, " 


his called a mesh. ; 


Theorem 6-25. Jf| dx (s) |= lfor some s #0, then'for some ie a,X-aisa 
Lattice variable with mesh h=2n/\s\. 

Proof. Consider any fixed ¢. We can write 

ox (0) = | or (0) | e sat , (a dependent on 1), since any complex number z can be 
written as z =|z|e . 

: lox (0 1= 6 be = Oa 0 

= E (cos¢ (XK -—a)+isin« (X-a)]=E [cost (X—a)] 
since left-hand side being real, we must have E [sin:t (X — a)}] =0. 
; 1 — [x () {= E (1 —cos ¢(X — @)] (*) 


’ 
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If | dy (s) |= 1, 5 #0 then for some a dependent ons, we have from (+) 
E[{1-—coss(X-—a)]=0 we(**) 

But since | — cos s (X —a) is anon-negative random variable, (+*) 

= P[l=coss(X-—a)=Qj=] 

=> P [cos s (X — a).= lJ=1 


=> P [s (X¥ — a) =2nr] = 1 
2nn 
=> P| (xa) = 228 |= 1, for some n=0, | ey ere 


Thus (X — a) is a Lattice variable with mesh h = = : 
6-12-3. Necessary and Sufficient Conditions for a Function  (f) to be 


Characteristic Function. Properties (i) to (iv) in § 6-12-1.arc merely the necessary 
conditions for a function 6 (©) to be the characteristic function of an r.v. X. Thus if 
a function 6 (t) does not satisfy any one of these four conditions, it cannot be the 
characteristic function of an r.v. X. For example, the function 
9 ()=log (1 +9), 
cannot be the c.f. of r.v.X since $ (0) = log 1=0 #1. 
These conditions are, however, not sufficient. lt has been shown (c.f. Methods _ 
of Mathematical Staustics by H: ane) that if © (0) is near ¢ = 0.of the form, 
$()=140(¢7+8),5>0 »-(*) 


where 0(¢) divided by ¢’ tends to zero as «0, then $(f) cannot be the 
charactéristic f UNCON unless it is identically equal to one. Thus, the functions 
ea didaiel =1+0(¢*) 
(ti) $= a= 1400 ‘) 


being of the form an are not characteristic functions, though both satisfy all the 
necessary conditions. 
We give below a setof sufficient but not neccesary Conditions, duc to Polya for 
a functiori (0) to be the characteristic function : 
(2 is a characteristic funcuon if 
' (1) $(0)=1, 
(2) $=6C9 
(3) 9 (t) tS continuous 
(4) (© is convex for ¢>0, i.e., for 4, 12 > 0, 
26 (5 (a + 2} S O(n) + 0 (n) 
(5) lim $ (0) =0: 
{— oo 
Hence by Polya’s conditions the functions e~'*'! and [1+|¢{]°' are charac- 
teristic functions. However, Polya’s conditions are only sufficient arid not neces- 
‘sary for a characteristic function. For example, -if X ~ N'(L, 0°), 
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‘: 22 
g(QeeltBrr eo” [cf § 8-5] 
and o(-N #9. 
Various necessary and sufficient conditions are known, the simplest seems to 
be the following, due to Cramer. 
"In order that a given, bounded and continuous function  (¢) should be the 
characteristic function of a distribution, it is necessary and sufficient that 
 (O) = 1 and that the function 


6 (x, A) = rae be-wel- On dy 


is real and non-negative for all real x and all A > 0. 
6-12-4. Multi-variate Characteristic Function. Then 


Xi hh 
X=| “71 and =| 2}, P real 
X, la 
be z x 1 column vectors. Then characteristic function of X is defined as 
dx (1) = E (e'%) = E [eh 2+ + tay ...(6-67) 
We may also write it as . 
ox, .x2, oe Xq (t, ho, veg tn) or ox (fh, £2, soos t,) 
Some Properties. 


(i) ox (0,0, iQ) = l 

(ii) hxO=bO 

(itt) IoxO| <1 

(iv) . dx (¢) is uniformly: continuous in n-dimensional Euclidian space. 
(v) Iffx(-) is p.d.f. of X, 


o 4s 


ox (0) = | oe | Sx (x1, X2, sony Xa) C8 PH dx, dx2... dXn 


—co —o¢ 


(vi) x = — (¢) for all t, then X and Y have the same distribuiton. 


(vii) If i i | Ox (41, 2, . . ts) | dt, dlz.. . dt, < oo, 


then X is absolutely continuous an hae a uniformly continuous p.d. f. 
thy X2s o0es L455 dy (t1, bay soy ta) At di, dle 
fi 1, %2 Xa) = on J je ox (hi, b2 ) dt, atz . 
(viii) The random variables X,, Xs. ..., X, are (mutually) independent iff 
Ox1.X2, Xe (Lis f2y ney la) = Oxi (Lt) Ox, (le) --- Oe (fe) 


Remark. Multivariate Moment Generating'Function. Similarly, the m.g.f. 
of vector X = (X1,X2, .., Xa) is given by : _— 
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My (t) =E (e"’*) a 2 (e™! + 62X2 +... tnXn) (668) 
We may also write : 

Mx (1) = May, xo, Xyllay fy oony bn) = E (eM #2 * tiny 
In particular, for two variates X, and Xr 


Mx (1) = Mx, xp(th, 2) = E (e142) = y ya 8B OKXs) 


ls! 
fa0'ga0 ...(6-69) 
provided itexists for — hy < 4 < h, and — h2 < 2 < hy , where h, and hz are positive. 
Mx,,x2 (4, 0) = E (™") = Mx, () ..(6-69a) 
Mx,,x2 (0, ta) = E (e?) = Mx, (t) ,..(6-69b) 
If M (t,, ¢2) exists, the moments of all orders of X and Y exist and are given by: 
| 4 r : 
E (X= rus _9 M(O, 0) (6°70) 
0 t t=t,=0 Ot." 
hain r 
E(X,")= oi ot Zo > 0.0 ...(6-70a) 
f if =f2=0 y h 
E(X,"X2)= go) 0'T*M (0,0) ...(6°70b) 
Ot 0b, eG d4'0n° 


Cumulant gencrating function of X = (Xi, X2)’ is given by : 
Krag (4, 2) = log Mx, x, (G, &). (6-71) 


Example 6-42. For a distribution, the cumulants are given by 
K=n[(r—-1)!), 2>0 

Find the characteristicfunction. (Delhi Univ. B.Sc. (Stat. Hons.), 1990) 

Solution. The cumulant generating | function K (0), if it exists, is given by 


K(O= y o cD we n{(r -1){}= > wo 


r=1 r= 


"feat a  Jentegt-in 


=~ n log (1 — it) =log (1 — it) " 
Also we have 
K (0) = log 6 (= log (1 - it)“ 
o()=(1—it) ” 
Remark. This is the characteristic function of the gamma distribution : (cf. 
§ 8-3-1) 
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—t yn- 
f(x) = TW) ;sn>0,0<x<o~, 


Example 6°43. The moments about origin of a distribution are given by 


: _T(vtr 
Br SPW) 


Find the characteristic function. 


(Madurai Kamaraj Univ. B.Sc., 1990) 
-Solution. We have 


oy =r Oy => am. roe 


yy mt, (Vv+r—-1)! 
ae ae (Vv-1)! 


>i (it)' vtr-!C = x (-1)".- YC. (it)’ 


[-- YC, = (-1)"." +r- IC, =» (-1)) ."C,=¥tr- IC] 


\oo 


o(t) = » YC, (-it)’ = (1 ~ it)” 


r=0 
Example 6°44, Show that 


; x2) x) 
eft =] + (elt — 1) xO) + (ef — 1) arte t (e" — 1)’. ae 


where XO =x (x-1)(-2)...(4-r +1). Hence show that 


Un’ = (D' > (t)], 20, where D = Fle eT and UU, is the r factorial 
moment, 


Solution. We have 


; (2) x) 
R.H.S.= 1 + (e# — 1) x) + (e# — 1)?. wt. +(e 4). Se. 


= 1+ (e#-1) GC) + (e* — 1)? C2) 


+ (e*— 1)9 (°C3) +... + (e# - 1) CC,) 
=[1+(e#-D)P=e*=LHS. 
By def. 


(= [" eft* f (x) dx 


a — SSS, — 
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00 , Ge. 
nfo, te - 1x +(e — stings daa 


=1+¢4~1) J? pdr & {=I foe 2 F(x) det... 
WEN anya. 
[Do @lx0=| PO] =f ¢(e) de= be! 
d(e) t=O 


where [i is the rth factorial moment. 
Theorem 6-26. (Inversion Theorem). Lemma. If (a-—h,a+h) is the 
continuity interval of the distribution function F (x), then 


F (a@+h)—-F (a—h)= lim ajr sont e (0) dt, 
T 3 © 


6 (0) being the characteristic function of the ladies 
Corollary. If (¢) is absolutely integrable over R’, i.e., if 


J le (dt <es, 


then the derivative of F (x) exists; which is bounded; continuous on R’ aid is given 
by 
2 a 29 oo - ux 
f@=FR)=FTI_ LE OOM, 6-12) 
for every x € R’. 
Proof. In the above lemma replacing a by x and on dividing by 2h, we have 


F(xt+h)—-F(x-h) _ sok li [2 ye eed 


2h OR roe Tht 
Il [oo sinhkt x 
On ht e o()dt 
— F(xth-FAa-h | ee o SINAt _ix 
cc) Sl? OF Se * (0) dt 
h+0 2h 2 p49 ht 
Since J* lo @ldt<e, 


the integrand on the rigtit hand side is bounded by an iritegrable function and hence 
by Dominat#d Convergence Theorem, we get 
lim F@+h)-FQ-h)_ 1 [fo ,. sin At : en ia 
jim Fesh—fenat Jr. im | we Gar 
By mean value theorem of differential calculus, we tiave’ 


lim FQ@+h)- aA h) _ =F’ (x) 


h->0 =f (x), 
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where f (.) is the p.d.f. Seen aee to > (¢). Thus 


f@=FW=LJ?_ &* oma, 
as desired. 
Remark. Consider the function ¥, defined ’by 


=| eo 6 (0 dt 


—C¢ 
Now if F (x) =f (x) ss then 


Fe _ 1 ~ ix a 
ae re zene Lf o(t) e 
= ae fa nol 
Hence — Ba 


ao O at all points where F (x) is continuous. In other words, if the 
prbability distribution is continuous 
Fe 


—-+0 asc 0 
2c 


If, however, the frequency function is discontinuous, i.e., distribution is 
discrete, consider onc point of discontinuity say, the frequency f; at x=x;. Then 


the contribution of x; to ¢ (0) isf;e"™ and hence its contributions to % will be 


ng 
| peter ™ at 
=-¢ 
c 
lim Zw lim + f J HOD at 


¢ —) 00 C0 


ae | eff G-2) 
gape i t (xj— x) f c 
_ 0 for x#x; 
~ | fj for x=4; 

’ Hence if F.2c ~»0 ata point, there is no discontinuity in the distribution 
function.at that point, but if it tends to a. positive number f;, the distribution is 
discontinuous at that point and the frequency isf;. This gives us a criterion whether 
a given characteristic function represents a continuous distribution or not. 

Theorem 6-27. Uniqueness Theorem of Characteristic Functions. 
Characteristic function uniquely determines the distribution, i.e., a necessary and 
sufficient condition for two distributions with p.df.’s f; (.) and f;(-) to be identical 
is that their charac.eristic functions 9, (0) and $2 (0) are idensical. 
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Proof. Iff; (.))=f2(.), then from the definition of characteristic function, 
we get 


oO=) ee fwd |  AWd=h(0 


Conversely if ; (¢) =2 (0; then from-corollary to Theorem 6-26, we-get 


A@=s J &*™ aode a) e™ wi) dt= hw 


Remark. This is one of the most fundamental theorems in the distribution 
theory. It implies that corresponding to a distribution there is only one characteristic 
function and corresponding to a given characteristic function, ‘there is only one 
distribution. This one to one correspondeace between characteristic functions and 
the p.df.’s énables us to identify the form of the.p.df. from that of characteristic 
function. 

Theorem 6:28. Necessary and sufficient condition for the random vari- 
ables X, and X2 40 be independents thattheir joint characteristic function is equal 
to the product of their individual characteristic functions, i.e., 

Ox,.x2 (Ais 2) = Ox, (4) Ox, (2) .--{*) 


Proof. (i) Condition is Necessary. If X, and X2 are independent then 'w c 
have to show that (*) holds. By def.:, 


bx, x, (tar 2) = E (eit + ha) = EB (elih, elt) 
= E (e"') E (e%)(.. X,, X2 are independent) 
(ii) Condition is sufficient. We have to show that if (*) holds, then X, and 
X, are independent. 
Let fx,.x, (x1, 2) be-the jointp:d.f. of X, and X2 and fj (xi) and f2 (x2) be the 
marginal p.d.f.’s of X; and: X; respectively. Then by definition (for continuotis 
r.v.’s), We get 


dx, (4) = ef (ny) dn 


bx (t)= fe!” f(a) de 


x, (41) Ox, (42) -| Ls a fi (a) dx, | | | elf2% f2 (x2) es 


= f i eit xy +12 x2) i, (x1) fh (x2) dx, dx, 
-0© -© wo(* +) . 
by Fubini’s theorem, since the integrand is bounded by an integrable function. 
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Also by defi 
bx,, x, (i, 2) = J J e' (x1 + f2%2) f (x1, X2) dx de 


If (*) holds, we get from (**) . 


dxx(had= fo J elt 2% fx) fy Ge) dey der 


Hence by uniqueness theorem of characteristic functions, we get 

fm) =fi (a) f Oa), 
which; implies that X, and X,_ are. independent. 

Remarks, 1. For discrete r.v.’s, the result is established by using, summation 
instead of integration. 

2. The result can be generalised to the case of-more, than two -variables. 

Necessary and sufficient condition for the mutual independence of random 
variables X;, (¢= 1, 2, 3, .... 2) is that 

Dx), Xa, eXu (ly £25 2005 bn) = Ox, (th) Dixy (fa)... Ox, (la) 

3. In terms of moment generating functions, the necessary and sufficient 

condition for the r.v.’s Xi, X2,...,X, to be mutually independent is that. 
My Xo, tn (hy bay ots ba) = Mx, (4) May (2) «2. Mx, (le) 
provided m.g.f.’s exist. 

Theorem 6:29. Hally-Bray Theorem. [If the sequence of distribution 
functions | F, (x) } converges to the distribution function F (x) at all the points of 
continuity of the latter and g(x) is bounded continuous function over the line 
Ri (— 09, e), then 


Tim J g(x) dF.) = | 9 (x) dF (a) w(*) 


Ao _ ek 


Corollary. If F.(x)— F(x), then the corresponding sequence of charac- 
teristic functions ¢, (4) of Fy (x) converges to the characteristic function 6 (4) of 
F at every point ‘/’. peor ; 

Proof. cos wx and: sin «& are continuous and bounded functions of x 
for all ¢ and hence from (*), we get 


lim J COs ix dF .(x)= | cos x dF (x) 


n <> 00 


and lim | sine dF,(x)= | sine dF (x) 
Rm-veo _ L =e 
*. lim J (cos (x+i sin ix) dF, (x)= J (cos (x+ isin &). 
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» lim J e*aR@= | e*aF() 


=> o.() > O@ an ro 


Theorem 6:30. Continuity Theorem for Characteristic Functions. 
For a sequence of distribution functions { F, (x)} with the corresponding sequence 
of characteristic functions {, (#)}, a necéssary and sufficient condition that 
F, (x) > F (x) at all points of continuity of F is that for every real 1, 
6, (0) >> (t), which is continuous at r= 0 and (1) is the characteristic function 
corresponding to F. 
Example 6-45. Let F, (x) be the distribution function defined by 
F,(x)=0 for xS-—n 
X+n 
2n 
= 1 forx2n 
Is the limit F, (x) a distribution function ? If not, why? 


for -n<x<n 


n 
Solution. 4 (= | e'* ~ dx (°° falx) = Fr (x) 
—fR 
_ 1 oft _ gt in |- sin nt 
2n it nt 
6()= lim 6.()= lim S27 
n—- 0 n— co 
. t 1 if r=0 
~ | 0 if «#0 
i.e., (¢) is discontinuous at ¢ = 0: 
and lim Fa (x)= + 
n => 00 2 
Hence F (x) is nota distribution function. 


Example 6-46. Find the distribution for which characteristic function is 
. 22 
@) $(=(q+pe)", () OM=E'°” 


n ] 
Solution. (a) o(=(qtrpeY= L "Cp qe" 
j=0 
We have 


© e . . } n te 7 e,¢ 
= | e™ gaya f ei LCG pd Je tds 
j=0 


—C¢ —C¢ 
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c 


"C jp gn -j r it “ral 
[ } 


;=0 
-Cc 
(4) If x #/, 
n evit (xj) [° ic (x~j) _ eit (xX -j) 
x Dy nC. n—} jo oe "C; -j k 4 
ee Pa I-i-(x—-J)l_¢ j=0 ea i (x-j) 
“ 2i sin {c (x 
= nC’, ; n-j Aisin te =i 
j=0 jP’q (x-J) 
lim 7, 7.0 Vx 
cre ZC 


Hence there is no discontinuity in the distribution function when x #/. 
(ii) If x =j, 


c 
un 


§.= 2 ci gr -4 | a =2¢ 2, "C; pi qh=J = 2c (q + p)" = 2c 
J=0 is 
Since Xe > | atx =j, the distribution function is discontinuous and its 
frequency is "C; p/ q"~/. 


Cc 


(b) Let & | enix - FOr gy 

: 

c c 
5.1 < | enix 308 ar <| e739" dt 

-C —¢ 

< | e739 = Xt 

0) 
lim 3 0 Vx. 
C > 00 


Hence the distribution function is continuous for all x. 


oo 


. 2 6? 
f@® -1 | enitt pW OP Ct 


Put to+e =E,ie., odt=d& 
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f= He ex | a gta 


f@eh ex eee Nom _ : ex ee — 00 < X< 00 
an “*P 20°| « ov 20? 2¢97|’ 
which is the p.d.f. of normal distribution. 

Example 6-47. Find the density function f (x) corresponding to the charac- 
teristic function defined as follows : 


_faetel, tls 
o0=| 0. ttl>1 


Hence 


[Delhi Univ, B.Sc. (Maths Hons.), 1989] 
Solution. a Inversion Theorem, the p.d.f. of X is given by : 


f@= ag! 6 (4) det 


“25 J 
- On: 


Tmo, 


1 
on ix rs rt ar 
(1+ 0) dt+ an e (1-0: dt* 


Now 


F 164 p= : l f ix 
—t —_ EEE =z, oe 
a (1+) dat= ~ ix (l+a9 l a sf dt 


x wlh~-w i-t 
| l «x 
=-— =! 
. x aye ©” 
Similarly, 
| =i 
l~)dt=—+—5 (EF -1 
Je (1-1 dt= =+ (ix) (e ) 
oe ee er | 
f@= “nr {e lie 3a 
tx’ 2? Pf x” 
EXERCISE 6 (c) 


1. Define m.g.f. of arandom variable. Hence or otherwise find the m.g.f. of: 


***for—1<72<0, [t|=—-¢ and for 0<¢<1, [t/=+¢ 
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(i) Y=aX+b, (ii) a . 


[Sri Venkat Univ. B.Sc., Sept. 1990; Kerala Univ. B-Sc., Sept. 1992) 


2. The random variable X takes the value n with probability 1/2", n = 
|, 2, 3, ... Find the moment generating function of X and hence find the mean 
and variance of X. 


3. Show that if X is mean of n independent random variables, then 


m0 =[Mx(‘)] 


4. (a) Define moments and moment generating function (m.g.f.) of a 
random variable X. If M (#) is the m.g.f. of a random variable X about the origin, 
show that the moment LL,’ is given by 


u,’ = fore [Baroda Univ. B.Sc., 1992] 
6=0 


(b) If 1,’ is the rth order moment about the origin and x; is the cumulant of 
jth order, prove that 
ober Ny 
5 Kj J = 1 nae, 
(c) If p,’ is the rth moment about the origin of a variable X and if u/ =r!, 


find the m.g.f. of X. 
5. (a) A random variable ‘X” has probability function 


pQ)=353x= Ne 2. Oy aes 


Find the M.G.F., mean and variance. 
(b) Show that the m.g.f. of r.v. X having the p.d.f. 
f=t,-l<x<2 
= 0, elsewhere, 
er! — e-f 
is M (t)= 31 °t #0 
=1,r=0 {Gujarat Univ. B.Sc., Oct. 1991] 
««) A random variable ‘X’ has the density function : 
] 
Ff) > te 
= 0 ,.elsewhere 
Obtain the moment generating function and hence the mean and variance. 


6. X is a random variable and p (x) = ab*, where a and 6 are positive, 
a+ b = | and x taking the values 0, I, 2, ... Find the moment generating 
function of X. Hence show that 


My =m, (2m, + 1) 


»N<x < | 
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m, and mz being the first two moments, 


7. Find the charactenstic function of the following distributions and their 
yarianccs : 


(i) dF (x)= ae ™ dx, (a>0,x>0) 

(ii) dF(xy=te7'*! dr, (-o<x<e) 
(tii) P(X =)) tineu=(*] Cr (O<p<l1,g=1-p,j=9, 1, 2, ..., n). 
8. Obtain the m.g.f. of the random variable X having p.d.f., 


x, forO<x<l 
joo=| 2-x, forl<x<2 
0, elsewhere 


Determine py’, He, Hs and py. 
i 1 ‘ 

Ans. -"- ). wy’=1, pe =7. 

9. (a) Define cumulants and obtain the first four cumulants in terms of cenral 
moments. 

(b) 1€ X isa variable with zero mean and cumulants «,, show that the first 
‘wo cumulants 4, and J, of X° are given by l; = Kz and [2 ='2 Ky" + Kk. 

10. Show that the rth cumulant for the distribution 

f(@)=ce™, where c is positive and Q <x<o0o 


iS | ¢-v! 
c 


11. If X is a random variable with cumulants x,; r= 1, 2, .... Find the 
cumulants of 
(i) cX, (ii) c +X, where c is aconstant 
12. (a) Define the characteristic function of a random variable. Show that the 
characteristic function of the sum of two independent variables is equal to the 
product of their characteristic functions. 
(b) If X is arandom variable having cumulants k, ; 7 = 1, 2, ... given by 
K=(r-1)! pa’; p>0,a>0, 
find the characteristic function of X. 
(c) Prove that the characteristic function of a raniiom variable X is real if an? 
only if X has a symmetric distribution about 0. 
13. Define $(¢), the characteristic function of a random variable. Find the 
characteristic function of a random variable X defined.as follows : 
0, x<0 
f@)=; 1, 0sx<1 
0, x>1 
Ans, ec" /it 
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14. For the joint distribution of two-dimensional random variable (X, Y) 
given by 


1 
f®N=75 [i +zy@?- YY lzlsa; ly|Sa,a>0 
=(Q, elsewhere , 
show that the characteristic function of X+Y is equal to the product of the 
characterisuc functions of X and Y. Comment on the result. 
Hint. See remark to Theorem 6-22, page 6-81. 


15. Let K (th, t2) =log; M (h, 2) where M (h, t2) is the m.g.f. of X and Y, 
Show that: 


2 
ok 0.9) - =E(X): TRO. = VerX: of. 


an Univ. B.A.(Stat. Hons.), Spl. Course 1987] 


= Cov (X, Y) 


OBJECTIVE TYPE QUESTIONS 


1. Comment on the following, giving examples, if possible : 
(i) M.g.f. ofar.v. always exist. 
(ii) Characteristic function of a r.v. always exists. 
(iti) M.g.f. is not affected by change of origin or/and scale. 
(iv) dx.r (0) = ox (0). by (©) implies X andY are independent. 
(v) dx (t) = by (1) implies X and Y have the same distribuuon. 
(vi) >(0)=1 and |o(4)|S1. 
(vit) Variance of ar.v. is 5 and its mean does not exist. 
(ix) It is passible to find ar.v. whose first k moments exist but (k + 1)" 
moment docs not exist. 
(x) Ifar.v. X has asymmetrical distribution about origin then 
(a) ox (t) is even valued function of t. 
(b) $x (t) is complex valued function of ¢. 
II; (a) Can the following be the characteristic functions of any distribution ? 
Give reasons. 
(i) log(1+0, (i) expe‘), Gi) +e). 
(b) Prove that $ (0) = exp (~t), cannot be a characteristic function unless 
= 2., 
IIE. State the relations, if any, between the following : 
(i) E(X") and dx (0). 
(it) My (0) and My- q(t), abeing constant. 
(tii) Mx() and My-a)/n(t), a and h being constants. 
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(iv) dx (t) and p.d.f. of X. 


(v) pt, and jt’. 
(vi) First four cumulants in terms of first four moments about mean. 


IV. Let Xi,X2,...,X_. be a i.i.d. (independent and identically distributed) 
rv.’s with m.g.f. M(t). Then prove that 
Mx (0.=[M (t/n)]’, 


i n 
x= x X;/n 
t=1 


V. If X,, X2,...,X. are independent r.v.’s then prove that 


where 


M ()}= TI My, (c:0). 
i= 1 


LY 6 Xi i= 
i=1 


VI. Fill in the blanks : 


(i) If J | dx () |dt < 0, then p.d.f. of X is given by ... 
(ii) X,; and Xz are independent if and only if... . 
(Give result in terms of characteristic functions.) 
(iti) If X; and Xz are independent then 
dx, -x, (N=... 
(iv) @ (0) is ... defined and is ... forall ¢ in (— ©, 0). 
VII. Examine critically the following statements : 
(a) Two distributions having the same set of moments are identical. 
(6) The characteristic function of acertain non-degenerate distribution is 


3 
=| 


Cs 

6-13. Chebychev’s Inequality. The role of standard deviation as a 
parameter to characterise variance is precisely interpreted by means of the well 
known Chebychev’s inequality. The theorem discovered in 1853 was later on 


discussed in 1856 by Bienayme. 
Theorem 6-31. If X isa random variable with mean 1 and variance 0”, 


inen for any positive number k,we have _ , 
P{IX-pleko}sa7e2 : (6-73 ) 4 
or = P{X-pl<ko}21-G/e) (6-73 a) 
Proof. Case (i). X is a continuous r.v. By def., 
o =ox = E[X-E(X)P=E[X-p]}* 


= | (x-w)*f@) de, where f(x) is p.d.f. of X. 
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U-ko +ko eo 
= | (x — 11)? f(x)dx + | (x — 1)? f (x) dx + | (x ~ )*f (x) dx 
co p-ko p+ko 
u-ko eo 
> | (x= p)2f (x) dx + | (e-Pef(Qxdr 
—0o pt+ko 


We know that : 


xsSsp-koandx2ptko & Ix-pl2ko va (*%) 
Substituting in (*), we get 
U-ko oo 
o2 >k2o2 | | f(x)dx+ jo de 
—00 pr+ko 
=k? o2 [P(X S$p—ko) +P (X 2p + ko)] [From (**) 
=k? 62, P(IX -—p1l2ko) [From (**)] 
=> Pp ‘(l X ~pl2=ko) s IVk?, A Gia | 
which establishes (6-73) 
Also since 


P(\X-pizko}+P{lX-pl<ko} =], we get 

P{I\X-pl<ko)=1-P{IX-pl2ko) 21 - {1/k?} [From (***)| 
which establishes (6-73a). 

Case (ii). In case of discrete random variable, the proof folldws exactly 
similarly on replacing integration by summation. 

Remark. In particular, if we take ko =c > 0, then (*) and (**) give 
respectively 


G2 
p{|xX-plze|s S and P {|X - ni <c}21-% 
= P{|x-EQ|2c} <4 
c (6-73 b) 


and P{|x-E(%)|<c} 21-5" 


613-1. Generalised Form of Bienayme~Chebychev’s Inequality. Let 
g (X) be a non-negative function of a random variable X. Then for every k > 0, 


we have 
P(g (Xyzk}s Set 674) 


[Bangalore Univ. B.Sc., 1992] 
Proof. Here we shall prove the theorem for continuous random variable. 
The proof can be adapted to the case of discrete random variable on replacing 
integration by summation over the given range of the variable. 
Let S be the set of all X where g (X) 2k, i.e, 
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S={x: g(x) 2k) 


then dF (x) = P(X € S)=P [g (X) 24), sc(*) 


S 
where F(x) is the distribution function of X. 


Now E [g (xX) ] - | ¢ (2) dF (x) 2 | g @) dF) 
98 S 
2 k.P[g (X) 24k) [- on S, g (X) 2k and using (*)] 


=>  Pig(2m sel 


Remarks 1. If we take g (X) = {X — E (X)}? = {X - 1}? and replace k by 
2.02 in (6:74), we get 


P {(X - yy? > k? oh < EX =p? _ o? 


K2g2  ~ k2 G27 2 
~ P{|x-pl2ko} sie, / (6-744) 
which is Chebychev’s inequality. 


2. Markov’s Inequality. Taking g (X) =1X | in (6-74) we get, for any 
k>0 


PUIXIzk] s=*". (6-75) 
which is Markov’s inequality. 
Rather, taking g (X) =| X I’ ane replacing k by k’ in (6-74), we get a more 
generalised form of Markov’s inequality, viz., 


; 
P(IXV 2k] wt ...(6°75a) 


3. If we assume the existence of only second-order moments of X, then we 
cannot do better than Chebychev’s inequality (6-73). However, we can 
sometimes improve upon the results of Chebychev’s inequality if we assume the 
existence of higher order moments. We give below (without proof) one such 
inequality which assumes the existence of moments of 4th order. 


Theorem 6°31la. E | X I4 < 0, E (X) =0 and E (X*) = 6? 


—_ot 
P{\|Xl>ko} See eT .. (6-76) 


If X ~ U (0, 1], (c.f, Chapter 8], with p.d.f. p (x) = 1,0<x< 1 and =0, 
otherwise, then 
E(X") = lr +1); (r= 1, 2, 3, 4) 
E(X) = 1/2, E (X?) = 1/3, E (X°) = 1/4, E (X4) = 1/5 ..(*) 
Var (X) = E (X2) -[E (X)]? = 1/12 
ly = E(X -p)* = E(X ~7)4 = 1/80 
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3 [On using binomial expansion with (*)| 
Chebychev’s inequality (6: 73a) with k = 2 gives : 


i | 
Pll -3 =. 0-75 
With k = 2 , (6-76) gives: 


x-4 : LZ0 — 44 _4 
P| | yo+lo-le 49 
4 45 
= P| |x-4 es ~o = nO, 


which-is a much better lower bound than the lower bound given by Chebvehev’s 
inequality. 

6-14. Convergence in probability. We shall now introduce a new concep, 
of convergenee, viz., convergence in probability or stochastic convergence which 
is defined as follows: 


A sequence of random variables Xj, X2, ...,.Xn,... iS said tO converge in 
probability to a constant a, if for any € > 0, 


lim : 
n= P(|Xn—a|<e)=1 ...(6°77) 
or its equivalent 


lim 
naw PUAn—alze)=0 (6-770) 
and we write 


P — 
ee spain = a8 (6-776) 


If there exists a random variable XY such that X, ~X P aasn— ©, then 


we Say that the given sequence of random variables converges in probability to the 
raudom variable X. 


Remark. 1. _ If a sequence of constants a, > aasn— ©, then regarding 
the constant as a random variable having a one-point distribution at that point, we 


can say thatas a, /,aasn—>o., 


2. Although the concept of convergence in probability is basically different 
from that of ordinary convergence of sequence of numbers, it can be easily verified 
that the following simple rules hold for convergence in probability as well. 


rx,20a and Y, 4, B as n —> o, then ‘ 
(i) Xn+Yn 2, a8 as n> © 

P 

er ad 


ap as n> 
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(tit) An P San co, provided B=0. 
Y B 
614-1. (Chebychev’s Theorem). As animmediate consequence of Cheby 
-chev’s inequality, we have the following theorem and convergence in probability. 
"If X\, X2, ...,Xn tS a Sequence of random variables and tf mean p, and 
standard deviation Gn of Xn exists for all n and if On -—asn— ©, then 


Xn — Un P 0 asn>~x 
Proof. We know, for any ¢ > 0 


2 
P| | Xn — Mn ze} s on _,Qasn—>oa 
a 


Hence Xn—WUn P 0 asn—>o 


6°15. Weak Law of Large Numbers. Let X1, X2, ..., X, be a sequence of 
random variables and \4, 2, ..., Un be their respective expectations and let 
Bn = Var (X1 + X2+ ... + Xn) < 


Then P X,+X2+ vee Xn Mitit... +n ellie 
n n ...(6°78) 
for alln > no, where & and 1 are arburary small positive numbers, provided 
lim Br 
— »( 
naxx n° 


Proof. Using Chebychev’s Inequality (6-735), to the random variable 
(X, +X2+...+X,)/n, we get for any ¢ > 0, 


X eee n eee n n 
Pp 1+X2+... +X, 7 paitaet +X te ore 
n n n2e€~ 
XD+... n 
Sy (enti. | ee ne a 
n he no 
oy P| Xi, X2t...+Xn Mit Matin + oe fat — 
n n n° e° 


So far, nothing is assumed about the behaviour of B, for indefinitely increas- 


B 
ing values of nm. Since € is arbitrary, we assume —s —()}, as nm becomes 
n'é 


indefinitely large. Thus, having chosen two arbitary small positive numbers 
€ and 1), number 7 can be found so that the inequality 
Bn 
ne 
will hold for 1 > no. Consequently, we shall have 
P| | Xi +X+...tXn  WL+ vet... + on 
n n 


<1, 


ce} etny 
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for all 2 > no (e, )). 
This conclusion leads to the following important result, known as the 
(Weak) Law of Large Numbers: 


"With the probability approaching unity or certainty as near as we please, 
we may expect that the arithmetic mean of values actually assumed by n random 
vartables will differ from the arithmetic mean of thetr expectations by less than any 
given number, however small, provided the number of variables can be taken 
sufficently large and provided the condition 


n 
z>7O0an> 
n 


is fulfilled”. 


Remarks. 1. Weak law of large numbers can also be stated as follows: 


Xn Li Bn 
4 Bn 
provided —> > 0 as n> &, symbols having their usual meanings. 
n 


2. For the existence of the law we assume the following conditions: 
(i) E (X;) exists for all i, (ii) B, = Var (X, + X2+ ... + X,) exists, and 
(iii) B,/n? +0 as n> &. 
Condition () is necessary, without it the law itself cannot be stated. But the 
conditions (i) and (iii) are not necessary, (ti) is however a sufficient condition. 
‘3. If the variables X1, Xn, ..., Xn are independent and identically distributed, 
t.e., if E (X;) = p (Say), and Var (X;) = o* (say) for alls = 1, 2,...,” then 


B, = Var (X1+X2+ ...+Xn)= & Var (Xi) 
jel 


the convariance terms vanish, since variables are independent. 


B,, Han o° s(*) 
° B, ° 
Hence ; a aa a ae (o’/n) =0 
n os 


Thus, the law of large number holds for the sequence | Xn} of i.i.d.r.v.’s and we 
get 


Pt read 


cef>tonvnam 


n 
ie, P{|X-pw|<e}rlasn>” 
=> P{| X,-w|2ze}7~0asn>o 


where X, is the mean of the nm random variables .X, X2, ...,.Xn- This result implies 
that_¥, converges in probability to p, ie., | 


XY, Au 
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Note. If X, is the mean of n i.i.d. r.v.’s X,, X2, ..., X, with 


E (X;) =; Var (X) = 02, then (On using (*)] 
E(X,) = 1 ;and Var (X,) = Var (2 x;/ n) ...(6-80) 
Theorem 6°32. If the variables are uniformly bounded then the condition, 
lim B, = 0 
n— co n2 rae 


js necessary as well as sufficient for WLLN to hold. ; 
Proof. Let €&; = X; — a;, where E (X,) = a; ; then E (€,;) = 0, (i= 1, 2, ..., n). 
Since X;’s are uniformly bounded, there exists a positive number c < oo 
such that | E;1<c. 


If p =P[1E, +&) +... +& lsne] 
then 1 -p =P[IE, +8) +... +8,1>ne] 
Let U, =F, +E. +... + &,, 


then E(U,) = 2 E(G)=0 
and Var(U,) =E (Us) =B, (say). 


=> B, = | U; dF (U,), where F (U,) is d.f. of U,, 


= | U2 dF + | Us dF 


Ur< n? ¢? U?> n? ¢? 


<n? €? | dF +n? c? | dF 
lu, l<ne lu, l>ne 
<n? 2? p +n? c? (1 -p) 
Bn 
n2 
If the law of large numbers holds, 
1-p =P[l1E,+& +... +&,l>ne]o0asno om. 
Hence as n + », (1 —p) — 0, and 


Se*p+c?(1—-p) 


B ry e e e 2 
— <e2p +c? 8, € and 8 being arbitrarily small positive numbers. 
a 


B 
Herice mr > Oasn— ~, 


6°15°-1. Bernoulli’s Law of Large Numbers. Let there be x trials of an 
event, each trial resulting in a success or failure. If X is the number of successes 
inn trials with constant probability p of success for each trial, then E (X) = np 
and Var (X) = npq, q = 1-p. The variable X/n represents the proportion of 
successes or the relative frequency of successes, and 


6-104 Fundamentals of Mathematical Statistics 


E(X/n) =+E (X) =p, and Var (X/n) == Var (X) = 74 


Then 


=> P mf = 
n? ...(681 a) 


tor any assigned e€>0. This implied that (¥/n) converges in probability to 
pasn>o, 


Proof. Applying Chebychev’s Inequality [Form (6-73 b)|.to the variable 
X/n, we get for any € > 0; 


arate 
|” n € 


X 
male 


since the maximum value of pq is at p=q=1/2 te, max (p = 1/4 ie, 
pqs 1/4. 


Since € is arbitrary, we get 


xX 
ale 
= P| 


6-15-2. Morkoffs Theorem. The law of large numbers holds if for some 
5 >0, all the mathematical expectations 


E(|Xi|'**);i=1, 2, ... ,..(6°82) 
exist and are bounded. 

6-153. Khinchin’s Theorem. /fX;’s are identically and independently 
distributed random variables, the only condition necessary for the law of large 
numbers to hold ts that E (Xj) ; t= 1,2, ... should exist. 

Theorem 6-33. Let Xn be any sequence of r.v.’s. Write : 

Yn =[Sna—E (Sn) |/n where Sp =X, + X2+ ...+Xn. 


A necessary and sufficient condition for the sequence {Xn} to satisfy the 
W.L.L.N. ts that 


yY 
E “zt70 ane. 
1+Y, ...(6°83) 


<el—olasn-o 
..(6°81) 


ee} 0 as n— © 


cebon as n—> © 


=e} 0 as n-> © 


ee 
a 7P 


cet as n— © 
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Proof. If Part: Let us assume that (6°83) holds. We. shall prove | Xn} 
satisfies W.L.L.N. 


For real numbers a, b; a2b>0 we have: 
a2z2b => atabz2zb+ab ...(*) 
Let us define the event A ={|Y,|2e}. 


2 
wEAy => Y,| 2e°>0 


2c = 


”. Taking a = y2 and b=e*in (*), we define another-event B as follows: 


y’ 1+e 4 y? s ¢? 
1+Y ee 14Y° 142 


SinceewEA => WweEB, — => P(A) <P(B) 


7 
aA ba 
ai 


E| Y,/(1 + Y3)} 
2S 
¢2/ (1 + e’) 
[By Markov’s Inequality (6-75)| 


B, = 


—-Q asn>o [By assumption (6-83)] 
| Yn a eee as n—> © 
= lim P | Sn -E (Sn) > 0 
nw © n 


=> WI.LN holds for the sequence { X,} of r.v.’s. 


Conversely, if Xn satisfies WLLN, we shall establish (6-83). Let us assume 
that_X;’s are continuous and let Y, have p.d.f. f, (y). Then 


rien] Laap oe 


-—-@ 


CY 


(S Jie hn (y) dy 


where -A={|Y] ze} and Af={|y|<e!} 
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Y.2 2 12 
F(A) s Ji nods| fu 0) dy |. Peay < Vand Tt < 


A Ac 
< Pia) +e? | ST, (y) dy (.- On AS: lyl<e) 
Ac 
= P(A)+e€2. P(A‘) 
< P (A) + &? ) (-- P(A°) <1) 
Y 2 
=> E F ze a <P {lY,l2e} +? .(**) 


But since ({X,,} satisfies WLLN, we have : 
lim P[lY,l2e]—73 0 


n— co 


and since €? is arbitrarily small positive number, we get on taking limits in (**), 


Corollary. Let X;, X>, ..., X,, be sequence of independent r.v.’s such that 
Var (X;) < c fori= 1,2, ... and 


a 
Be (= Xi) va (S,) 
> = Oo asin 9 © 
n n n 
Then WLLN holds. 
Proof. We have : 
Vie < y2 me ke ca) Sy 
ee Hn 
Y,,* 1 VarS, B 
: Ss _ 2-—_“--! 
= E |; +Y,2 J° n2 E [Sn -E Gr)I n? n2 
lim E Yn slim 2-50 (By assumption) 
1} —> oo ] + Y,? i —- oo n2 


Hence by the above theorem WLLN holds for the sequence (Xn} of r.v.’s. 


Remark. The result of Theorem 6-33 holds even if E(X;) does not exist. In 
this case we simply define Y,, = [S,/n] rather than [S, — E (S,)]/n. 


Example 6°48. A symmetric die is thrown 600 times.Find the lower hound 
for the probability of getting 80 to 120 sixes. 


Solution. Let S be total number of successes. 
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Then 
E (S) = np = 600 x = = 100 
1 5 500 
V(S) = pq = 600 xe x © = 6 


Using Chebychev’s inequality, we get 
P[|S-E(S)|<ko]z1 “3 


> P| | S100 | <kV50076 le1-7 
=> P| 100 -k V500/6 <S< 100 +k ¥5C6/6 21-3 
Taking ke Teme , we get 

1 19 


P (80s S s 120)= 1~7055(67500) ~ 24 


Example 6-49. Use Chebychev’s inequality to determine how many times 
a fair coin must be tossed in order that the probability will be at least 0-90 that the 
ratio of the observed number of heads to the number of tosses will lie between 
0-4 and 0-6. 


[Madras Univ. B.Sc (Stat.)Oct 1991; Delhi Univ. B.Sc. (Stat Hons.) 1989] 
Solution. As inthe proof of Bernoulli’s Law of Large Numbers, we get for 


anye>O0, 
PY me} = 
dn e° 


=> Pla ce fel = 
n 4n &- 


Since p=0-°5 (as the coin is unbaised) and we want the proportion of 
successes X/n to lie between 0-4 and 0-6, we have 


@ 


ae i 


2 oy ei 
"| 
Thus choosing ¢ = 0-1, we have 
1 1 
P ——— & 1 = 
4n (0-1)° 0-04 n 


Since we want this probability to be 0-9, we fix 


1 - ee eee 
004n 


=e 
7 7P 


<0 fat 
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1 
ay 0-10 = 
1 
ea n= O40x004 770 


Hence the required number of tosses is 250. 
Example 6-50. For geometric distribution p (x) =2™* ; x = 1, 2, 3, ... prove 
that Chebychev’s inequality gives 
P[| X-2| s ]>4 


while the actual probability is ©. [Nagpur Univ. B.Sc. (Stat.), 1989] 


; | eee 
Solution. E (xX) = erie rar ate 
=5(1+24 +347 +44? +...), (A= 1/2) 


-1(-aAy7e 
=>7(1 A)*=2 


= 5[1+444947+..J=2(1+A)(1-A) 2 =6 
[See Example 6-17] 
Var (X) = E (X’) - [E (X)P =6-4=2 
Using Chebychev’s inequality, we get 
P{|X-E@)|sko}>1-% 
With k=V 2, we get 
P{|X-2| sV2.V2}>1- -3= ; 
=> P{|X-2|s2}>$ 
And the actual probability is given by 
P {|X -2|s2}=P{0sX< 4} =P {X¥=1, 2,3 or 4} 


? 15 
7 gtls) +(3) #(5) ~ 16 
Example 6°51. Does there exist a variate X for which 


P[pe-2o0sXsy,+20]=06 ...(*) 
(Delhi Univ. B.Sc.(Maths Hons.) 1983} 
Solstion. We have : 
P({we-20sXsp,+2o0]=Pi[|X-p.|s20] 


1 
‘ 2 1-—=0-75 
4 (Using Chebychev’s Inequality) 
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Since lower bound for the probability is 0-75, there does not exist a r.v. X 
for which (*) holds. 
Example 6-52. (a) For the discrete variate with density 
fas Mey) +2 Lo) +5 hy &) 
evaluate P [|X -w.| 22 0]. (Delhi Univ. B.Sc.(Maths Hons.) 1989] 
(b) Compare this result with that obtained on using Chebychev’s inequality. 
Hint. (a) Here X has the probability distribution : 


x: -~1 0 1 » EQ)=~1x5+1x5=0 
p(x): 1/8 6/8 1/8 EX?=1xi+1xget 


Var (X)=E(X’*)-[E(X)P=1/4 => o,=1/2 
P([|X-p.l220,] = P[|X|21]) = 1-P(X|<1) 
=1-P[-1<X<1J=1-P%=0)=1/4 
(b) P[|X-p,|220,] s 2 (By Chebychev’s Inequality) 
In this case both results are same. 
Note. This example shows that, in general, Chebychev’s inequality cannot be 
improved. 
Example 6-53. Iwo unbiased dice are thrown. If X ts the sum of the 
numbers showing up, prove that 


Wiese! 
P(|X-7|23) 4 54 
Compare this with the actual probability. (Karnataka Univ. B.Sc., 1988) 


Solution. The probability distribution of the r.v. X (the sum of the numbrs 
on the two dice) is as given below : 


X Favourable cases (distinct) Probability 
(p) 
Z (1, 1) 1/36 
3 (1, 2), (2, 1) 2/36 
4 (1, 3), (3, 1), (2, 2) 3/36 
5 (1, 4), (4, 1), (2, 3), G, 2) 4/36 
6 (1, 5), (5, 1), (2, 4), (4, 2), G, 3) 5/36 
7 (1, 6), (6, 1), (2, 5), (5, 2), (3, 4), (4, 3) 6/36 
8 (2, 6), (6, 2), (3, 5), (S, 3), (4, 4) 5/36 
9 (3, 6), (6, 3), (4, 5),G, 4) 4/36 
10 (4, 6), (6, 4), (, 5) 3/36 
11 (5, 6), (6, 5) 2/36 
12 (6, 6) 1/36 
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E(X) == p.x 

x 

= 56 (2464 12 + 20 + 30 + 42 + 40 + 36 + 30 + 22 + 12) 

+ (252) =7 

36 7 
E (X’) = % p x 

x 


= 55 [4+ 18 + 48 + 100 + 180 + 294 + 320 + 324 + 300 


+ 242 + 144) 
1974 329 
56 (1974) = 30~C~*a 
22 
ee (x) == -(77 -> 
By Chebychev’s inequality, for < > 0, we no 
P(|X-pleks wane 
Ie 
2a _ 35 
P(|X-7|23 
= UNE TSS 9- “a (Taking k = 3) 
Actual Probability : 


P(|X-7|23)=1-P(|X-7|<3 
=1-P(4<X < 10) 
=1-[P(X =5)+P(X=6)+P(X=7) 
P(X=8)+P(X=9)| 
“1-2 (4 5655 ea wo— =o 
Example 6-54. /fX ts the number scored ina throw of a fair die, show that 
the Chebychev’s inequality gives 
P[|X-p|>2-5] < 0-47, 
where w ts the mean of X, while the actual probability is zero. 
[Kerala Univ. B.Sc., Oct. 1989] 
Solution. HereX isa random variable which takes the values 1,2....,6, each 
with probability 1/6. Hence 


7 

_1 _ | ae 
E(Xy=-(1+2+... +6) 55. ie) 
x 


ON 
Xx 
~ 


1a 7 PY 2,_, 6x7 13. 91 
E (X= 2 (10+ 2° +... +O) r ar 
Var (X) = E (X°) - IeEwP=s = = >> - 2.9167 


For k > 0, Chebvchew’s inequality vives 
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P[|X-BQO|>k] <3" 


Choosing k = 2°5, we get 
P{|X-u|>25]< 
The actual probability is given by 
p=P[|X-3-5|>2°5 ] 


=P|[X lies outside the limits (3-5 — 2-5, 3-5 + 2:5), i.e., (1,6)] 


But since Y is the numberon the dice when thrown, it cannot lie outside the 
limits of 1 and 6. 


p=p(9)=9 
Example 6-55. If the variable. X, assumes the value 2? ?'°8? with prob- 


ability 2°? ; p =1, 2, ..., examine if the law of large numbers holds in this case. 
Solution. Putting p =‘l, 2, 3,...,we get 
~ gl-2tog 1, 22-2 log 2 23-2t0g3, 
as the values of the variables X), X2, X3 ...respectively, and 
id 
3? 5? De 
their corresponding probabilities. Therefore, 


2°9167 
6-25 


= 0-47 


~ 


E (X1) = 217718) zy t2e7M8? gue eee 2? 
= ‘ p= 
7 4 
= 127/08? 
Let U=27'8? then 


log U = 2 log p log 2 = log p . log 2” = log 4°. log p = log p ® 


~ ft 1 1 
Edi) = 2 pees | * bined * Glog 
which is a convergent series. 
fo] 
- 2 — is convergent if and only if p> 1 
nol 


Therefore, the mathematical expectation of the variables X, X2, ..., exists. Thus'by 
Khinchin’s theorem, the law of large numbers holds in this case. 


Exmaple 6-56. Let Xi.X2,...,Xn be tid. variables with mean x and 
variance a” and as n > x, 


(X34+X34+...4+X2)/n 2 c, 
for some constant c; (0 sc s &). Find c. [Delhi Univ. B,Sc. (Stat, Hons.), 1989] 
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Solution. We are given E (X;)'= u; Var (Xi) = on fe a Ws 
». E (X7) = Var(X)) + [E (X) [PP = 07 + Ww (finite) 5 i = 1,2, .., 2. 
Since E (X7) is finite; bv Khinchines’s Theorem WLLN holds for tue se- 
quence - X7° of Lid. rv.’s so that 


(X7 +.X3 + et X5)/n PR 3), 2S. —> 00 


an » i) i _ 
=> (X7 + X34 ...+X;5)/n a Oo +U°=C, as nwo 


7 a) 
Hence c=O +. 


Example 6:57. How large a sample must be taken in order that the prob- 
ability will be at least 0-95 thai X, will be lie within 0-5 of w.ja is unknown and 


o=l. _ [Delhi Univ. B.Sc. (Maths Hons.) 1988] 
Solution. We have: E (¥,)= uw and Var (Y,) =o°/n 


[ c.f. § 6-15, =n (6-80)] 
Applying Chebvchev’s inequality to the rv. ¥, we get, for any c > 0 
a = Var (Vv 
P{| X,-E (X,) l<c] 21 _ Yar Xn) 
Ze 


=> 


. 


P{| X.-u |<ch21- o 
We want v so that 


.(*) 
P| X.—u | < 0-5] = 0-95 (**) 
Comparing (*) and (**) we get: 
2 
c=05=1/2 and 1-— 


(Given) 
5=Q0-95 and o=1 
nc” 


1 Ea 095 => ae 005-4 => n=80. 
n n 20 
Hence n 2 80. 
Example 6-58. (a) Let X; assume that values i and -i with equal prob- 
abilities. Show that the law of large numbers cannot be applied.to the independent 
vartdbles X\, Xr, ..., t.e., X;’S. 
(b) IfX, can have only two values with equal probabilities i* and —i", show 


that the law: of large numbers can be applied to the independent variables 
X .X..,t a<t. 


Solution. (a) We have 
P(Xi=i)=1, P(Xi=-i=! 

E(X)=!() +1) =0; f= 1.2,3,... 

V(X) #E(X) 2544 


5 > = [7 | * E (Xj) =0 ] -«(*) 
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B, = V(X, + X2+...+ Xn) = V(X1) + V(X2) +... + VX) 


= (1+27+...+2°) ~ 20s tere) ...[From (*)] 


-; = —> x as n—»%, Hence we cannot draw any conclusion whether 
n 


WLLN holds or not,Here, we necd to apply further tests. (See Theorem 6-33 page 
6-104) 


(b) ew +(5 |-0 


2 


(ty 2 ct 
E (x3) =") + (—1 ) = jo? 


V(X) =E (X7) -[ E (XP =i?" 
rn 
Bh=V (Xi +X2+...+ Xi) = =I V(X) = S i? 
; i= | i=l 


7 5) 9 
mint oe Oe fx de 
0 
[From Euler Maclaurin’s Forsula] 


yon n neat 
2a+1 0 2a+1 
B,, co 


ee 0 if 2a - ul 
- es le if 20 1<0>4a<, 


Hence the result. 

Example 6-59. Let{X,! be mutually independent and identically dis- 
tributed random variables with mean uw and finite variance. lf S, =X; +X2 + 
+X, . prove that the Jaw of large numbers does not hold for the sequence ) 

Solution. The variables now are S;; So, ..., Sp. 

Brh=V(S}+X2+.. . + Sn) 

VIX, + (X, +X) + (KX, + Xo 4X3) + 0.04 (X14 X24... + Xn )t 
Vinkit (n—- Ny eronnes ees A. ‘ 
n” V(X1) + (n— 1)" V (X2) +... + 2? WXn 21) + 12 V (Xn) 

(Covariance terms vanish since variables are independent.) 
Let V (X;) = 0° for all i. 


i] 


(Since the variables are identically distributed.) 
+ 4 a 2 - 
= (1° + 2° +... pee 
Bn (n + 1) (2n+1)- 


2 6n 


ms 
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Example 6-60. Examine whether the week law of large numbers holds foy 
the sequence Xi of independent random variables defined as follows: 
P[Xpa2 227 rd 
P[Xt=O]=1-2 7% — (Dethi Univ. B.Sc. (Matis Hons.), 198g) 
Solution. We have 
E (Xi) = 224 24) 2M 40x (1-2-4) 
oa q7 (ak 1) (2" i") es 9) 
E. (XE) = (2*y? 27 ht 2 a x 1 209 
gk 94-41), 5 (97 Ok+1) 


¢ 


a 
Var (X,) = E (X32) -E (Xp)? =1-0=1 
B, = Var 3 x)= E Var (Xi) 
[- X's (0 = 1,2, ..., 2) are independent | 
= 2 (A)=n 
tim Bn lim 1 


= S256 


n> x n- n> © R 


Hence (Weak) Law of large numbers, holds for the sequence, of independent. 
rv.’s{ Xx. 
Example 6-61. Let X1,X2,...Xn be jointly normal with E (X;) =0 and 
E (x?) =1 for all 1 and Cov (Xj, Xj) = 9 if joi] <1 and =0, otherwise. Ex- 
amine if WLLN holds for the sequence Xn ; ...(*) 
Solution. We have: 
Var (Xi) = (X7)-[E XW) f =1, (i= 1,2,..,2). 


E (Sn) =E [2 xi]=0 


i=} 
x | 


i=l 


Var (X,) = Var 


a Rn 

= VarX;+2 = Cov (X%,,.X)) 
iz] isjel 
n+2.(n—-1)p [On using (*)] 
Since X-, Xo, .... Xn are jointly nonnal, 


Sp = 2 Xi ~ N (0; o° Where o* =n+ 2(n-.1) p>0, 
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n 
Taking Y, =— eX =", we have- 
Fliay,?) = ® lites, 


elo? dx 


Cora) brs 
n2 + x2 V2ns 


9) } o2 2 os \ 
0 


2 
n>+y?{n+2(n-1)p) 


/ e772 dy 


= Tog tht 2 Deb. 


y? e7vl2 dy 


PJ 
i a) 

SIs 

a 


! 
al” 


S 
Hence by Theorem 6-33, WLLN holds for {X,,}, i.e., a Las 0 as n — 9, 


6°16. Borel-Cantelli Lemma. (Zero-One Law). Let A,, Az ... be a 
sequence of events. Let A be the eent “that an infinite number of A, occur. That 
is € A if @ ce A, for an infinite number of values of n (but not necessarily 
every n). But the set of such w is precisely lim sup An, i.e., lim A,. Thus the 
event A, that an infinite number of A,, occur is just limA,. Sometimes we are 
interested in the probability that an infinite number of the events A, occur. 
Often this question is answered by means of the Borel-Cantelli lemma or its 
converse. 

Theorem. 6°34. (Borel-Cantelli Lemma). Let Aj, Ao, ... be a sequence of 
events on the probability space (S, B, P) and let A = lim A,, 


if dX P (A,) < ©, then P (A) = 0. 

In other words, this states that if % P(A,) converges then with probability 
one, only a finite number of A), Ar, «- : can occur. 

Proof. Since A=lim A, = A) U Am 


n=l)men 


wehaveAc U A,» for every n. 
mon 


Thus for each n, 
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P(A)S & P(A,) 
m=n 
Since »y P(A,,) is convergent (given), = P (A,,), being the remainder 
a= man 


term of a convérgent series, tends to zero as nm + ©», 


P(A)s & P(A,) > Oasn—~ . 
m=n 


Thus P (A) = 0 as required. 
The result just proved does not require events A,, Ao, ... considered to be 


independent. For the converse result it is necessary to make this further 
assumption. 


Theorem 6°35. Borel-Cantelli Lemma (Converse). Let A,, A2, ... be 
independent events on (S, B, P), A equal to lim A,. 


if a P (A,) = ©, then P(A) = 1. 


Proof. Writing, in usual notation, A, for the coniplement.S — A, of A,, we 
have for any m,n (m>n). 


because of the fact that if (A,, A, ¢ 1.» A,,) are independent events, so are (A,, 
A, tt 1a sees Am). ; 


Hence P( AA) <1 PAD] 
a | r= An 


mn 


< FI ce?“ 
ken 


[-- 1 —x <e~* for x 2 0) 
ae 
- JI Pay 
=e kan Vm 
n 


Since & P(A,) =, p> P (A,) > © as pit — 00 
i= = a 


m 
- J P(A) 
and hence e ‘« Vmn—-a>Oasni-oo. 


P(A Ax) =0 8) 
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ia A= OAs 

A= U OAR (De-Morgan’s Law) 
s P(A)S =. P (A Ax) =0 [From (*)] 
Hence P(A)=1-P (A) = ], as required. 


If A,, A>, ... are independent events it follows from Theorems 6-34 and 
6-35 that the probability that an infinite number of them occur is either zero 


(when . P(A n)< eo) or one [when y P(A n) = ©]. This statement is a 


nel 


special case of so-cailed “Zero one law” which we now state. 


Theroem 6°36. (Zero One Law). If A,, Az, ... are independent and if E 
belongs to the o-field generated by the class (Ay, An + 1 +.) for every n, then 
P (E) ts zero or one. 

Example 6°62. What is the probability that in a sequence of Bernoulli 
trials with probability of success p for each trial, the pattern SFS appears 
infinitely often ? 

Solution. Let A, be the event that the trial number k, K + 1, k + 2 produce 
the sequence SFS (k = 0, 1, 2, ...). The events A, are not mutually independent 
but the sequence Aj, Ay, Az, Ajo, ... contains only mutually independent events 
(since no two depend on the outcome of the same trials). 

Pr=P (Ay) =P(SFS)=p*q, (q=1-p) 
is independent of k, and hence the series 
Pit Pa+p7 t ..., diverges 

Hence by B.C.T. (converse) the pattern SFS appears infinitely often with 
probability one. 

Example 6°63. A bag contains one black ball and white balls. A ball is 
drawn at random. If a white ball is drawn, it is returned to the bag together with 
an additional white ball. If the black ball is drawn, it alone is returned to the 
bag. 

Let A, denote the event that the black ball ts not drawn in the first n trials. 
Discuss the converse to Borel-Cantelli Lemma with reference to events A,,. 


Solution. A, = The event that blackball is not drawn in the first trials. 


gal) 
= The event that each of the first. 1 trials resulted: in the 
draw of a white ball. 
=> P(A,) = P(E; OE... OE,), 


where E; is-the event of drawing a white ball in the ith trial. 
*. P(A,) = P (Ej) P(E, | Ey). P (E34! £) A Ep) .... P(E, |) Ey A Ep... A En-1) 
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¢ 


m m+1 m+n—1 m 
= x ) A _ eae 
m+1 m+2 m+n m+n 


(Since if first ball drawn is white it is returned together with an additional 
white ball, t.e., for the seconod draw the box contains 1 b, m + 1 W balls and 


- 4 _m+1 

P (E2| E1) =" , and so on. 
3 PAta oem > 
ne li no {Ntn a das 


Lee een) ere | 
ml m+1 m+2 m+3 "'T 


4 ag 4g 
“| 3-( Sin) “ 


But. -2 sis verge > - is ‘convergent, iff p> 7 


n=) nel 


sje tne: ’ 
oe | 


R:H.S. of (**) is divergent. 


and 


Hence = P(A,) = 


n=l 
From the definition of A, in (*) it is obvious that A, | . 
A= ua An= lin sup A, = @ 


P(A)= P(@)= =0 


This result is inconsistent with converse of Borel-Cantelli Lemma, the reason 
being that the events A, (n= 1, 2, ...) considered here are not independent, 


P(A; Aj) = P (Ai) = ~~~ # P (Ai) P(A), 
since for >A; C Py as An |} 
EXERCISE 6 (d) 


1. ‘State and prove Chebychev’s inequality. 
2. (a) Arandom variable X has a mean value of 5 and variance of 3. 
(i) What is the least value of Prob [ |X -—5| <3]? 


(ii) What value of h guarantees that Prob [ | X-—5 | <A] = 0-99 ? 
(iii) What is the least value-of Prob ( |X —5 | < 7-5) ? 
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(b) A random variable X takes the values —1, 1, 3, 5 with associated 
robabilities 1/6, 1/6, 1/6 and 1/2. Find by direct computation P ( |X-3|21). 
Find an upper bound to this probability by applying Chebychev’s inequality. 
(c) IfX denote thesum of the numbers obtained when two dice are thrown, 
yse Chebychev’s inequality to obtain an upper bound for P| |X—7|>4}.Com- 
pare this with the actualy probability. 


3.(a) Anunbaised coin is tossed 100 times. Show that the probability that 
the number of heads will be between 30 and 70 is greater than 0-93 . 

(b) Within what limits will the number of heads lie, with 95 p.c. probability, 
in 1000 tosses of a coin which is practically unbised? 

(c) Asymmetric die is thrown 720 times. Use Chebychev’s inequality to 
find the lower bound for the probability of getting 100 to 140'sixes. 

(d) Use Chebychev’s inequality to determine how many, times a fair coin 
must be tossed in order that the probability will be, at least 0-95 that the ratio of the 
number of heads to the number of tosses will be between 0-45 and 0°55. 

[Delhi Univ: B.Sc. (Stat. Hons.), 1988] 

4. (a) If you wish to estimate the proportion of engineers and scientists 
who have studied probability theory and you wish your estimate to be correct within 
2% with probability 0-95 or more, how large a sample would you take (i) if you 
have no idea what the true proportion is, (ii) if you are confident that the true 
proportion is less than 0-2 ? [Burdwan Univ. (B.Sc. (Hons.), 1992] 

Hint. (1) « =2% or 0-02 


P [| fe-p|<002]21-——-— =095 
4n (0-02) 
05 = —— —> n= 12,500 
4n (0-02) 
i) p< 02,P[lferp|se]21-PL 
Now p(1—p)<0-16, therefore 1-9 = 0-95 


Hence n= 50 x 50 x 20 x -16 = 8000 


(b) ‘Let the sample mean of a random variable X be X s.d.s. Then ifat lease 
99 per cent of the values of X fall within K standard deviations from the mean, 
find K. 


§. (a) IfX isar.v. such that E (X) =,3 and E (X’)= 13, use Chebychev’s 
inequality to determine a lower bound for P (- 2<X<8). 


[Delhi Univ. B.Sc. (Maths Hons.), 1990] 
Hint. Ux = 3, or=4 => oO, =2. Chebychev’s inequality gives 
P[|X-3|<2k|21-1/ = PB-2k<X¥<3+%wW21-Ve 
Now taking k = 2:5, we get.P(—2 <X <8) 221/25. 
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(b) State and prove Chebychev’s inequality. Use it to prove that in 2000 
throws with a coin the probability that thé number of heads lies betWeen 900 and 
1100 is at least 19/20. [Delhi Univ. B.Sc. (Maths Hons.), 1989} 

6. (a) A random variable X has thé density function € ‘forx20. | 
Show that Chebychev’s inequality giver P [|X - |'|>2]< 1/4 and show that the 
actual probability is e ~ 

(b) LetX have the p.d.f.: 


fQ) = sa VB <x <3 


= 0, elsewhere. 


Find the actual probability P [| |X =| 2 ; G | and compare it-with the upper 


bound obtained by Chebychev’s inequality. 
7. IfX has the distribution with p.d.f. 
fQ@) =e", Osx<a, 
use Chebychev’s inequality to obtain a lower bound to the probability ef the 
inequalitv —1 s X <3, and Compare it with actual value. 

%. Explain the concept of “convergence in probability”. 

If X1, X>, ... x by r.v.s. with means 11, U2, ..., Pa and standard deyiations 
G1, 02, ....On-and if 6, > 0 as n— &, show that Xn—Un converges to Zero 
stochastically. 

Hence show that if m is the number of successes ina indeperident ‘trials, the 
probability of success atthe ith trial being pi then m/n converges in probability 
to (pi + prt ...+pn)/n. 

9, (a) IfX,, takes the values 1 and 0 with corresponding probabilities p, and 
1 —pn, examine whether the weak law of large numbers can be applied to the 
sequence | Xn where the variables X, ,n = 1, 2,3, ... are independent. 


(b) | Xi 1i=1,2,... is a sequence of independent random variables with 


, 1 ? 
expected value of X; equal to m; and variance of X;, equal to G} . If—+. 2 oj 


n i=] 
tends to zero as n tends io infinity, show that the weak law of large numbers holds 
good to the. sequence, - [Rombay Univ..B.Sc. (Stat.), 1992] 


10. ¢Xe-, k=1, 2, ... isa sequence of independent random variables cach 
taking the values —1, 0, 1. Given that 
P (X= 1) => P(X, = -1), PU=0)=1-. ; 
Examine if the law of large numbers holds for this sequence. 


LI. (a) Derive Chebychev’s inequality and siow how it leads to the weak 
law of large numbers. Mention-some important particular cases wherein the weak 
law of large numbers holds good. 
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(b) State and prove the weak law of large numbers. Deduce as a corollary 
Bernoulli theorem and comment on its applications. 


(c) Examine whether the weak law of large numbers holds good for the 
sequence X, of independent random variables where 
1 Pe 1 1 
P| X,=—= |F>, P| Xn=-—R] ET 
"van | 3 vi} 3 
(d) |X, is a sequence of independent random variables such that 


1 1 
X, == =Ppn, n= F~— | Sele pn 
P a Pn, P| X, a l-p 


Examine whether the weak law of large numbers is applicable to the sequence 
ta 
(e) IfX isa random variable and E (X2) < ce,:, then prove that 
4 1 > < 
Pi |X lz ais E(X*), for all a>0. 
‘ 


Use Chebyshev’s inequality to show that for n > 36., the probability that in 
n throws of a fair die,‘the number of sixes lies between 2 n+Vni and - n+Vn is 
at least 31/6 . : [Calcutta Univ. B.Sc. (Maths Hons.), 1991] 


12. Let{ X, be a sequence of mutually independent random variables such 
: -n 


that n= 1 with probability cS 


and = =-s«X, = 2"" with probability 2." ' 


Examine whether the weak law of large numbers can be applied to the 
sequence | X, |. 


13. Examine whether the law of large numbers holds for the sequence 
| Xe of independent random vaniables defined by P (X, = + k™ 2) = : 


14. (a) State Khinchin’s theorem. 


(b) Let X1, X2, X32, ... be a sequence of independent and identically dis- 
tributed r.v.’s, each uniform on [0, |]. For the geometric mean 


Gy = (X, X2... Xn)!" 
show that G, P c forsome finite numberc. Find c. 
Hint. X ~U([0,1], let Y=-log X ; 


Then Fy(yy= le"; -=> fy(yb=e*sy20. 
“ Yj=—log X;, (0 = 1, 2, ..., n) are iid. rv.’s. with E (Yi) =1. 
By Khinchin’s theorem 


= Yn/n=-| 2 ea in |=—tog Gn 2 EY;=1 => Gf etec. 


t=] i=] 


(c) LetX);X2,... be iid. r,v.’s-with cominon p.d.f. 
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f(x) = mpi, 5>0 


=(Q,x<1 
Discuss if WLLN holds for the sequence | Xn 
Hint. EX; = (1+ 5)/d < (finite). Hence by Khinchin’s theorem 


Sn/n = LXi/n P (1+6)/6 as no. 


15. Let { Xn | be any sequence of r.v.’s Write Y, = rs > Xj. 


i=] 
Prove that a necessary and sufficient condition for the sequence { Xn} to 
satisfy the weak law of large numbers is that 


2 
E Lf 5 | as no, 
1+/Y, 
Hint. See Remark to Theorem 6:33 . 
16. State and prove Weak Law of Large Numbers: Determine whether it 
holds for the following sequence of independent random va riables: 
P(X, =+ 1) =(122°")/2 = P(X, =- 1) 
[Delhi Univ. B.Sc. (Maths Hons.), 1989] 


17. LetAi, X2,... bei.id. standard Cauchy variates. Show that the WLLN 
does not hold for the sequence { Xn}. 


Hinf. Use Theorem 6°33 . 


S2 SJjn 


2 
1 
| 5 at de 
Spitx *1tx 


| 2 = wa is also a standard Cauchy 
variate. See Remark 4, § 8-9-1] 
2 paaomel = 
= iJ sin Od =~ (x = tan 6) 
=> am E tin, —>0 = WLLN does not hold for | Xn} js 
Ae ee 1+Y; 


18. (a) Examine if the WLLN holds for the sequence | Xi ; Of iid. r.v.’s 
with 


P[ xa! k] =; k=1,2,3,..,é=1, 2,3, ... 


[Dethi Univ. B.Sc. (Maths -Hons.), 1990] 
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2. ket 6 6 Se aykst 
Hint. EWiy= =X (-1) wk. 5732 x= (-1) . (17k) 
ke] uk’ nN pal 
x 2 3 4 § 
6 
= —.log.2 
ae 


[-.. The series in-bracket 1s convergent by Leibnitz test for alternating series]. 

_ Hence by Khinchin’s theorem, WLLN holds for the sequence { X;] of i.i.d. 
r.v.’S. : 

(b) The r.v.’s Xi, X2, ..., Xn have equal expectations and finite variation. Is 

the weak law of large numbers applicable to this sequence if all the co-variances 

Oj are negative? [Delhi Univ. B.Sc. (Maths Hons.), 1987] 


B, V ae Z of 
Hint. Bi Man Oe Nai ss Ny) _ tL ZL of+2 = on 
n n n [ i=} i<jel 


| n 
<Z z o? —0Q as n> © 
NW \ ja} 


Hence WLLN holds. 

19. State and prove Borel Cantelli Lemma. 

Ina sequence of Bernoulli trials !etA, be the event that a run of n. consecutive 
successes occurs between the 2’th and 2”* ‘th trails. Show that if pe ; , there is 
probability one that infinitely many A, occur, if p < ; , then with probability one 
only finitely many A, occur. 


(-. 07 are finite ) 


20. Let Xi, X2, ... be independentr.vs.andS,= <£ X,.If = 0° x, COon- 


kel nal 
verges, prove that the series 2 (X, — E Xn) converges in probability. 


n 
f—= £6 x, 0, then prove that 
bn™ kea)\ 
Ba — Sn p 
bp Fa" 
Deduce Chebychev’s inequality. 
6:17. Probability Generating Function 
Definition. If ao, 41, 43, ... is a Sequience of real numbers and if’ 
00 
A(s) = 40+ MSHS +... ais’ ..(6°84) 
1 i=Q _ 
converges in some interval — So < S$ < So, when the sequence is infinite then the 
function A(s) is known as the generating function of the sequence {a;}. 
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The variable s has no significance of its own and is introduced to identify a; 
as the co-efficient of s‘ in the expansion of A(s). If the sequence {a;} is bounded, 
then the comparison with the geometric series shows that A(s) converges at least 
for|s|<1. 

In the particular case when a; ts the probability that an integral valued discrete 
variable XY takes the value 4, 

Le, azpi=P(X=1); t=0,1,2,... with Ip;=1, then the probability 
generating function, abbreviated as p.g.f., of r.v. X is defined as : 


P(s)=E(s*)= E s*. px .-.(6°85) 
x=0 
Remarks. 1. Obviously we have P(1)= 2 py =.1. 
x 


Thus a function P(s) defined in (685) is a p.g.f. iff pe = OW x and 
z Px = 1 


x 
2. Relation between p.g.f. and m.g_f. 
Taking s=e' in (6:85), we get 
P (e') = E (e*) =-Mx (t). ...(6°86) 
i.e. from p.g.f. we can obtain m.g.f. on replacing s by e’. 
3. Bivariate probability generating function. The joint p.g.f. of two random 
variables .X; and X2 is a function of two variables s; and s2 defined by : 
Px, x2 ($15 52) = E ($1"". 52%) = ZL sy" 52. p (x1, x2) «..(6°87) 


x1 x2 
Marginal p.g.f.’s can be obtained from (6°87) as given below. | - 


Px, (51) = E (s1"' ) = Py, x, (51, 1) ; Px, (52) = E (s2°? )-= Px, x. (1, 52) ...(6°88) 
4. Two r.v.’s are indepéndent if and only if : 
Px,, x2 ($1, $2) = Px, (S1) . Px, (82). .-.(6°89) 
The above concepts can be generalised to n random variables 
Theorem 6-37. If X ts a random variable which assumes only integral values 
with probability distribution 
P(X=k)=pm; °=0,1,2,... and P(X>k)= qe, kz0 
k 


SO that Qk = Pk+1+Pr+2+...=1- 2 pj; and two generating functions are 
i=0 


P(s)=po+prstprs’+,.. 
O(s)=qotgistqs +... 
then for ~-1<s<.l, Q(s)'= 1-P(s) ...(6-90) 


l-s 
Proof. We have 
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Gk-1- k= Pk, K2z1 


7 ‘ 
= ge-is* - = qes* == pes* 
ke=1 k=l k=l 


=> 5 Q(s) - Q (s) + qo = P (s)- po 
_ 0+ qo) - P(s) 

=> Q (s) = (-s) 

But po + Go = pot pirt-prt...=1 


Hence the-theorem. 

Theorem 6-38. For a random variable X, which assumes only integral values, 
the expectation E(X) can be calculated either from the probability 
distribution P (X = 1) = p; or in terms of 


Qk = Pk+1 + Pk+2+... 


~ © eo 
Thus E(x)= = ipp= = Qe 
t=1 k=0 
In terms of the gererating functions 
E(X)=P (1) =Q:‘(1) ; ...(6-91) 
Proof. P(s)= = pes* . If E (X) exists, then 
k=O 


E (x)= x k px 
kel. 


Ps)= Dkps*-' = P'(1)= : A Pk 
k=l ket 

“. E (X) = P'(1) 

We know that 


Q(s)(1-s]=1-P(s) 
Differcutiating both sides w.r.t. s, we.get 


Q'(s)[I-s]-Q(s)=-P'(s) ..(*) 
- Q(1)=P'(1) 
Hence E (X) =P’ (1) =@ (1) 


Theorem 6:39. If E(X7) = Zk’ px exists, then 
E(X*)*P"(1)+P'(1)=29'(1)+Q(1) 
and hence V(X) =2Q'(1)+Q(1)-{Q (I) =P" (1) +P’ (1) - {P' (YP 
(6-92) 


Proof. P (s)= = pes", P'(s)= Lkps*7' 
k= 
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sP"(s)= Zk pes a Differentiating again, we get 
P'(s)+sP"(s)= Th pxs*") 


| 4 
P'(1)+P"(1)= 2k pew E(X’) (48) 
Q" (s) [1 - s] -2Q' (s) =-P” (s) [Differentiating (*) again) 


Putting s=1, we get 
2Q'(1) =P" (1). Substituting in (++*), we get 
E(x") = P’ (1) + P” (1) = Q(1) +2.’ (1) 
Var (X) = E (X?)- {E (XY =P" (1) +P) - (Py 
=20'(1)+2(1)-{2()}" 
617-1. Probability Generating Function for the sum of independent 


variables (Convolutions). If X and Y are non-negative independent, integra} 
valued discrete random variables with respective probability generating functions 


P(s)= = pes*, pr=P(X=k) 
k=-0 


R(s)= XI ores*, m=P(Y=h, 
k=0 

it is possible to deduce. the probability generating function for the variable 
Z=X+/Y, which is also clearly integral valued, in terms of P (s) and Q (s). 

Let w, denote P (Z =k). The event Z=k_ is the union of the following 
mutually exclusive events, 

(XzONYe=k) XKa1NV=k-1),(X¥=2NY=k-2),..., (X¥e2kNnY=0) 
and a e 
since the variables X and Y are independent, each joint probability is the product 
of the appropriate individual probabilities. Therefore the distribution w, =P 
(Z = k) is: given by 

We = Pork + Pi tk-1+ P2rk-2+...+ pero forall integral k20 

The new sequence of probabilities {w,} defined in terms of the sequences 

{px} and {rx} is called the convolution of these sequences and is denoted by 


{wa} = {pa} * {re} (6-93) 
Theorem 6-40. 


Wegn E[XO-1)... hors »|- Pio) 
aS sel 


Proof. Differentiating (6-85)-partially r times w-r.t. s, we get 
oP) = » x (x-1)(x-2)...@-r+ 1) s*~" px 
x ‘ 


as’ 
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ers = ¥ x(e-1)(e-2)...(e-7+ p= BY, 
sel 


0S 
x 

Theorem 6-41. Jf {px} and {rx} are the sequences with the generating 
functions P(s), A and {w,} is their convolution, then W (s) = P (s) R (s), where 
W (s) = 2 We s" is the generating ee of the sum X + Y. 

Proof. Since the co-efficient of s* in the product P (s) R (s) «is 

Porkt+ Pl Tk-1 + -.. t+ Pk-171+ Pevo = Wi, 

it follows that the probability generating function for Z, namely, 


wi(s)= wis” is equal to P (s) R (s). 
k=-0 ‘ 


Cor. If X1, X2, Xa are independent integral—valued discrete variables with 
respective probability generating functions P;(s) , P2(s), ..., Pa(s) and if Z =X 
+X7+...+Xn, the probability generating function for Z is given by 


Pe()= TPG) 


In particular, when X41, X2, ...,Xn all have a common distribution and hence 
common probability generating function P (s), we have 
a 
Pz (s) =[P (s)]" 
Example 6-64. Can P{s) = 2/(1 +s) be the p.gf. ofar.v.X ? Give reasons. 
Solution. We have P(1) = 2/2 = 1. 


Also P(s)= 2 prs” = 2 (1 +s)7 


ve 


=2 (l-s+s’-s*+..,) 


=2 7 (-1)'.s’ 
r=(0 --o(#) 
=> pr=2(- 1) 


Hence p** <0, ie., P1, P3, PS, ... are negative, 
Since some co-efficient in (*) are negative, P (s) cannot be the p.g.f. of a 
rv X. 


Example 6-65. If P(s) is the probability generating function for X, find the 
generating function for (X ~ a)/b. 


Solution. P(s)=E (s*) 
PGF.for A — {cE (s¢-™) ag” E (s*”°*) 


6-128 Fundamentals of Mathemati€al Statistic, 


ws E [(s!/)*] = 5% psi’ 
Example 666. Let X bea random variable wih generating function 
P (s). Find the géneviuing function of (a)X +1 (b) aX. 


Solution. (42) P(s)= = Pa S ‘aE (s*) 
ke 


P.G.H.of X41 =E(s** es Ets*)=s, P(s) 

(b)  P.G:F.of 2X =E (s ~) =E4(s7*h= P(s?) 

Example 6-67. Find the generating Junction of (a) P(X sn), {b) P(X <n), 
and (c) P(X = 2n). (Dethi Univ: M.Sc. (O.R.), 1989] 

Solution. (a) Let X be an integral valued random variable with the 
probability distribution 

P(X=n)=p, and PX sn)=q, 

sothat 9n=po.t+pitp2rt...+-pn; 1,29, 1, 2, ..4 

= Gn-Qn-1=Pn, N21 


=> ZL ans"- © Ga-1S"= = pas” 

ne] nw] nel 
=> Q.(s) - qo - 5 O(s) = P-(s) - po 

.. P(s)+qo-po P: 

= g(a S)*ao~Po_ PS) JL qo = po} 
(b) Let qun=P(X<n) = pot pit ...+ Pn-1,Qn-Qn-1™ Pn-1, N22 

%» @ r> «) 00 
=> So gas"= ZL Ga-is"= EZ pa-is"=s & pas® 

na=2 n=2 n=2 nel 
= Q (s) - 91 5 ~ SQ(s) = sP(s) - spo { °° go = 0] 
~ Q(s)[1-s]=sP(s)-spotqis [41=po] 
Hence Q(s)- 28 


(c) Let P(X =2n) = po 


QO(s)= = prs” =po+ prs + pas? +... 


Rae 
20 (s). = 2po + 2p2s+2pas?+... 
=(po+ pis + prs+p3s> + pas? +...) 


' + (po-pis? + prs ~p3s*” 


+...) 


at  -pr (s a Y pe(-s 1" 


= P(s 2). P(-s 1”) 


P(s ¥*)-+P(s —V2) 


Q(s)= 5 


Mathematical Expectation 6-129 


Example 6-68. Let {X;} be mutually independent, each assuming the values 
0,1,2,..-,a—-1 with probability ~ : ‘ 


Let S, 2X1 +X2+ :..+Xn. Show that the generating function of S,, is 


P(s) = 


iz 
a(1-s) 
and hence 
reseaes Score (,25) 
“(Rajasthan Univ. M.Sc. 1992) 
Solution. As the {Xx} are, mutually independent vanables and each X; 
assumes the same values 0, 1, 2, ...,@ - 1, with the probability, . therefore each 


will have the same generating function and the generating function of S, will be 
the nth convolution of generating function of.X1. Now 


a-)| a 
+ a-l 1-s 
Px, ‘ee = [3° +5) +...45 ie a-s) 


Now the ore ce =j is the co-efficient of s’ in 
ta-s “y" (1 - sy" 
a 
If we take ty + 1)th term from (1 -s 7°)", then it wil have the power of s 
equivalent to s ” and hence to get the powet of 5" as jy we must take th; jterm from 


(1-s) " having the power ofs as j - av. 
Required probability . 


“23, )er (cae 


S3,e0"""(4(75) 


a" v0 j- av 
=J py celle | 
a va0 j-—av 


(14% a1 =e C= 177] 
Example 6-69. A, random okie X assumes the, value, Ai, A2; . with: 
probabilities uy, ua, ..., show that 


Pr* a zr we “4 OY; AQ>0, Luj=1 


6-130 Fundamentals of Mathematical Statistics 


is g probability distribution. Find its generating function and prove that its mean 
‘equals E (X) and variance equals V (X) + E (X). 
Solution. 
Tope Pj = wer (ay 
k-0 k-0| K! jn0 
oo) ; re) . oo 
=S jue’ = (Al l= LD werd 
j k=0 ' j20 
« 2 uj = A, 
joo 
Hence px represents a prbability distribution. 
Let P(s) be the generating function of pe, then 
fo) co a) oo 
P(s)= 2 pxs*= = a z we (Aj‘s* 
k=0 k=0 j= 0 : 
“(Fubini’s Theorem) 
@ fe) . 
= 2 jue = (sd)*/k! 
_j=0 k=0 
fe) 
= Lu; eee F Uj eie-¥) 
j=0 y=9 
2 2(5-1/ - 
Thus P(s)= = uj Lrye-neME- | 
j-0 2! 
ro) 
P'(1)= = uj Af#E (X) 
j-0 
= v sy 
P"(s)= Zu; [2 #2 (s—-1) +...] 
j=0 
~ ° ~- 
P"(1)= = dP wek (Xx?) 
j= 0 
Vpn) =P" (1) +4 P (AY- {P’ (IP = EK) +E) - (EY 
=E (xX) +V(X) 
EXERCISE 6(e) 
1. (a) Define the probability generating function (p.g.f.) of a random variable. 
(b) X is a positive integral valued variable, such that P (¥ =n) = py, 
n=O, 1, 2, ... Define the probability generating function G(s) and the moment 


gencrating ‘function M (2) for X and show that, M (0G (e’). Hence or otherwise 
prove that 


E(X) =G’ (1), var (X) = G" (1) + G' (1) - [6 (DP 


ES 
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(c) IfX and Y are non-negative integral valued independent random variables 
with P (s) and Q.(s) as their probability generating functions, show that their sum 
X+Y has the: p.g.f. P (s) Q (s). 

2. A test of the strength of a wire consists of bending and unbending until it 
breaks. Considering bending and unbending as two operations, let X denote the 
random variable corresponding to the number of operations necessary to break the 
wire. If P(X =r) =(1 -p)p™' ; r=1,2,3,...and0<p< tt, find the probability 
generating function of X. 

3. Define the generating function A(s) of the sequence {a;}. Let a; be the 
number of ways in which the scorey can be obtained by throwing a die any number 
of times. Show that the g.f. of {a;} is(1-s-s Puig? ae ae a ie ~1 

4. Four tickets are drawn, one at a time ‘with replacement, from a set of ten 
tickets numbered respectively 1, 2,3, .... 10, if such a way. that at each draw each 
ticket is equally likely to be selected. What is the probability that the total of the 
numbers on the four drawn tickets is 20? . 

Hint. If X; denotes the number on the ¢th ticket then, for i = 1, 2, 3, 4, we 
observe that X; is an integral-valued variate with possible values 1, 2, 3, ..., 10, 
each having associated probability 1/10. Here a X; ‘has 


1 

Bla ag stags ttt aa sat s(1 es) (1-5)! . 
and, since the X; ‘s are independent, it follows haa the total of the numbers on thie 
drawn tickets ( 
Z =X, + X21 + X34+Xz 


has probability generating function 
1 10 Z -1 : ee! 4, 10 4 - -4 
{7050 shee) } io’ sali )' (1 =) 
The required probability is thé co-efficient of s’° 
1 10)4 -4 0 63 
ie ESA -" = TO O09- 


5. Find the generating functions of (a) Pix 2>'n), (b) P(X>n+ 1). 

6. (a) Obtain the generatitig function of qn, ‘the probability that in n tosses 
of an ideal coin, no mn of three heads occurs. 

(by LetX bea non-negative integral-valued random variablé ‘with prowepinty, 


‘+ © 


generating function P({s) = 2 pas": Afterobséiving'X, conduct X biiiomial “trials 
n=0 
with probability p of success. and let Y denote the corresponding resulting number 
of successes. 
Determine (i) the probability eee function of Y and , ww probability 
generat ng function of X given that Y= X , 
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7. In a sequence of Bernoulli trials, let U, be the probability that the first 
combination SF occurs at trials number (# - 1) and-n. 
Find the generating function; mean:and variance. 
Hint. U2 = P(SF) = pq, Us = P(SSE) + P(FSF) =pqi(p + 9) 
U4 =.(SSSF) + P(FSSF) + P(FFSF) =-pq(p* +'pq + q°) 


n-2 
Ingeneral U,=pq ,z p ge 
ne 


8. (a) Ina sequence of Bernoulli trials, let U, be the probability of an even 

number of successes. Prove the recursion formula 

| U, = qUn-1+(1-U,-1) p 
. From this derive ‘the generating function and hence the explicit formula for 
Up. 

(DJA Series of independent Bernoulli trials is performed until an.uninterrupted 
run of | r-Successes is obtained for the first time where r is a given positive integer. 
Assuming: that the probability of a success in any trial is p = 1 - g, show that the 
probability peneraune function of the number of trials is 

F(s)= p's" (1 - ps) 


rere ra 


S+qp s' 


ADDITIONAL EXERCISES ON CHPATER VI 

1. (a) A horizontal line of length ‘5’ units is divided into two parts..If the first 
part is of length X, find’E(X) and E (X-6 —- X)}. 

(b) Show that 
E (X~ p)’ x £(X%) = 3p0?- 
where p and o” are the mean and variance of-X Irspectively. 

2. (a) Two players A and B alternately: roll a.pair‘of fairdicé. A wins if he 
gets six points before B gets seven points and 'B wins if he gets seven points 
before A Pets : six points. IfA takes the first turn, find the probability that B wins 
and the expected number of trials forA to win. 

(b) A box contains 2” tickets among which "C; tickets bear the numbers 
i(¢=0, 1, 2,. ., n). A group of m tickets is drawn. Let S denote the sum of their 
numbers. Find a expectation and variance of S. 

Ans. 1 zmn, 5 + mn - {mn (m - 1)/4 (2" - 1)} 
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3. In an objective type examination, consisting of 50 questions, for each 
uestion there are four answers of which only one is correct. A candidate scores: 1 
if he picks, up the correct answer and —1/3 otherwise. If a candidate makes only a 
random choice in respect of each of the 50 questions, find his expected score and 
the variance of his score. 

4. (a) A florist, in order to satisfy the needs of a number of regular and 
sophisticated customers, stocks a highly perishable flower. A dozen flowers cost 
Rs.3 and sell for Rs. 10. Any flowers not sold the day they are stocked are worthless. 
Demand in dozens of floweis is as follows : 

Demand 0 1 2 3 4 5 


4% 98 


Probability 0-1. 0-2 0-3 0-2 0-1 0-1 

(i) How many flowers should the florist stock daily i in order to maximise the 
expected value of his net profit ? 

(ii) Assuming that failure to satisfy. any one customer’s request will result in 
future lost profits amounting-to Rs. 5-10 (goodwill cost), in: addition to the lost 
profit on the immediate sale, how many flowers should. the florist stock ? 

(itt) What is the smallest goodwill cost of stocking five dozen flowers ? 

Hint. For? = 0, 1, 2, 3, 4, 5,. let Xi, be the random vaniable giving the florist’s 
net profit, when he decidés to stock ‘i’ dozen flowers. Determine the probability 
function for each and the meanof each and pick up that ‘i ‘for which it is-maximum. 

Ans. (i) 3 dozen, (ii) 4 dozen and (iii) Rs. 2 

5. Consider a sequence of Bernoulli trials with a constant probability p of 
success in a single trial. Let X;, denote the ene of failures following the 


(k-1)th and preceding the kth success, and let S,= E Xf. 
kel 


Derive the probability distribution of Xx. Hence derive the probability 
distribution of S,. \Find E (S;) and Var (S,). 

6. In the simplest type of weather forecasting — "rain" or“ no-rain" in the next 
24 hours — suppose the probability of raining is Pp (> 5); and thata forecaster Scores, 


a point if his forecast proves correct and zero otherwise. In making 7 independent! 
forecasts of this type, a forecaster, who has no genuine ability, predicts “rain” with 
probability A and “no rain" with probability (1 — A). Prove that the probability of 
the forecast being correct for any one day is 
[1~ p+ (2p X} 

Hence derive the expectation of the total score (Sn) of the forecaster for the n days, 
and show that this attains its maximum valve for.A = 1. Also, prove that 

Var (S,) =n [p - (2p - 1) A) [1-p + (2p- 1) 
and thereby deduce that, for fixed n, this variance is maximum for A = . 
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-Hint. P(X;=1)=1-P(X;=0)= =pri+q(1-A), Sa= z Xi 
- i-1 
and the X;s being independent, the stated results follow. 
7. Inthe simplest type of weather forecasting — rain or no rain in the next 24 
hours — suppose the probability of raining is p (> 3), and that a forecaster scores 


a point if his forecast proves correct and Zero otherwise. In-‘making n indiependent 
forecasts of this type, a forecaster who has no genuine ability decides to allocate 
at random r days to a “rain” forecast and the rest to "no rain". Find.the expectation 
of his total score {S,} for the 1 days and show that this attains its maximum Value 
for r =n. What is the variance, of S,, ? 
-Hint. .Let X; be a random variable such that 
X;=1 if forecast is correct forithday  - 
= 0 if forecast is incorrect for ith day, (i = 1, 2, ovsy n) 


Then P(X;=1) =~, P(Xj=0)=21-— = — 
n n n 


E(X)=p(7)+ cig r)andSe~ 5 Xi 


But the X; ’s are cortelated random variables, so that fori = j, 
E (X; Xj) = P (Xi =1 0.X; = 1) 


rp-Z[p(2=4)+ 4224) | 
#4\- slut i)* nT 1 7) | 


Hence:E:(S,) = np - (n—r) (p- q) < np. for p>q and ,V(S,) = npq. 

8. Let m letters ‘A’ and mz letters ‘B’ be arranged at random ina sequence. 
A run is‘a succession of like letters preceded and followed by-none or an unlike 
letter. Let W be the total number of runs of ‘A’ s and ‘B’ s. Obtain expressions for 
Prob {W=r}, where r is a given positive even integer and also when r is odd. 

Compute the expectation of W. 


9. ‘An urn contains K varieties of objects in equal numbers. The objects are 
drawnoneat a time and replaced before the next drawing. Show that the probability 
that 7 and no less one will be ees to dia objects of all varieties is 


s20 7 


Hence or otherwise, find the expected number of drawings in a simpie form. 


we 
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10. An um contains a white and b black balis. After a ball is drawn, it is to 
be returned to the urn if it is white, but if it is black, it is to be replaced by a white 
ball from another urn. Show that the probability of drawing a white ball after the 
foregoing operation has been repeated x times is 


b- (; 1 \ 
1-5 (i a4 

11. A box contains k varieties of objects, the number of objects of each 
variety being-the same. These objects are drawn one at a time and put back before 
the next drawing. Denoting by 1 the smallest number of drawings which produce 
objects of all varieties, find E (2) and V(n). 

12. There isa lot of N objects from which objects are taken at random one 
by one with replacement. Prove. that the expected value and variance of the least 
number of drawings needed to get n different objects are respectively. given by 


Nia : + ‘Homa | 


N N-1 ~ N+=n+1 


1 2 n-1 
and aa oe pet Ss a 
(N-1)* (N-2) _ (N-n+ 1) 


13. Ala a population consits of equal number of individuals of c different 
types. Individuals are drawn at random one by one until at leasts one individual of 
each type has been found, whereupon sampling ceases. Show that the mean number 
of individuals in the sample is 


c eer al 
2 3 c 
and the variance of the number is 
elias ee -¢ bee 4 re 
7 32. c? | 2°30 € 


14. (a) ApointP is taken at random ina line AB of length 2a,, all positions 
of the point being equall y likely. Show-that the expected value of the area of dhe 
rectangle AP.AB is 2a°/3 and the probability of the area exceeding a */2 
i/v2. 

(b) A point is chosen at random on a circle of radius a. Show that the 
expectation of its distance from another fixed point also on the circle is 4a/x . 


(c) Two points P and Q are selected at random ina square of side a. Prove 
that 


E(|PQ|*)=a’°v 
15. Ifthe roolsx;,, x2 of the equation.’ —ax + b= Qare realand bis positive 

but otherwise unknown, prove that 
E (xi) =a and E (x2) = a 


16. (Banach’s Match-box Problem). Acertain mathematician always carries 
two match boxes (initially containing N match-sides). Each time he wants a 
match-stick, he selects a box at random, inevitably a moment comes when he finds 


b 
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a, box empty. Show that the probability that there are exactly r match-sticks in one 
box when the other box becomes empty is 


= 1 
"CNX = 
Prove also that the expected number of matches is 
sal 6: Nx = . -1 


17. n couples procrete independetnly with no limits on family size. Births 
are Single and independent and for the ith couple, the probability of a baby is p;. 
The sex ratio S is defined as 
ei Mean number of all — 
' Mean number of all children 
Show that if all couples, 
(«) Stop procreating on the birth of a boy, then 


sen/ Se 
in1 Pp 


(i) Stop procreating on birth of a girl, then 


oe sei-fe/ 3 <x —|, where gj=1-p1 


ieig 


(iit) Stop procreating when they have childzen of both sexes, then 


S= si_s F /| 3 -n 
12,3 i=l | a pai a" 


18. Show that if X is a ‘random variable such that P (a < X s b) = 1, then 
E (X) and Var (X) exist, anda < E(X) sb and Var(X)s (bl—a)*/4. 

. 19. (X, Y) isa two-dimensional discrete random variable with the possible 

values 0 and 1 for X, and also 0 and-1-for Y, and with the joint probabilities given 

by 


Find the characteristic functions wy (), W(t) and & (t1, tr, for X,Y and 
(X, Y) respectively and show that wp (¢), 2) = Wy (t1) We (¢2) when poo p1i = poi 
Pi0 -1. 
‘20. For aigiven'sequence-| X, of r.v.’s, ‘i 
a ” °C -&% 
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@,.(t, X,) = (sin nt)/nt, 


determine the distribution function of X,. Hence show ‘that even though: the 
sequence of characteristic functions Qn (f) converges to a limit @ (4), the sequence 
of distribution functions does not converge to a distribution function. What is-the 
condition that is violated here? {Indian Civil Services, 1984] 
21. Two continuous variates X and Y have a joint p.d.f with a joint 
characteristic function @ (f1, f2): If gy (x). ds, the marginal density, show that 
p'r (x), the rth simple moment for the conditional distribution of Y given X = x, 
 tisfies the equation 


t'r (x). 8 (x) = 5 4 #6 (4,0) orga OD ins de 
-@ t 
(Indian Civil Services, 1987] 
22. Prove that the real part of a characteristic functions again a charac- 
teristic function. Prove further that if wa. (0) = a; (t) + iby. () and 1 wr (1). =a7(t)+ 
ib, (t) are characteristic functions, then a, (¢t) a2 (t) — by (t) b2 (8) i is a chatactéristic 
function. 


23. Show that for any distrioution 


ffs 3 ae ae 


and hence deduce P[ | X¥-E (X)|>ko] <> 
24. (a) Let X¥ bea random variable with moment generaung function 
M(t), -h<t<h. Prove that 


P(X2za)se"M (0), O<t<h 


where k > 0 and Var (X) = 0’. 


and-that 
P(X sa)se%M(),-h<t<0. 
(b) Let f(x, y) = xe" 9*) x20, y>0 
= 0, elsewhere 
Find moment generating function of Z = XY. 


Hint. mxy (t) = f fe f (x, y) dx dy 
0 0 


=f xe* fe 9) dy dx; 1-1>0 
0 


1 
oa ae rege 


25. The sratabatity of obtaining a 6 with a biased die is p, where 
(0 <p<1). Three players A, B and C roll this die in order, A starting. The first 
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one to throw a 6 wins. Find the probability of winning for A, B and Cc. 


-7on 


rth throw determine the probability generating fanedion of X and hence, or 
otherwise, evluate E (XY) and. Var (X).. 

Hint. Probabilities for the wins of A,B and C are p/(1-q ), 

pq (1—q°) and pq 241 - —q°) respectively. 
P(X=r)=pqd~', for ret. 

The probability generating function of X is P(s) = ps/(1—gqs) , 

whence E(X)=1/p and Var (X)=1/@p’. 

26. Define convergence in probability. Let X1,X2, ... be i.i.d. variates with 
fiw =e 1) x21. Show that Y, > 1 in probability where Y, = Min (X,) : 
1sken. (Indian Civil Services, 1982) 

27. Let{X,,”=1, 2, ... | be a sequence of standardised variates and Con 
(Xm, Xn) = exp [-|m- nla], a>0O and m # n. Show that W.L.L.N. holds 
for this sequence. (Indian Civil Services, 1933) 

28. From the probability generating function (p.g.f.) of two random Vari- 
ables X and Y given by 

P(s,O= exp[- —-A-pw-bt+Astutre dst], 

(i) obtain the marginal p.g.f.’s and identify them, 

(ii) obtain the p.g.f. of X+Y and P(X + Y) =0., and 

(ui) interpret the caseb=0. 


CHAPTER SEVEN 


Theoretical Discrete Probability 


Distributions 


7-0. Introduction. In the previous chapters we have discussed in detail the 
frequency distributions. In the present chapter we will discuss theoretical discrete 
gjstributions in which variables are distributed according to some definite 

robability law which can be expressed mathematically. The present study will 
also enable us to fit a mathematical model or a function of the form y = p(x) to 
the observed data. 

We have already defined distribution function, mathematical expectation, 
m.g.f., characteristic function and moments. This prepares us for a study of 
theoretical distributions. This chapter is devoted to the study of univariate 
(except for the multinomial) distributions like Binomial, Poisson, Negative 
binomial Geometric, Hypergeometric, Multinomial and Power-series 
distributions. 

7-1. Bernoulli Distribution. A random variable X which takes two values 
0 and |, with probabilities g and p respectively, ie, P(X = 1) =p, 
P(X = 0)=4,q =1->p is called a Bérnoulli variate and is said to have:a 
Bernoulli distribution. 

Remark. Sometimes, the two values are +1, —1 instead of | and 0: 


7-1-1. Moments of Bernoulli distribution. The - moment about origin 


WY = EB (X)=0'.g+ lV .p=p;re=l,2,... (71) 
My’ + E(X) = p, Yr’ = E(X’) = p 
_ Ha = Var (X) = p-p? = pq 
The m.g.f. of Bernoulli variate is given by : 
My (th) = ex P(X=0)+e!!. P(X=l=q+pe' ...(7-1a) 
Remark. Degenerate Random Variable. Sometimes we may come 
across a variate X which is degenerate at a point ‘c’, say, so that: P (X =c) =] 
and = 0 otherwise, i.e., the whole mass of the variable is concentrated at a single 
point “c?. 
Since P (X=c)= 1, Var (X)= 0. 
Thus a degenerate r.v. X is characterised by Var (X) = 0. 
M.g.f. of degenerate r.v. is given by 
My (t) = E (e'*) = e P(X =c) =e .. (7-15) 
7-2. Binomial Distribution. Binomial distribution was discovered by James 
Bernoulli (1654-1705)-in the year 1700 and was first published posthumously in 
1713, eight years after his death). Let a random experiment be performed 


repeatedly and let the occurrence of an event in a.trial be called a success and its 
hon-occurrence a failure. Consider a sect of n independent Bernoullian trials (n 
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being finite), in vhich the probability ‘p’ of success in any trial is constant for eac}, 
trial. Then g = 1 —p, is the probability of failure in.any trial. 

Tie probability of x successes and consequently (n — x) failures in 7 inde. 
pendent trials, in a specified order (Say) SSFSFFFS..FSF (where S_ represents 


success and F failure) is given by the compound probability theorem-by the 
expression : 


P (SSFSFFFS...FSF) = P(S)P(S)P(F)P(S)Pi FPP) x 
. x P(PIP(S)P(F) 
=P-P-O29-4-4-4-P +. 4-PG 
PPP 1 FG oon pq 
© factors} {(n —x) factors! 


, : . {nm a 
- But x successes inn trials can occur in - .ways and the probability for each 


of these ways is p“'q"~*.. Hence the probability of x successes in n trials in any 
order whatsoever is given by the shi theorem of proba bility by the expression: 


The probability distribution or the number of successes, so, o obtained i is Called 


the Binomial probability distribution, for the obvious réason that the- probabilities, 
of 0, 1, 2, ..., m successes, viz., 


q i; , gi” x P; (2 2 | tp gq’? i? , are the successive terms of the bino- 
mial expansion (q + p)”. 


Definition. A random variable X is said to follow binomial distribution if 
it assumes only non-negative values and its‘probability mass function ts given by 


p 7 *;x= 0,1,2,..,2;q=1-p --o(7-2) 
P(X = x) = p(x) = e \p 
0, otherwise 


. The.two independent constants n and p in the: distribution,are known as the 


parameters of the distribution. ‘n’ is also, sometimes, known as the, acerce of the, 
bizomjal distribution. ‘g 


Binomial distribution is a discrete distribution as X can take ‘ily the integral 


values, viz., 0, 1, 2,..., ”. Any varjable which follows binomial: distribution is 
known as binomial variate. 


We shall use the notation X ~ B(n, p) to denote that the random variable 4, 
follows binomial distribution with parameters n and p, 

The probability p(x) in (7-2) is also sometimes denoted by-b(x, n, p). . 

Reniarks 1. ae assigntbeit of probabilities 1 is permissible because 


pal= 2 z( |e" qf -*#(q+p)"=1 


x=0 
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2. Let us suppose that 7 trials constitute an experiment. Then if this 
experiment is repeated N times, the frequency function of the binomial distribu- 
tion is given by 


fix) = Np(x)= N[" | p'q"*3x= 0,1,2,..,0 a2) 


and the expected frequencies of 0, : 2, ..., n Successes are the successive terms of 
the binomial-expansion, N (q + p)", q +pe 1. 

3. Binomial distribution is important ‘not only because of its wide ap- 
plicability, but because it gives rise to many‘other probability distributions. Tables 
for p(x) are available for various Values of'n_ and’ P- 


4. Physical conditions for Binomial Distribution. We get the binomial 
distribution under the following experimental-Conditions. 

(i) Each trial results in two mutually disjoint outcomes, termed as success 

and failure. 

(ii) The number of trials ‘n’ is finite. 

(iii) The'trials are independent of each other. 

(iv) The probability of success ‘p’ is constant for-each trial. 

The problems relating to tossing of a coin or throwing of dice or drawing 


cards from a pack of cards with replacement lead to binomial probabability 
distribution. 


Example 7-1... Ter coins are thrown simultaneously. Find the probability of 
getting at least seven heaas- ; 


Solution. p.= Probability of getting a head = 
q = Probability‘of not getting a ies =< 
The probability of getting x beads ina random jinowiBt 10 coins is 


10) (1 19° ft0) (1° 
oF Cem 


.. Probability of getting at least seven; heads is given by- 
P(X = 7) = p(7) + p(8) +' pQ) + p10) : 


(3) (8) 9(2) (8) (i a 
| 2 7 8 9 10 

_ 120+ 45 +10+1 176 

> 1024 ~ 1024" 

Example 7-2. AandB play a game in which their chances of winning are. 
in the rdtio 3 : 2. Find A’s chance of witining at least three games out of the ft ive. 
games played. ' [Burdwan Univ. B.Sc. (Hons.), 1993] 

Solution. Let p be the probability that ‘A ’ wins the game. Then we are 
given p = 3/5 => q= 1-p= 2/5. 

‘Hence, by binomial probability law, the probability te out of 5 games 
played, A wins ‘r? games is given by : 
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PX =r) = plr)= (375) (2/5)5-' sr = 0, 1,2, ...,5 


The required probability that ‘A ’ wins at least three games is given by : 


5 Fr aS-r 
P(Xz3)= 5 ( A oe 


ra3\ 7 5 
3 1/5\ 2 (5 2.4) _ 27x (40+ 30+ 9) 
as 3 (3) 2 + (;}-3*24 1.3 <1] = 3125 = 0-68 


Example 7°3. If m things are distributed among ‘a’ men and ‘b’ women, 
show that the probability that the number pf things received by men is odd, is 
1 (b + a)” —(b - a)” 
2 | (b +a)” 
(Nagpur Univ B.Sc., 1989, ’93) 


Solution. p = Probability that a thing is received by man = —— » then 


+ b 


¢ is the probability that a thing is received by 


a __ob 
atb a+b 


G2 ep ue 


woman. 

The probability that out of m things exactly x are received by men and the 
rest by women, is given by 

P(x) =" Cy pq” *; x = 0,1,2,...,m 

The probability P that the number of things received’by men is odd is given 
by 

P = p(1) + p(3) + p(5) +... "Cig" | pt7C3:qQr p+ "Cs° qh op +... 

Now 
(q+py aq +"Cy qh! pt "Ca-qh*p+"C3-q™ os p+ "Cy Qn" pt... 
and 
(q-p)" =q"~"Ci-q""' -p+"C2-qn*-p —"C3.q" > ip + "Ca: q™ "+p -... 

(q+ py" -(q—py" =2 ("Ci q” "p+ "C3-q” +p +..]= 2P 


b-a 
But q+p=landq-p=7—— 
m : m m 
, (2-4 =?2P => Pat (b + a)" — (b - a) 
b+a 2 (b + a)” 


Example 7-4 = An irregular six faced die is thrown and. the expectation 
that in 10 throws it will give five even numbers is twice the expectation that it will: 
give four even numbers. How many times in 10,000 sets of 10 throws each, would: 
you expect it to give no even number. (Gujarat Univ. B.Sc. 1988) 

Solution. Let p be the probability of getting an even number in a throw 
of a die. Then the probability of getting x evén numbers in ten throws of a die is 


P(X = x) -( | p g?*3x = 0,1,2...10 


| 
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We are given that 
P(X = 5) = 2P(X = 4) 


10 10 5 
L.é., [sa = 2/ a)e¢ 
- 10!p , !g 
515! 4:6! 
P4241 
= 5" 63 


3p25q=S(i-p) => 8p=5 => p#=5/8andq=3/8 
10) (5 /3\ 
ince x} | 8 7 
Hence. the required number of‘times that in 10,000 sets of 10 throws each, 
we get no even number 
10 


= 10,000 x P(X = 0) = 10,000 x = = 1 (approx.) 


Example 7-5 Ina precision bombing attack there is a 50% chance that 
any one bomb will strike the target. Two direct hits are required to destroy the 
target completely. How many bombs must be dropped to give a 99% chance or 
better of completely.destroying the target ? [Gauhati Univ. M.A,, 1992] 

Solution. We have: 

p = Probability that the bomb:strikes the target = 50% = : Let n be the 
number of bombs which should be dropped to ensure 99% chance or better of 
completely destroying the target. This implies that “probability that out of 2 bombs, 
at least two strike the target, is greater than 0-99". 


Let X be a r.v. representing the number of bombs stnking the target. Then 
X ~B(n, p = 5) with 


noma 2) (EY (2)( 8] 


We should have : © 
P(X 2 2)2 0-99 
[1 - PXs 12 0-99 
[1 ~ [p(0) + p(1)}] = 0-99 


2-43) 


> 0012 => 2x (O01lhe len 


4 § 


=> 2" = 100+ 100n -»(*) 


By trial method, we find that the inequality (*) is satisfied by n = 11."Hence 
the minimum number of bombs needed to destroy the target completely is 11. 
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T6 Fundamentals of Mathematical Statistics 


= ‘Example, 7-6, A department ina works has 10 machines which may neeg 
adjustment from time to time during the day; Three of these machines are old, each 
having a probability of 1/11 of needing adjustment during the day, and 7 are new, 
having corresponding probabilities of 1/21. 


a Assuming that no machine needs adjustment twice on the same day, deter. 
mine the probabilits that on a particular day 


(i) just 2 old and no new machines need adjustment. 
(ii) If just 2 machines need adjustment, they are of the same type. 

~ (Nagpur Univ. B.E., 1989) 
Probability that an old machine needs adjustment 


Solution. Let p, 


= 1/11 
qa = 1-p; = 10/11 
and = ._-p2 = ‘Probability thata new machine needs adjustment = 1/21 


@=1—p2 = 20/21 
Then Pi(r) =, _.Probability that ‘r’ old machines need adjustment 
= 8C,pigi-’ = °C, (10/11)? (4/11)" 
and — Pxr) = Siaecne that ‘r’ new machine need adjustment 
= 'C,p)qh7” = 'C-(1/21Y (20/21)'~" 


(i) The probability that just two old machines and no new machine need 
‘adj justment is given (by the compound probability pel by the expression ; 


_ P\(2)- P20) = °C2(1/11)’ - (10/11) ¢ (20/21)’ = 0-016 
+ (ti): ‘Similarly the probability that just 2 new machines and no old machine 
sie adjustment is - 
P,(0) - P(2) = (10/11)? - "Co(1/21)" (20/21) = 0:028 
” “The probability that "If just two machines need adjustment, they are of the 


same type” is the same as the probability that “either just 2 old and no new or just 
2, new and no old machines need adjustment”. 


--. Required probability = 0-016 + 0-028 = 0-044 


; r: 2-1 \Moments. The first four moments about origin of binomial dis- 
; ‘tribution are obtained as follows : 


‘hi = E(X) = 5 ae rem =np <= ft oe aa f'-** 
x=0 x xray) *%- 
= np(q + p)"~* = np _ (09g +p=1) 


Thus the mean of the binomial distribution is. np. 
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yo’ = BUA) = 5 alr]? d'~* 


x(x - 1) x-2 
n a) : 
> |” 
2 aes P 
n(n - 1)p°(q + p)"? + mp = n(n — 1)p* + np 


wR EO) ad oT) ee 


a 
3 ea - D+ ean (5 2 | as" 


—x 


n(n - 1)p" 


x=0@ 


5 lx Qe - 1) (x — 2) + 3x (x - 1) + x| pq’ ~* 
0 


x = 


n(n 1)(n = 2)p> EF (; -3| pg 


x 


x 


+ 3n(n—1)p? E = 2) P prog’ * + 
x=22 


n(n -1)(n - 2)p° (q + py" > + 3n(n - 1)p’ (q + p)’*? + mp 
n(n — 1)(n - 2) p° + 3n(n ~ 1) p* + mp 
Similarly 
x* = x(x—1)(x -—2) (x-3) + 6x (x-1) (x= 2) + 7x (e- 1) +x 
 Letx? = Ar(x— 1) (x —2) (x-3) + Bx (x -1) @-2) + Cx(x-1) +x 
By giving to x the values 1, 2 and 3 respectively, we find the values of 
arbitrary constants A, B gles C. Therefore, 


us’ = E(X*) = > (, eee 


x=0 


= n(n—-1)(n- 2)(n— 3) p* + 6n(n— 1) (n- -2)p° + 7n(n— 1) p’ +'np 
[On simplification] 
Central aril of Binomial nana 


M2 = po’ — pi’? = np? md + mp — n’p® = np(1—p) = npq 

M3 = w3' — 3p2’ wy’ + 2’? 
= {n(n — 1) (n-2) p? + 3n (1-1) p? + np| —3 n(n — 1) p? + np| np +2 (np)? 
= np [- 3np’ + 3np + 2p’ ~ 3p + 1 - 3npq) 
= np [3np (1.— p) + 2p’ 3p-+ 1 - 3npq) 


78 Fundamentals of Mathematical Statisti,, 


np (2p? — 3p + 1] = np (2p’ ~ 2p + q) = npq(1 — 2p) 
npq(q + p — 2p] = pq ae — p) 


pa = pa’ — 43’ pi’ + Ope’ pi 2 341’ * = npq[1 + 3 (n - 2)pq] 
[On simplification) 
Hence 
2.2 2 
6, = 3 = np ga — py ~ Cae .(7-4) 
2 a pq — 

; u, npq {1 +3 (n -2) pq} 143 (n—-2) pq : 1 -—6pq (7.5) 

20-2 -.ccrwl”" 4.uUa "= - - son LP 

aa n pg npq npq 

vee CoP WLP ig 3 LEE 7s 

vi = VBr Vag Wap B2 - 3 "pq ...(7°5 a) 


Example 7-7 _ Comment on the following : 

The mean of a binomial distribution is 3 and variance ts 4’. 

Solution. Ifthe given binomial distribution has parameters n and p, then 
we are given 

Mean= np - 3 -.(*) 
“and Variance = npq.= 4 -o**) 

Dividing (**) by (*), we get q = 4/3, 
which is impossible, since probability cannot exceed unity. Hence the given 
‘statement is wrong. 

Example 7-8. T/te mean and variance of binomial distribution are 4 and! 
respectivély. Find P(X 2 1). (Sardar Patel Univ. B.Sc. 1993) 
7 Solution. Let X ~B (n, p). Then we are given 

Mean = E (X) = np = 4 
and Var (X) = mpq = = 


Dividing, we get 
= = Be. 


(*) 


Substituting in (*), we get 
5g ee 


‘P(Xz lel - P(X =0) =1-¢q' = 1 - (1/3) = 1 - (1/729) 
= 1 — 0:00137 = 0-99863 
Example7:9 [f X ~B (n, p), show that: 


a 
E [2] = Fs cont, SA} 
n n n n n 


(Delhi Univ. B.Sc., 1989) 
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Solution. Since X ~B (n, p), E (X) = np and Var (X) = npq 


[a] n em rvs \- 3; Var (X) = 24 
5 eff) -eff-e(2] (a2 
won eat) Ee te 
(El (-f)-0- 
el(r-e)i-(a-? 


n 

7-2-2 Recurrence Relation for the moments of Binomial Distribution. 
(Renovsky Formula) 

By def., 


Mr = E (XX = E(X)}’ = in (x — npy t ra 


‘Differentiating with respect to p, we get 
du, - n r-l x nex 
Dp Zol 2 | [me - pq. 
-1 m-x -x-1 
+ (x — npy {xp*"' q"* — (n — x) pd! i 
-m = : (x — np" p* 
xro0\ * 
~ {n* Z n-—-x 
+ — np) pf * j= -— 
2; J py P* {3 ; 


= =n 5 (x - mpy"'p@) + = (x — mpy px) SP . 


xaQ 


1 + r 
Suite S (x '— np! p(x) + — E (x — npY** pe) 
Pq x=(0 


oe arp 1+ — : Ur +1 
dp T= Pq + 


7 du, (7° 6 
= Ur+1 = Pq [rrve-s + 7 ( ) 


Putting r = 1,2 and 3 successively in-(7- 6), we get 
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*. = 1 and = 
nrlm a = Pq” evs td 


du2)_. dimpqg)_ ad 


i ll iad 

, dys 

and ss: wah oi a npq + —- i (mPa (4 ~ a 
= pa | 3 pa+ny <int-n0-m] 


= pa | nak o-¥ hee 


= pq (3n 2 pq /'s n (1 — 6p + 6p*)] = pq[3n* pq + n(1 — 6pq) | 
=npq[3np¢+ 1- 6pq]= npq[1+ 3pq(n-2) ] 


~ Example 7-10 Show that the rth moment ,' about the origin of the 
binomial distribution of degree n is given by : 
| Mr = | p 2) (q + p)" --(*) | 
dp [Patna Univ. B.Sc. (Hons.), 1993] 


Solution. We shall prove this result by using the principle of mathematical 
induction. We have 


n ae 3 n ae _ 
q+tp= x" | p= ap atPy = Eig 
mee x x=Q x 
"Op 
Thus the result io is true forr = 1. 
Let us now assume that the result (*) is true forr = k, so that 


k 
sc: Mea 3 (7 )otgt* --.(**) 
P35} +P) = ue 2{ \p fx 


Differentiate Gi *) partially w.r. to p and multiply both sides by p to get: 
k 


9 a : o{n\ © nex bel ke] 
= . —— = z = 
(SZ) |(e&) oro] F(t) race =ece) 
3 k+l 
= [p a. (q + py = pea 
' Hence if the result (+*) is true forr = k, itigalso true forr = k + 1. Itis 


already shown to be true for k = 1. Hence by the principle of mathematical 
induction, (#), is true for all positive integral values of r. 


a Sea ; 
Zarpt=p = :\t *xp 3" \er x= py’ 
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7-2-3. Factorial Moments of Binomial Distribution. The rth factorial 
moment of the Binomial distribution is: 


te Ol. 5 (ey (r) Snes 
wi E|~ E xp(aye Es Tori P pied 
r n—vr)! ~ es r p 


xr 
=n of (7° 7) 


W1) = E |x = np = Mean 
way = EXO} = nO phe n(n — 1p 
wa) = EXO} = n® ro ~1)(n- Dp 
Now # (2) = WQ2) — may” + Ha) = n° p ‘- np” = np’ + np = npq 
M3) = Hey — 3H’ Bay + 2H)” - 2n@)’ 
=n(n— 1)(n- 2) p> —3n(n—-1) p’ np + 2n’ p> —2np = — 2npq (1+ p) 


[On simplification] 
7-2-4. ‘Mean Deviation About Mean of Binomial Distribution. 


The mean deviation n about the mean mp of the binomial distribution is given by 
n 


EZ |x - mpl pe) = = |x - mpl (r\er 
0 x20 x 


X= 


n 
(x being’ an integer) 
xX KNX 


p'q'-* + zx me) [ | a 


x= np 


i 
Ms 
Fo 
> 
| 
3 
SS 
www 
[as 
“oy 
ed 


it 
Te) 
M3 
— 
| 
~ 
S 
ww 
x oy 
“ee” 
By, 
’ 
* 
* 


= 2 z (x — np) é ‘laa, 
where is the greatest integer contained innp + 1. 


a2 [ham - 20} (5) a 


a) “ at cad es she gas 
‘i (x-1)tQ(r—xt? x!(n—-x—!? 
* Er (rlee = np 
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: - n! +1 
=2 2. [teaq = , where t, = ——————___ n~x 
pa | er’ xt(n-x-ynih 4 


=2 [tw-1—-& [= 2%-1 


This is obtained by summing over x and using ft, 8 O 


n! 
Sq) = 2 = 2 eee tg 
eat ie 
-1 -1 p- (7: 
= 2npq oe p* a (7-8) 


7-2-5. Mode of the Binomial Distribution. We have 


oe (; | 7 q’-*/ t a | oie! 
n‘ -x n} -x+ 
“ (n =yiet Po l Ge Tas rei? “¢ 


_~M@ax+ lp _x¢4+m-x+ Ip-xq 


xq xq 
, m+ Dp-xP+gd_,, + Dp-x (79) 
xq | xq 


Mode is the value of x for which p (x) is maximum. 
We discuss the following two cases : 
Case 1. When(n + 1) p is not an integer 


Let (n + 1)p = m + f, where m is an integerand f is fractional such that 
0 < f < 1. Substituting in (7- 9),'we get 


_p&) . on (m +f) -—x mb 
p(x - 1): xq 
From (*), it is obvious that i 
_p&) > 1 for x = 0, 1,-2,...,m 
p& - 1) 
and be ED oe he pace a 
pe >t) Pp p (m) 
p(l) (2) __p(m : 
- po)” * pti)” °" p(m-1)” ” 


ang 40s PM +, PM +2 2, pM ., 
m 


.. p(G)<p(1) <p(2)<...<p(m—-1) <p(m)>p(m+1)>p (m+ 2) 
>p (m+ 3) ...>p(n), 
__ ‘Thus in.this-case.there exists unique modal value for binomial distribution 
and itis’ m, the integral part of (n + 1) p. 
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Case II. When (n + 1)p ts an integer. 
Let (mn + 1)p = m (an integer). 
Substituting in (7-9), we get 


p(x) _ 14 "7% a(**) 
p(x — 1) xq 
From (**) it is obvious that 
>1lforx = 1,2,..,m-1 
peop f Therese 
p(& - 1) <1 forx = m+1,m+2,...,n 


Now proceeding as in case 1, we have : 
p (0) <p(1)<...<p(m-1)= p(m)>p(m+1)>p(m+ 2)>...>p (n) 


Thus in this case the distribution is bimodal and the two modal values are 
m andm — 1. 


Example 7-11. Determine the binomial distribution for which the mean 
is 4 and variance 3 and find its mode. (Madurai Kamraj Univ B.Sc. 1993) 

Solution, Let X ~B (n, p), then we are given that 

E(X) = np=4 --(*) 
and: Var (X) = npq = 3 ..(**) 
Dividing (**) by (*), we get 
Qq=5 => pel-q=; 
Hence from (*), n= == 16 
Thus the given binomial distribution has parametersm = 16 and p = 1/4. 


Mode. We have (n + 1)p = 4-25, which is not an integer. Hence the 
unique mode of the binomial distribution is 4, the integral part of (x + 1) p. 


Example 7-12. Showthat for p = 0-50, the binomial distribution has a 
maximum probability at X = sf, ifn is even, andatX = + (nm — 1) as wellas 
X = : (n + 1), ifn is odd. (Mysore Univ,, B. Sc. 1991) 
Solution. | Here we have to find the mode of the binomial distribution. 
(i) Leta beeven= 2m, (say), m = 1,2,... 
.. Ifp = 0-5, then(m + 1) p = (2m + 1) x (3) =m + 0°5 
Hence in this case, the distribution is unimodal, the unique mode being 
at X = m = n/2. 

(ii) Let n be odd =(2m + 1), say. Then 
(n + 1)p = (2m + 2) x 5 = m + 1 (Integer) 


ated 
2 2° 
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Since (m + 1) p is an integer, the distribution is bimodal, the two modes 
being i (m + 1) and 3(n+1)-1 = s(n-1). 


7:2-6. Moment Generating Function of Binomial Distribution. Let 
X .be a variable following binomial distribution, then 


My (t) = E (e'*) -5 (rere > were" (7 | (q+ pe')" 
x=0 x x=0 x 


.(710) 
M.G.F. about Mean of Binomial Distribution : 
E| ef(X-mP} E(e*e~"") = e7 iP : E(e*) 7 e me ; Mx(t) 
=e" (q + pe)” = (qe'™ + pe't)" (7-11) 
- [aft p+ BF - ee, ee + 


2 Pg 
f 
-[1+ 3; p04 x Pa ial sia + P+ 


[ p 
=|. fem 31 PI - p)+ FyPa(t ~ 300) +} 


re 4 
f 
1+ (1) {2 "Pq + 37Pa(9 ~ p) + qppail ~ 3a) +} 


2 


(3 } [aren + Free - p) +,. ft 
2 


Now 2 = -Coefficient of a1 7 "Pq 
e 


u3 = Coefficient of = npq(q — p) 


4 = Coefficient of a = npq(1 - 3pq) + 3n(n - 1)p'q 


npq(1 — 3pq) + 3n? p” gq — 3np* q 
= 3n’ pq’ + npq (1 - 6pq) 
Example 7-13 X ts binomially distributed with parameters n and p. What 


is the distrbution of Y=n- x 7 [Delhi Univ. B.Sc. (Maths Hons.), 1990] 
Solution. X ~B(n,p), represents the number of successes in n_ inde- 


pendent trials with constant probability p of success for each trial. 
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. Y = n-— X, represents the number of failures inn independent trial with 
constant probability ‘q’ of failute for each trial. Hence Y= n—- X ~B(n, q) 


Aliter Since X ~B(n,p), My(t) = E (e'*) =(q + pe')” 
My(t) = E(e") = E (e""™) 
= &-E(e“) = &” Mx(- 2) 
= a -(g + pe)" 
n n 
ete | +e ee 
Hence by uniqueness theorem of m.g.f., Y = n — X ~B(n,q) 
Ly) 


Example 7-14. Them.gf ofar.v.X ts E + Ze - Show that : 


5 ¥ « 9-x 
9 1 2 
Piw 20 <X<u+20)* 2 (2) (5) ($ 


[Dethi Univ. B.Sc. (Maths Hons.), 1989] 


9 
Solution. Since Myx (¢) = (3 + 3¢| ” (4 + pe’) 


by uniqueness theorem of m.g.f. X ~B & = 9, p= 3) 


Hence E (X) = pr = np = 3; ok = mpg 2 9xax5=2 


ut 20 =3+2x V2 = 3 +2 x 1:4 = (0-2, 5-8) 
. P(p - 20 <X <p + 20) = P (02 < X¥ < 58) = P (1s Xs 5) 
5 5 
= Lp(x)==r “Cp” 
xel 1 


rea 


5 : 
= © °C, (1/3) (2/3)"* 
7:2:7. Additive Property of Binomial Distribution. Let X ~B (7, p1) 
and Y ~ B (n2, p2) be independent random variables. Then 


Mx (t) = (qi + pie)", My (0) = (qz + pre)” .(*) 
What is the distribution of X¥ + Y? 
We have 
My.sy(0) = Mx(0)-My(0 [- X and Y are independent] 
= (9. + pie)" -(q2 + me)” + **) 


Since (**) cannot be expressed in the form (q + pe)", from uniqueness 
theorem of m.g.f.’s it follows that X + Y is not a binomial variate. Hence, 
in general the sum of two independent binomial variates is not a binomial variate. 
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In other words, binomial distribution doés pot possess the additive or reproductive 
property. 

However, if we take pi = p2 = p (Say), then from (**), we get 

Mx+y(t) = (q + pe)"*”, 

which is the m.g.f. of a binomial variate with parameters (m1 + 72, p). Hence, by 
uniqueness theorem of m.g.f.’s X + Y ~B(m + m,p). Thus the binomial 
distribution possesses the additive or reproductive property if p, = p2. 

Generalisation. /f Xi, (i = 1,2, ...,k) are independent binomial variates 


k k 

with parameters (ni, p), (i = 1, 2, ..., k) then their sum & X; | = Ni, P| 
ial 

The proof is left as an exercise to the reader. 

Example 7-15. Ifthe independent random variables X,Y are binomially 
distributed, respectively withn = 3, p = 1/3,andn = 5, p = 1/3, writedown 
the probability thatX + Y 2 1. 

Solution. Weare given 

X ~B (3, 3) and Y ~B (5, 7 

Since. X and Y are independent binomial random variables, with 

Pl = p2=5 +, by the additive property of binomial distribution, we get 


X+Y~B(3 + §, 3)» e., X+Y~B (8, 3) 
P(X+Ye=r)=*C,(4) (2° -»(*) 
Hence P(X +Y21)=1-P(X +Y < 1) 
=1-P(X + Y = 0) 
1-(2/) 
7-2-8. Characteristic Function of ptemee Distribution. 
e fn ° 
ox() = E(é) = E e&® p(x) = z elt & | ea 
x=0 


> el ‘s (pe"y q” Ses (q + pe'y" ae(/* 12) 
x=0 
7-2-9. Cunmmulants of the Binomial Distribution. Cumulant generat- 
ing function is 
Kx (t) 


log Mx (t) = log (q + pe)” = nilog(q + pe’) 


ae 
4! 


, F re 4 
nlog|1+plt+a 7 +a, + aq, t 


( Pe P 4 
moglasp (140474 3,7 +} 
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3 ? 4 2 
Elen egt Elena | + | 


Mean=\1 = Coefficient of ¢ in Ky (0) = np 


U2 = K2 = Coefficient of in Ky (t) =n (p —p*) = np (1 -p) = npq 
The coefficient of F in Kx (t) 
= |F-5 E. 2: ntS|- aa (1 - 3p + 2p’) 


. K3 = Coefficient FT ~in Ky(.) = mp (1 ~ 3p + 2p’) 


= np (1 ~ p)(1 - a = npq(1 — p — p)- = npq(q — p) 
U3 = k3 = mpq(q — p) 
The Coefficient of ¢* in Kx (t) 


Pp tle +a)+oa-4 
4! 2:(3:°4)*° 3 21° 4: 


a “P [1 — Ip + 12p* - 6p" 


4 
*.. K4 = Co efficient of ra in Ky(t) = np (1 — p)(1 — 6p + 6p’) 


= npq[1 — 6p (1 — p)| = npq (1 — 6pq) 
. 4 = Ka + 3x3 = npq (1. — 6pq) + 3n* p* 
= npq (1 — 6pq + 3npq) = npq [1 + 3pq(n - 2)] 
72-10. Recurrence Relation for Cumulants of Binomial Distribu- 
tion. By def.,, 


c= [Ste meio] nS i + ed) 


a errors 
f 
= dé qt pe 


| a ee (rare 
dp an Gee rar 


d’*} 
Kr+1 "| om ras log’ a+ re] 


f=0 


nis ee ioaeiare pe) space 
af a 8 ol ae 


7:18 Fundamentals of Mathematical Statistic, 


en ee 
en q + PE } jug LW \9 + PE) ho 


ey ane | oe ~ npq | & [$= 
one dp df \ q+ pe = dt \ q+ pe - 


id q + pe 
nS lesa -elZo 
t=0 ‘ ta0 
Guise - ..(713) 
In particular, 
K2 = pq° = = pq’ z (np) = npq (.- Ki = mean = ap) 
Px2 pg 40D ~ npg (q - p) 


WP a 


d 
ka = Pq a = Pq’ oe (mPa @ - Pd 
v1 pl (1 - p) (1 - 2p)| 


= mpq + 5, 0 - 39° + 2p’) = npq(1 - 6p + 6p’) 


= npq [1 - 6p(1 ~ p)} = mpq (1 — pq) 
7-2-11. Probability Generating Function of Binomial Distribution 


P(s) = S P(X eb ste f : =, | ws a gq’ = (ps + 
xaQ 


x = 0 


(0: ‘13 a) 
The fact that this generating function is mth power of (q + ps) shows that 
. p(x) = { b (x; 1, p) is the distribution of the suntS, = X, + X2 + ... + X, of 
n random variables.with the common generating function(q + ps). Each variable 
X, assumes the value'O with probability g and 1 with probability p. 

Thus [B (ks, pt = {b(ks Lp} ..(7- 136) 

Let X and Y be two independent random variables having b (k ; m, p) and 
b (k;n, p) as their distributions, then 
Py (s) = (q +-ps)” and Py(s) = (qg + ps)" 
ne Py.y(s) = (q+ ps)" (q + ps)" = (q + ps)”*" 
*. |b (k; m, p),}* Fb (k5n, p)} = [b(k3m + n,p)} ..(t 13¢) 
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Also jay’ = [n(q + ps)" p]sai = mp 

Hay’ = [n(n — 1) (q + ps)"~7 psa = n(n - 1)p’ and soon. 

yy’ = [n(n -— 1)...(n — 7 + 1) q + ps)" "p'sa1 
=n(n-1)...(n-—r+1)p 

Example 7-16 Show that 


1 
E (ya) ds a>0O 
where G (t) is the probability generating function of:X. 
Find it when X ~ B(n, p), and a = 1 
" [Delhi Univ. (Stat Hons.) Spl Course, 1988] 
1 


.o(*) 


i 
Solution, R.H.S.= f1°°'-G()de = f 177! (Er’) de 
0 0 


1 


i 
“ae eal! dt= Z| pf ite de 
0 x x 


0 
‘1 1 
~ > Ps (x + a) oe Cared 
n (**) 
IfX B(n,p), then G(t) = = epy = (g + pt)” 
: x=0 


Hence taking a = 1 in(*) and using (**), we get : 


1 
; 1 _ , _{l@+py*' |) 1-¢! 
elaral "form a = (n+1)p lo (n+ 1)p 


7-2:12. Recurrence Relation for the Probabilities of Binomial Dis- 
tribution. (Futing of Binomial Distribution). 


t 


We have 
F n 1)? gg 1 
p(x +1). _ 
Pp (x) Aue 
()? q 
_n-x p (On simplification) 
x+1 q 
Pay Pn a (ES oP ; .(7> 14) 
p(-+ 1) EEE Aldon 


which is the required recurrence formula. 


This formula provides 1 us a Very convenient metliod of | graduating the given 
data by a binomial distribution. The only a we need to calculate is P (0) 


- 
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which is given by p (0) = q", where q is estimated from the given data by equating 
the meanx of the distribution to np, the mean of the binomial distribution. Thus 
Pp =x /n. 

The remaining probabilities, viz., p (1), p (2), ... can now be easily obtained 
from (7: 14) as explained below : 


pa)= (pees Dieo= (F552) pe 
x=0 


1 4q 
PQ) = (p+ Weis (E58) p(1) 
xa] 
PO= [P+ Dan (4-2) ve 
xa2 


and so on. 
Example 7:17. Seven coins are tossed and number of heads noted.-The 
experiment is repeated 128 times and the following distribution is obtained: 


No. of heads 


Fit a Binomial distribution dssuming 

(i) The coin is unbaised, 

(ii) The nature of the coin is not known. 

(iii) Probability of a head for four coins is 0-5 and for the remaining three 
coms is 0-45, 


Solution. In fitting Binomial distribution, first of all. the mean and 
variance of the data are equated to np and mpq respectively. Then the expected 
frequencies are calculated from these values of m and p. Here n=7 
and N = 128. 


Case I, When the coin is unbaised 
| = > (p/q = 1) 
Now p(0) = q" = (4)' = (1/128) 
f(0) = Nq" = 128(1)' = 1 


Using the recurrence formula, the various probabilities, viz., p (1), p (2)... 
can be easily calculated as shown below. 


Expected frequency 
F(x) = Np) 


f{0) = Np (0) = 
fa) =1* 7=7 
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f2)=7x3=21 


f(3) = 21 x 


2 
3 
1 


f(4) = 35 x 
f(5) = 35 x: 
f(6) = 21 x : = 7 


1 
f (7) = 1x 7™ 1 
Case Il. Whenthe nature of the coin:is not known, then ' 


e Rn 
np = ‘ E fimi= Gg = 33828in=7 


p = 0- 48326 and q = 0:°51674, (p/q = 0- 93521) 
f(0) = Nq’ = 128 (0- 5167)’ = 1°2593 (using logarithms) 


N =X < Expected frequency 
r+ f(x) = Np) 
6° 54647 | f(0)=Np (0) = 1-2593 = 1 


2: 80563, | f(T) = 1:2593 & 6 ACA = §&- noe = = 8 
1-55868 |f(2) = 2-80563 = 8- 2438 = : 23: 129 = 23 


Ww ss) 


tod | tn 


0: 93521 | (3) = 1: 55868 x 23-129 = 36-05 = 36 
0: 56113 | (4) = 0-93521 x 36-05°= 33-715 = 34 


1 
2 
5 


e r 
0°31174 f(5) = 0: 56113 x 33-715 = 18-918 = 19 


sal ole 


0:13360 | £(6) =.0-31174 x 18-918 = §-897 =6 , 
{(7) = 0- 13360 x 5-897 = 0-788 = 1 


The probability generating functions-(p.g.f.), say Px (s) for the-4 coins and 
Py (S) for the remaining 3 coins are giveniby; . 


Px (s) = (0-50 + 0-50s)*, Py(s) = (0°55 + 0: 45s)? ...[¢f: 7° 13 (@)] 


Since all the throws afe independent, the p.g.f. Pxsy(s) for the whole 
experiment is given by 
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Px.y(s) = Px(s) Py(s) ..[e,f. 7- 13 (b)] 
= (0-50 + 0-505) (0-55 + 0° 45s)° 
= (0: 0625 + 0:25s5 + 0:375s7 + 0-255° + 0: 0625s') 
x (0- 166375 + 0: 408375 s + 0° 334125 s* + 0- 091125 s°) 
Now f(x) = Nx coefficient of fin Py.y(0 
*. ‘f() = 128 x -0625 x - 16637 = 1-13310 
f(1) = 128(-25 + - 166375 + - 408375 x -0625} = 8-5910 
f(2) = 128 - 28396 | = 36-3470 (5) = 128 {- 14602} = 18-6934 
f(3) = 128 |- 184117} = 23-5669 f(6) = 128/- 04366] = 5- 5889 
f(4) = 128 |- 260570} = 33-3529 f(7) = 128 | -005695} = - 72896 
Example 7-18. LetX and Y be independent binomial variates, each with 
parameters n and p. Find P (X —Y = k). (Calcutta Univ. B.Sc., 1993) 


Solution. Since.each of the variables X¥ and Y takes the values 0,1,2,.., n, 
Z=X-Y takes onthe values — n, — (n — 1),.., ~ 1,0,1...,2 


PZ=kh= © PW=ke+ernY zr) 


re0d 


n 
= 5 P(X= k+r)-P(Y=r) (“X and Y are independent). 
raQ 


‘= : n ker. -k-r n r ay 

soled) Pt q (Tee 

“2 n eek W-2-k -(*) 
= Ze r P q 


where k&— 7, — (n= 1), ..., } —T, 0,1, 2, 3 and q= 1— p. 
In particulat, we have : 


r=(0 


P(Z=0)= 5 (7)? ae 


n 


t=: ‘i nN) ron 3n-e® _ an 
PZ = —n) = —~-ner|ir|? q BP’, 


\ 
because ‘we get the result when r =n and for other values of r <n, 


2 is not defined and hence taken as 0. 
—-nA+r 


Example 7-19. Find the m.g.f. of standard binomial variate (X - np)tN npq 
and obtain its limiting form as n —> ©, Also interpret the result. 


(Dethi Univ. B.Sc. (Stat. Hons.) 1990, 85] 
Solution. We know that if X'~ B_(n, p), then 
Mx (t) = (q + pe)" 
The m.,g.f. of standard binomial variate. 
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_X-nmp _X-p 
Z vnpq a? (say) 


where } = "p and O° = npq, is given by 
Mz(t) = &"”" Mx (t/a) 


=e NI. (g + peirrayr [From (**)} 
= [ert (q+ perry] 
= aeneats + pet/V"P4 ij 
2.2 
x _ _ pt | a ’ -3/2 
Gr ape Oana (n J 


22 a 
t t n {..~3/2 
+ 1+-L.,2* 409 (n 

. Vnpq = 2npq | 
where 0’ (n- 7) and 0” (n-*/?) involve terms containing n*? and higher 
powers ofn in the denominator. 


2 _" 
“ Mz) = C : p) + oe (p + q) +0 4) 


= tesa 
2n 


12 


a ) involves terms with n° 


where 0 (n 


and higher powers of n in the 
denominator. 


2 
a log Mz (t) =n log [ + - + 0 | 


2 
-n{{Zsom}-2{Zeou} +..| 


cr tet - V2 
5+ 0" (na **) 
es 


= 
= 
«a 


where 0°” (n involve terms with n’? and higher powers of nm in the 
denominator. Proceeding to the limit asm —> ©, we get 

2 

lim t 

pees log Mz (t) = 5) 
lim 
na 12.2) 


= 


Mz(t) = exp (7/2) (?*) 


Interpretation. (**) is the m.g.f. of standard normal variate [c.f Remark 
lo $ 8-2-5]. Hence by uniqueness theorem of moment generating functions, 
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standard binomial variate tends to standard normal variate as n —> ©. In othe, 
-words, binomial distribution tends to normal distribution as n —> ©. 


Example 7-20. A drunk performs a random walk over positions 0, + 1, 
+ 2,..., asfollows. He starts at 0. He takes successive one unit steps, going to the 
right wih probability p and to the left with probability (1 — p). His SlEPS are 
independent. Let X denote his position after n_ steps. Find the distribution of 
(X + n)/2 and find E (X). (1.1.T. B.Tech., Dec. 1991) 


Solution. With the ith step of the drunk, let us associate a variable y, 
defined as follows : 


X; = 1, if he takes the step to the right 
= — 1 if he takes the step to the left 


Then X = X; + X2 + ... + Xn, gives the position of the drunkard after », 
steps. 


Define Y; = (Xi_+ 1)/2 
Then Y; = (1 + 1)/2 = 1, with probability p 
= (+ 1 + 1)/2 = 0, With probability 1 - p = q, (say). 
Since the n steps of drunkard are independent, Y;’s , (i = 1,2,...7) are 
i.i.d. Bernonlli variates with parameter p. 


Hence <= Y; ~B(n, p) 
inl 


“ " (Xi; +1 ie ae 
= Ene 2 a }-5 [2% +2] 


fel io 


where X = 2 Xj, is the position of the drunkard after steps. 
inl 
Since (X¥ + n)/2 ~B (n, p), we have 
; X+n 
. 2 

=> E(X)+n=2np => E(X) =n (2p - 1) 
Example 7:21. Suppose that the r.v.X is uniformly distributed on (0,1) 
ie. fy) = 1;0 sx 1. $ ...(*) 
Assume that the conditional distributional Y|X = x has a binomial dis- 
tributién with parameters n and p= x, Le., 
*% 
P(YeyiXe aya (9 Pa y= OL 2aon (**) 


Find (a) E(Y) 


= np => 5 EW +n) = 


(b) Find the distribution of Y. (Punjab P.C.S., 1990) 
Solution. (a) We are given that the conditional! distribution of 
_ Y|X =x ~B(n, x) ..(i) 


te, E(V|X = x) = .»{ii) 
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We have : 
E(Y) = ELE(Y|X)] = E[ aX] = nE(X) [On using (ii) } 
1 1 


Now E (Xx) sf xf(@x)dx = fxd = = 
0 0 2 
BY) and’) sa 
2 2 
(b)- We have: fx,y (x,y) = f(x)» frix 2) 
Since X has (continuous) uniform distribution on (0,1) marginal distribution 


of Y is given by. 


oo 1 
fr) = f f@y) de = ff frix la): fea 
.) 0 


1 
. f'g “xX (l—x)""-1- dk [using (*) and (**)] 
0 


1 
= "Cy fx (l-x)"” dk 
0 


= 
"Cy BY + Lina ye I= et Piy+ I) M(n- y+ 1) 


(n— y)! C (a+ 2) 
2 n! : y! (n — y)4 
y!(n-y)! (n + 1)! 
-— : y = 0, 1,2,...,n 


Since Y takes the values 0, 1, 2, ..., 2 each with equal probability 
1/(n + 1), Y has discrete uniform distribution. 


Remark We could find E (Y) on using the distribution of Y in (6). 
n 64 n 
E(Y)= 2 yp(y) = 

y=0 


nm+i1, 


zy 
=0 
sod [O+1+2+ +nj=4 
n+1 _ 2’ 


as in Part (a). 
Example 7:22. If K (t) ts the cumulative function about the origin of 
the Binomial Distribution of size n, show that 


, y-1 
& K(@ =n {1 + e Fr } , where z = log, (p/q) 
(b) By expanding the R.H.S. in powers of t by Taylor’s Theorem, show that 


r-1 
Ky =n rr, where x, ts the rth cumulant. 


7°26 


Fundamentals of Mathematical Statistics 


(c) Hence or otherwise obtain the recurrence relation 


eo r>1 
Kry+1-= Pq dp ’ 
[Baroda Univ. B.Sc. 1993; Delhi Univ. B.Sc. (Stat. Hons.) 1992) 
(d) Prove that K,41 = a where z = log. (p/q) 
Solution. _ For binomia] distribution with parameters n and p, we have 
K() = log M(t) = nlog (q + pe’) 
; ~1 
(a) 2K =e onl 14 Let 
dt q + pe Pp 
if z = loge (p/q) => (p/q) = & => (q/p) = e7’, then 


a + eyo -+.(*) 
ad’) q 
b) [ere], aoe aa) 
df 1 dt _ 
= dq’! -(z+¢) - qr’! ace 
ssinsetny” | a(S (a) 
oe taQ0 


By summetry of the function e”*',(1 + &*') int and z we have 


d eo d er 


awe. wea... — 


dt’-'\14 é*! 
Substituting in (**), we get 


: . a’! et! : d’~} é 7 
. dJ-'|1+4é*! os dz |1+¢2 


=n ie (l+e7) lan ot (3 + 4 
ay ale 2 
dz’~' re ead 
« oo re }--3(S4\¢ 
dp dp \ dz’! dz \ g’-! | dp 
4k. A [-z = loge (p/4) 
1# Pa [From (***)] 
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Oe dx, dp _ dv, dz _ dk, 1 _ dk, 

dz ~ dp dz” dp! dp dp! pq *? dp 
dk, [c.f. part (c)] 
aie Kr+1 


Example 7-23. If b(r;n,p) = ( é Pp’ ¢’ is the binomial prob- 


ability in the usual notation and if 
k 
Bik;n,p)=P(X sk) = 2 b(r3n,p), 
ro 


then prove that 


q 
B (k jn, p) = (n - b| : Jferra -~t)'dt;q=1-p 
0 
k k ‘ 
Solution. B (k;n,p) = = b(r:n,p) = = & ear 
r=0 r=0 


Differentiating w.r. tog and noting thatq=1-p =>. r* = — 1, we 


get: 
d anes Par a : ieay' 
7 ad a alealtes (-1)-f "+ pi-(n- nd | 
a n!(—r) r-l1 ner , e- -r) ro on-r-l 
eA ree i f Fe l(n— yi? Pq 
2 : __a(n- 1)! r-1 nmr, __n(n- 1)! o epiat 
x | (r- 1)! (n-n!? r!(n-r—-1)! | 


_ af (eee (72 ) pe] 


= n { t ~lp- ms a(**) 


=. 
where = ("— 1) ph git} as 
r | ey ae ( ) 


a n[ (- 1) + (4= to) + (2- Hh) tot (t te-1)] 


= nt [ori 0, From, (+#4)} 
2 
, a. n—-1)\ &, n-k-1 
oe dq 7 v7?) n| k |p ° ’ P Bs 1 a 


On integration, we get 
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q 
Bi(k3n,p)en- ( e : 1 hs (1 ~ uy lt! du. 
0 


{n+l n:(n-1)!) _— ni(n-k) | n 
pum ( k ery — k¥(n—k)! m0 (7 | 
q 


B(k3n,p) =(n - k) (if \sa — use ut*- ay 
A 


as desired. 
Remarks. 1. We further gét : 

— t —_~f- t 
Bk +1, n— kh = Pk +D P= W | kN -k=1)! 


T(n + 1) n! 


ae: SeOA ne: SENET | 
BikK+1n-—-k kit (n-k-1)! k 
Hence the result may be written as : 
ye yt-F-l ay 


Bikinp= Pe D ere RI Cl—w) 


This result is of great practical utility. It enables us to represent the euimiilas 
tive Binomial Probabilities (which are generally quite tedious and time consuming 
to compute) in terms of Incomplete Beta Functions whhich are tabulated in Karl 
Pearson’s‘Tables of'the Incomplete Beta Functions. 

2 Let us now work out the probability : 


PiX 2 kh 5 ( e |e q'-' 
rek\ 7 


Differentiating w.r. to p, and proceeding similarly,-we shall get : 


4 bw zkhe=-n z (T ~ T,-1) (Try it) 
where ei i a (Tn © 0) 


d - - \n- 
megan e =kb=anTeg-1= 1 ea)? ‘(= p) . (. q = 1-p) 


y 


On eaten we shall get: 


P 
pce wen( fr! ee ~ u)'* du 
0 


l Vee nok 
P(X 2 k).= ——_—_———— 
(X24) B(k,n = aS as du 

This is quite an important result and should be committed to memory. We 
shall use it in ‘Order Statistics’. 
This result can be stated as follows : 
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If X ~B(n,p) and Y has Beta distribution with parameters k and 
n-k + 1 (c.f. Chapter 8), then 
P(Y sp)=PX2zh=1-P(X% sk-1) 
=> Fy(p) = 1 - Fx (k - 1) 
EXERCISE 7 (a) 

1. (a) Describe the probability model from which the binomial distribu- 
tion can be.generated. Hence find the first four central moments. 

(b) If p is the probability of ‘success’ ata single trial, obtain the probability 
of r ‘successes’ out of. independent trails. Determine the mode of the eesuliing 
distribution. 

2. (a) Define the binomial distribution with parameters p and n, and 
give a situation in real life where the distribution is likely to be realized. Obtain the 
moment generating function, of the binomial distribution and hence or otherwise 
obtain the mean, variance, skewness and kurtosis of the distribution. 

(b) Obtain the Moment Generating Function of the Binomial Distribution. 
Derive from it the result that the sum of two binomial variates is a binomial variate 
if the variates are independent and have the same probability of success. 


3. The mean and variance of a ‘binomial variate X with parameters 
n and p are 16and.8. Find 

() PX = 0), @) P& = 1), (i) P(X 2 2). 

4. Fora Binomial distribution the mean is 6 and the standard deviation is 
v2. Write out all the terms of the distribution. 


Ans. 29, p = 2/3,q@ 1/3; p(r) = (1/3) - & 2°; r = 0,1, 2,...,9 


5. (a) A pérfect cube is thrown a large number of times insets of 8. The 
occurrence of a 2 or 4 is called a success. In what proportion of the sets would you 
expect 3 successes. 


Ans ‘27-31% 

(b) In eight throws of 4 die, S or 6 is considered-a success. Find the mean 
number of successes and the standard: deviation. (Ans..2- 66, 1-33) 

(c) Aman tosses a fair coin 10 times. Find the probability-that he will have 

(t) heads on the first five tosses and tails on the next five tosses 

(ii) heads on tosses 1, 3, 5, 7, 9 and tails on tosses 2, 4, 6, 8, 10. 

(iit) 5 heads and 5 tails 

(iv) at least 5 heads 

(v) not more than 5 heads. {Madras Univ.B.Sc.'(Maini Stat) Nov. 1991] 


Ans. (i) (1/2)! i) 12)", it), 5H” 
10 e ‘: 10 5 ‘ ' 10 

(ivy) = %, 2 (vy) =& Yc, (2) 

xa5$ 2 x0 2 


(a) In 256 sets of twelve tosses of a fair coin, in how many caSes may 
one expect eight heads and four tails ? 


(Ans. 31) (Delhi Univ. B.Sc. Oct. 1992): 
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(b) In 100 sets of ten tosses of an unbaised coin, in how many cases shoulg 
we expect 


(i) Seven heads and three tails, (wi) at least seven heads ? 

Ans. (i) 12, (i) 17 

7. (a) During war 1 ship out of 9 was sunk on an average in making q 
certain voyage. What was the probability that exactly 3 out of.a convoy of 6 Ships 
would arrive safely ? (Madras Univ. B.Sc., 1992) 

Ans. °C3 (8/9)° (1/9)° 

(b) Inthe long run 3-vessels out of every 100 are sunk. If 10 vessels.are Out, 
what is the probability that 

(i) exactly 6 will arrive safely, and 

(ii)- at least 6 will artive safely ? 

Hint. The probability ‘p’ that a vessel will arrive safely is 

P = 97/100 = 0-97 and q = 0-03 
The probability that out of 10 vessels, x vessels will arrive safely is 
p (x) = 10 C. p* ec = 0 Cz (0: 97)" (0: 03)!°-* 

(t) Required probability = p (6) = 10 ©6(0- 97)° (0-03): 

(it) Required probability P (X 2 6) 

8. (a) Astudent takes a tnie-false examination consisting of 10 questions. 
He is completely unprepared so he plans to guess each answer. The guesses are to 
be made at random. For example, he may toss a fair coin and use the outcome to 
determine his guess. 

(i) Compute the probability that he guesses correctly at least five times. 

(ii) Compute the probability that he guesses correctly at least 9 times. 

(ut) Whatis the smallest that the probability of guessing atleast ” correct 
answers is less than 1/2. (Dibrugarh ‘Univ. M.A., 1993) 

Ans. (i) 319/512; = (ti) 11/1024; = (ut) 6. 

(b) A multiple-choice test consists of 8 questions and 3-answers to each 
question, of which only one is correct. [fa student answers each question by rolling 
a balanced die and checking the first answer if he gets 1 or 2, the second answer if 
he gets 3 or 4, and the third answer if be gets 5 or 6, find the probability of getting: 

(i); exactly 3 correct answers, 

(u) no correct answer, 

(ui) at least 6.correct answers. [Gauhati Univ. M.A. (Econ.), 1993] 

9.(a) The incidence of occupational disease in an industry is such that the 


workers have a 20% chance of suffering trom it. What is the probability that out 
of six workers chosen at random, four or more will suffer from the disease. 

Ans. 52/3125 

(b). (a) In a binomial distribution consisting of 5 independent trails, 
probabilities of 1 and 2 successes are 0- 4096 and 0O- 2048 respectively. .Find the 
parameter p of the distribution. (Ans. 0: 2) 
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10.(@) With the usual notations, find p fora binomial random variable X 

ifn «= 6 and if 9P(X = 4) = P(X = 2). (Ans.0-25) 
(Mysore Univ. B.Sc. April 1992) 

(b) X is a random variable following binomial distribution with, mean 
2: 4 and variance 1- 44. Find P(X 2 5), P(1 < X s 4). 

11.(@) Ina certain town 20% of the population is literate, and assume that 
200 investigators take a sample of ten individuals each to see whether they are 
literate. How many investigators would you expect to report that three people or 
less are literates in the sample ? (Shivaji Univ. B.Sc., Oct. 1992) 

(b) A lotcontains 1 per cent of defective items. What should be the number 
(n) of items in a random sample so that the probability of finding at least one 
defective in it, is at least 0- 95 ? (Ans. 68) 

12. (a) If on the average rain falls on ten days in every thirty days, find 
the probability 

(i) that rain falls on at least three days of a given week, 

(i) that first three days of a given Week will be dry and the remaining wet. 


7 
Ans. (i) = /C,(1/3¥ (2/3)'~, (i) (273) - 173)". 
xre3 


(b) Suppose that weather records show that on the average 5 out of 31 days 
in October are rainy days. Assuming a binomial distribution with each day of 
October as an independent trial, find the probability that the next October will have 
at most three rainy days. 

Ans. 0: 2403 


13. The probability of a man hitting a target is 1/4. (z) If he fires 7 times, 
what is the probability p of his hitting the target at least twice ? (a) How many 
times must he fire so-that the probability of his hitting the target at least once is 
greater than 2/3 ? [ Ans. (i) 4547/8192, (wu) 4] 


Hint. (i) p= 4+, q =~ We want n such that 
1-q'>isad<i=(2\ <ianme=4 

14. (a) The probability ofa man hitting a target is 1/3. How many times 
must be fire so that the probability of hitting the target at least once is more than 
90%. Ans. 6. | (Shivaji Univ. B.Sc., 1991) 

(b) Eight mice are selected at random and they are divided into ‘two groups 
of 4 each. Each mouse in group A is given a dose of certain poison ‘a’ which is 
expected to kill one in four; each mouse in group B is given a dose of certain poison 
‘b’ which is expected to kill one or two. Show that nevertheless, there may be 
fewer deaths in group A and find the probability of this happening. 

Ans. 525/4096 

15 (a) Acard is drawn and replaced in an ordinary deck of 52 cards. How 
many times must a card be drawn so that (7) there.is at least an even chance of 
drawing a heart, (iz) the probability of drawing a heart is greater than 3/4 ? 

Ans. (i) 3, (di). 5 
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(b) Five coins are tossed. What is the variance of the number of heads per 
toss of the five coins: 

(i) ifeach coins is unbiased, 

(ii) if the probability of a head appearing is 0-75 for each coin, and 

(uit) if four coins are unbaiséd and for the fifth the probability of a head 
appearing is‘0- 75 ? 

Hint (ii) Use generating function. [See Ex. 7-17 (it)] 

16. An owner of a small hotel with five rooms is considering buying 
television sets to rent to room occupants. He expects that about half of his 
customers would be willing to rent sets, and finally he buys three sets. Assuming 
100% occupancy at all times : 

(i) . What fraction of the evenings will there be more request than T.V. sets? 

(it) What is the probability that a customer who requests a television set 
will receive one? 

(ii?) If the owner’s cost per set per day is C, what rent.R must he charge in 
order to-break even (neither gain nor lose)-in the long run ? 

Hint. (i) Let the random variable X denote the daily number of re- 
quests. Then required probability is 

PX 2 4) = PK = 4) + PX = 5) = (3) 4) (; L 

(ii) The customer can get a T.V. in the following mutually exclusive ways, 

(a) There are no other requests that night. 

(b) There is one other request. 

(c) There are two other requests. 

(dq) There are three other requests and his request precedes at least one of 
them. 

(e) There are four-other requests, and his request preceedes at least two of 
them. 

The probability of the desired event 

= (0-5)* {1 + fer + 4Cr + 3+ “C3 + 3 ‘Cal 
(ui) Mean revenue 
= (0:5) -0 + °C; (0-5)° R + °C2 (05) 2K + | °C3 (0:5)° + °Cu(0-5)° + °Cs (0-5)*} 3R 
Ty 
~ 32 
The break-even rental is the value of R for which 
‘73 ; 
37 R= 3c => R= 1-315C 

17. A manufacturer claims that at most 10 per cent of his product is 
defective. To test this claim, 18 units are inspected and his claim is accepted if 
among these 18 units, at most 2 are defective. Find the probability that the 
manufacturer’s claim will be accepted if the actual probability that a unit is 
defective is 

(a) 0-05 (6) 0-10 (c) 0-15 and (d) 0- 20. 
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Ans. (a) 0-9410 (6) 0-9326 (c) 0-4445 (d) 0:2715 
18. (a) A set of 8 symmetrical coins was tossed 256 times and the 
frequencies of throws observed were as follows : 


Number of heads : 0 1 2 3 4 5 6 7 8 
Frequency of throws: 2 6 24 6 64 SO 36. 10 1 


Fit a binomial] distribution and find mean and standard deviation of fitted 
distribution. 


(b) Aset of 6 similar coins is tossed 640 times with the following results: 
Number of heads : 0 1, 2 3 4 5 6 


Frequency : 7 64 140 210 132 ~~ = 75 12 


Calculate the binomial frequencies on the assumption that the coins are 
symmetrical. 


19. (a) The following data due to Weldon shows the results of throwing 
12 dice 4096 times, a throw of 4, 5 or 6 being called:a success (x). 


x: O 1 2 3 4 5S 6 7 8 9 10 11 12° Total 

f: — 7 60 198 430 731 948 847 536 257 71 11 — 4096 
Fit the binomial distribution and ca!cilate the expécted frequencies.-Com- 

pare the actual mean and S.D. with those. of the expected-ones for the distribution. 


Ans. Expected freq. : 1, 12, 66, 220 495 792, 924, 792, 49S, 
220, 66, 12, 0; mean = 6, variance = 1: 71. 


(b) In 103 litters of 4 mice, the number of litters which contained 0, 1, 2, 3, 
4 females are recorded below : 


Number of female mice 0 1 2 3 4 Total 


Number of litters g 3234 24 5 103 


(i) Ifthe chance of obtaining a female ina single trial is assumed constant, 
estimate the constant but unknown probability. 


(a) Ifthe size of the litter 4 had not been given, how could it be estimated 
from the data ? 


20. X is random variable distributed according to the Binomial law : 
b (x37, p) = Me p qd’ *;x = 0,1, 2}...,7 


Obtain the recurrence formula : 


x +1 
Use this as a reduction formula and get the theoretical frequencies when an 
unbaised coin‘is tossed 8 times and the experiment is repeated 256 times. 


(Madras Univ. B. Sc. April 1992) 
Z1. (a) By differentiating the following identity with respect to p and then 
multiplying by p, 


b(x + 137, p) = epi g OOM) 
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[| ee =(q+p)".qe1-p 


x=0 
prove that 4’ = np and po = npq. 
22. (a) Let X ~b(x;n,p) and r bea non-negative integer. If the rth 
moment about the origin is denoted by.u,’ = E(X"), prove that 


e $ d my 
Ueo1 = mp, + p(l — p) ra 


[Delhi Univ. B.Sc. (Hons. Subs.), 1993, 88] 
(b) Show that for the binomial distribution B (n, p), 


d 
Mr+1 = Pq Veet ave » pr+tqe=ti, 


where symbols have their usual meanings. 


[Delhi Univ. B.Sc. (Stat. Hons), 1989] 
(c) If X ~B(n, p), obtain the recurrence relation for its central moments 
and hence find values of Bi; and B2. 
[Calcutta Univ. B.Sc. (Hons.), 1992] 
23. (a) The following results were obtained when 100 batches of seeds 
were allowed to germinate on damp filter paper in a laboratory : 


1 
Bi = 5 and B. = 


Determine the binomial distribution and calculate the frequency for X = 8, con- 
sidering p > q. 


i -4-p 1 () 
Hint. We have §; ae is 
and Bo =3+ — = 54 .(ii) 


From (i) and (u), we can find the value of n, p and q 


(b) Between a Binomial distribution withn = 5 and p= ; and a distribu- 


U 


tion with frequency function 
f(x) = @&(1-x), 05x51; 
determine which is more skewed. 
24. (a)x =r is the unique mode of Binomial Distribution having mean 
np and variance np (1 — p). Show that 
(n+1)p-1<r<(n+1)p a 
Find the mode of the binomial distribution withp = 5 and‘n = 7. 
[Delhi Univ. B.Sc. (Stat. Hons.) 1991, ’84] 
Ans. 4,3 (Bimodal). 


(b) Show that ifnp be a whole number, the mean of the binomial distribution 
coincides with the greatest term. 


qneoretical Discrete Probability Distributions 1.35 


(c) Compute the mode of a binomial distribution b (7, 5). 

[Delhi Univ. B.Sc. (Maths. Hons.), 1989] 
Ans. 1, 2 (Bimodal). 
(d) Define Bernoulli trials and state the binomial law of probability. Find 


e bounds for the most probable number of successes in a sequence of n 
gernoulli trials. 


ne workers can manufacture 120 articles during a shift, another worker 140 
articles, the probabilities of the articles being of a high quality are 0-94 and 0-80 
respectively. Determine the most probable number of high quality articles 
manufactured by each worker. [Calcutta Univ. B.Sc. (Maths. Hons.), 1988] 

25. Show that if two symmetrical binomial distributions (p = q = 5) of 


degree #t (and of the same number of observations) are so supcrimposed that the 
rth term of one coincides with the (r + 1)th term of the other, the distribution 
formed by adding superimposed terms is a symmetrical binomial of degree 
(n+ 1). (Bhagalpur Univ. B.Sc., 1993] 


26. (a) Let X denote a binomially distributed random variable. Show that 
; 2 
Bee)” 0,E (Ton) = 1, and 


V npg, Vinpq 


fo FEY]-[-re La] (Zh reentry (Sh 


(b) Obtain the characteristic function of the standard binomial variate 


(X - np)N npq, where X is the number of successes obtained in n independent 
trials. each with constant probability p of success, g= 1 — p. Obtain the lisnit of 
this function as n= 9, [Delhi Univ. B.Sc.(Maths. Hons.), 1991] 


(c) If X ~ B (n, p), prove that 


d 
Kee t= Pq: dp (K,), 
where Kk, is the rth cumulant. 
Hence deduce the values of Kz and K3. 
[Delhi Univ. B.Sc. (Stat. Hons.), 1991, °87] 


27. (a) If X and Y are two independent identically distributed binomial 
variates, obtain the probability that the absolute difference | X — Y! equals a 
given value, say r. 


(b) (i) If X and Y are independent binomial variates, with parameters p, and 
p3. and indices n, and ny» respectively, obtain the probability that X + Y equals 


r, 
(ii) In the above if p, = 2, what is the distribution of X + Y ? 
[Poona Univ. B.Sc., 1988] 


\ 
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(c) If X and Y are two independent binomial variates with parameter, 
n, = 6, p = 1/2 and n2 = 4, p = 1/2 respectively, evaluate, 
() P(X + Yee r), (i) P(X + Y 2 3) 
(Gujarat Univ. B. Sc. Oct. 1992) 
Hint X + Y ~B(6 + 4, 1/2) = B(10, 1/2) 
Ans. (i) P(X + Y =r) = p(n) = “C, (1/2); r = 0,1,..., 10 
(ii) P(X + Y = 3) = 1 —[p(O) + p(l) + p(2)] = 0-945 
(d) If X and Y are two independent binomial variates with parameters 
(n, = 3,p- = 0-4) and (n2 = 4, p = 0-4) respectively, find: 
(i) P(X =Y), (ii) P(X + Ys 2), (ai) PX = 3|X + Y = 4) 
Hint. X¥ + Y ~B(3 + 4,0:-4)-= B (7, 0-4) 


(i) P(X-Y) = 5 P(XerNYear= E PUK = r) P(Ye r)= 0-2871 
r20 re 


2 
(ii) P(X+Ys2)=+ i (0: 4)’ (0° 6)’~" = 0-420 
r=0 


(iii) P= 3/4 r= 4) ee ay ota 

28. (a) Obtain the moment generating function of Binomial distribution 
withn = 7 and p = 0-6. Find the first three moments of the distribution. 

[Poona Univ.B. Sc. 1992) 
Ans. (0°-4+ 0°62 )’ : mean = 4:2,n2 = 1:68, 3 = — 0° 336. 
‘(b) Suppose that the ni.g.f. of a random variable X is of the form 
Mx (t) = (0°4e + 0-6) 

What is the m.g.f. of the random variable Y = 3X + 2? Evaluate E (x). 

Ans. E(X) = 3:2, My(0) = e% (0-6 + 0:4e*)® 

(c) Obtain the moment generating function of the binomial distribuiion. 
Hence or otherwise obtain the mean, variance and skewness of the distribution. 

29. Show that the factorial moment generating function w(t) of the 
binomial distribution b (x; n, p) is (1 + pt)". Hence or otherwise show that 


wy = al? p’ 


Hint. Factorial moment generating function w (t) is defined as 


oO =Ed +o = L(+ pe BC(p + of gr 


v 
wry = co efficient of = inw(t) = "Crip = np! 
30. Show that 
(i) b(n,p;k) = b(n, 1-p;n-k) 


(i) 2 b(a,p;k)=1- 2 b(n, 1 -p;k) 
ker 1 


kan-r+ 


(iti) b(n +/1,p;k) = p.b(n,p;k - 1) + q.b(n, pk) 
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Hint. (/) b(n, 1.-—p; n- k)= @ ) =p yr -k p n—(n—k) 


(ii) > b(n, pk) = x O(n, l-p3;n-k= +4 b(n, l—-p;k) 
31. For a binomial distribution, let 


Fn (y) = x (”) p* qr, 


where g=1—>p, 

prove that 

(t) Fig (y) = p Fn(y~1)+ q FQ) 

(ii) Cov (X, n- X) =-—npq (Bombay Univ. B.Sc., April 1990) 


32. (a) Random variable X follows binomial distribution ‘with parameters 
yn = 40 and p =; - Use Chebychev’s inequality to find bounds for 

(i) P( 1X -101< 8) 5.) P[ IX —101> 10) 

Compare these values with the actual values (Hint : Use Normal 
approximation for the Binomial). (Madras Univ. B.Sc. (Main Stat.), 1988) 

Ans. (i) 113/128 (lower bound), (di) 0.075 (upper core 

(b) X follows binomial distribution with n= 40, p =z - Use Chebychev’s 
lemma to 

(i) find k such that 

P {1X -201> 10k} < 0-25, and 

(ii) obtain a lower limit for P {1X ~201<5}. 

[Delhi Univ. B.Sc. (Maths. Hons.), 1984} 

Ans. (i) 2V10, (ii) 3/5 , 

(c) How many trials must be made of an event with binomial. probability of 
success.5 in each trial, in’order to be assured with probability of at least 0-9 that 
the relative frequency of success will be between 0-48 and 0-52? ~ (Ans. 6250) 

Hint. Use Chebychev’s Inequality. 

33. (a) Show that if 4 coin is tossed n times,the probability of hot more 


than k heads Is : ; 
L(o) +(7) + +(e) G) 


[South Gujarat Univ. B.Sc., 1988] 
(b) If X has binomial distribution with paramctes n and' p, then prove that 


P[Xiseven]= 5[1+(q-p)"]. [Delhi Univ. B.Sc. (Stat. Hons.), 1988] 


34. If the probability of hitting a target is 1/5 and if 10 shots are fired, what 
is the conditional probability of the target being hit at least twice assuming that 
at least one hit is already scored ? 

[Nagpur Univ. B.Sc., 1988, ’93] 
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Hint. LetX donote the number of times a target is hit when 10 shots are 
fired. Then X¥ ~B (10, 0-2). The required probability is : 


Py eo ye tet ee ee) eae) 


P(X 2 1) ~ P(X 2 1) 
_1-[PW=0)+PW=1)] _ 0-625 _ 
1-{P& = 0)] ~ 0-893 ae 
. 35. (a) LetX beaB(2, p) and Y beaB(4,p). If P(X 2 1) = 5/9, 
find P(Y.2 1) [Kerala Univ. B. Sc., 1989] 


Hint. P(X z= 1).=-1- P(X =0)21-q = 5/9 =q = 2/3,p & 1/3. 
P(Y21)=1-P(Y = 0) =1-q° = 65/81. 

36. Let.B denote the number of boys in a family with five children. If p 
denotes the probability that a boy is there ina family, find the least value ofp such 
that 

_P(B = 0) > P(B = 1) (Shivaji Univ. B. Sc., 1990) 


‘Ans. q > Spq' => q > Sp > p <=. 


37. (a) Suppose X¥ ~B(n, p). with E (X) = 5, Var (X) = 4. Find 
nand p. (Ans. n = 25, p = 1/5) 

(b) LetX ~B (n, p). For what p is variance (X) maximised if we assume 
n is fixed. 

Ans. VarXx = npq = np ~ p) = fp) (say) if’ (p) = 0, f''(—) < 05 p= 12 = 4 

38. (a) X ~B(n = 100,p.= 0:1). Find PY s py — 3 0;) 

Anns. p = 10, 0 = 3, P(X s py - 3'0,) = P(X < 1) = 10-9 x (0-9)” 

(b) IfX ~B(25,0-2), find P(X < py, -2 o,) 

_ [Delhi Univ. B.A. (Stat. Hons.) Spl. Course 1989] 

39. Forone halfofn events, the chance of success is p, and the. chance of 
failure is g, whilst for the other half the chance of success is g, and.the chance of 
failure is p. Show that the S.D. of the number of successes is the same as if the 
chance of success were p inall the cases i.e. Vnpq, but that the mean of the number 
of successes is n/2 and not np. (Delhi Univ. B.A. 1992) 

Hint. X ~ B (n/2, p) and Y ~B(n/2, q) are independent. Let 
Z = X + Y. Now prove that Var (Z) = mpq and E(Z) = n/2. 
| 40.. The discrete density of X is given by fy (x) = x/3, forx = 1,2 and 


fvjx (y|x) is binomial with parameters x and 5 i.e., 


Fyx (y|x) = P(Y = y|X = x) = thos 
fory = 0, 1,..,x and x = 1, 2. 
(a) Find E(X) and Var (X); (b) Find E (Y) 


(c) Find the joint distribution. of X and Y. 
‘Hint. Proceed as in Example 7: 21. 
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Ans. (a) E(X) = 5/3, Var(X) = 2/9, (b) E(Y) = 5/6. 


x 
x x 1 
(c) f(x,y) = Ly 3) la|e"- 1,2,; y = 0,1,...,x 
41. Twodice are thrownn times. Let_X denote the number of throws in 
which the number on the first dice exceeds the number on the second dice. What 
is the distribution of X ? 


Ans. X ~B (n,p = 15/36) 

Hint. _p is the probability that the number on the first dige exceeds the 
number on the second dice.in a throw of two dice. 

42. LetX; ~B(n,pi) and X2 ~B (a, p2). 

If pi < p2, prove that: 

P(X; s k) 2 P(X. s k) for k = 0,1,...,-n 
Hint. Use Example 7: 23. 
43. If X ~ Bla, P), show that 


P(X sk) =) — dy 
oq it + y)"* 
a a: 
where M1 = f —“ dy =P k + 1n-h 
9 (1 + y) 


Hint. = Pu sk)= = k se el = Ax, (say) 
q ate 
[See eee 23] 


4 k 
Find —— iq (RHS) = . 7 reer es dy serait, 
qq|7,,(1 +) [1+(p/g I \¢ 
1. ko on-k-1 | 
“Besta Po 7M 
(On simplification) 
44. If X B(n,p) and Y has beta distribution with parameters k and 
n — k + 1, (See Chapter 8), then prove that 
P(Y sp) = P(X 24 ie. Fy(p) = 1 - Fyr(k - 1) 
45. If a fair coin is tossed an eVen number 27 times, show that the 
probability of obtaining more heads than tails is 
? dn 
i mo {4 
2 " | | 
Hint. X : No.ofheads; Y = No.of tails; No. of trials = 2n 
P(X > Y) + P(X < Y) + P(X = Y) = 


=> 2P(X¥>Y)=1-P(X=Y) 
[-.- By symmetry, p = q = { => P(X 3 Y) = P(X'< Y)} 
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_ | ="C.p" < q' = 1 2G. (1) 
=> PK > Vy = 5 [1 ne )"] 


7-3-0. Poisson Distribution (as a limiting case of Binomial Distribu- 
tion). Poisson distribution was discovered by the French mathematician and 
physicist Simeon Denis Poisson (1781—1840) who published it in 1837. Poisson 
distribution is a limiting case of the binomial distribution under the following 
conditions: 


(i) vn, the number of trials is indefinitely large, t.e.. n — ©. 

(i) p, the constant probability of success for each tral is indefinitely small, 
Le., p — 0. 

(iti) np = A, (say), is finite. Thus p = A/n,q = 1 — /n, 
where A is a positive real number. 

The probability of x successes in a series of nm independent trials is 


° —_ n x —x ° = 
b(;n,p) = (re q’'-*;x = 0,1,2,...,n ...(*) 
We want the limiting form of (*) under the above conditions. Hence 
n! A\" Ay" 
din, Boonie Be. serait) * | 


Using Stirling’s approximation forn!as n > © viz., 
lim nt = V2 eo" n"* 4). we get 


o-> © 


TH ort. pnrii/2) x n~x 
lim b (x; n, p) = lim Gar Gg f _ z 
x n n 


ae aro | xIVER EOD: (nay 


Ya n" n-x+(1/2) 3) 


ex! Ge xyer ay 


to] 
* a 
ee, RaeaNe 


1-2) 


lim [, A]", lim (,_% 
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But we know that 


and : nal" ”) 


Therefore 
lim nr tee Cn ae el 
nwo OSM P) = ck ae ee 0,1, 2,..., 0; 
[Using (**)] 


which is the required probability function of the Poisson distribution. ‘A’ is known 
as the parameter of Poisson distribution. 


Aliter. Poisson distribution can also be derived without using Stirling’s 
approximation as follows : 


b(x;n,p) = (ea — py * = (7) ie (1 — p)" 


4 


r 
_ a(n — 1) @ —-2)...a—-x + 1), | ay 
ns Te ae ta “5 


lim | ete [From (**)} 


Definition. A random variable X ts said to follow a Poisson distribution 


if it assumes only non-négative values and its probability mass function is given by 


~X 4x 


p(x, d) = P(X =x) = <* ;x = 0,1,2,...;A >0 


= 0, otherwise .(7> 14) 
Here A is known as the parameter of the distribution. 
We shall use the notation. X¥ ~P(A) to denote that X is a Poisson variate 
with paramter i. 
Remarks 1. It should be‘noted*that 
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> P(X =x) =e" »y Nexis ere =1 
x=0 x=0 


‘2. The corresponding distribution function is: 
x 


F(x) = P(X s x)= S p(r) = e* > N/r!;x = 0,1, 2... 

r=0 r=0 

3. Poisson distribution occurs when there are events which do not occur as 
outcomes of a definite number of trials (unlike that in binomial) of an experiment 
but which occur at random points of time and space whercin our interest lies only 
in the number of occurrences of the event, not in its non-occurtences. 

4. Following are some instances where Poisson distribution may be suc. 
cessfully employed: 

(1) Numbér of deaths from a disease (not in the form of an epidemic) such 
as heart attack or cancer or due to snake bite. 

(2) Number of suicides reported in a particular city. 

(3) The number of defective material ina packing manufactured bya good 
concern. 

(4) Number of faulty blades in a packet of 100. 

(5S) Number of air accidents in some unit of time. 

(6) Number of printing mistakes at each page of the book. 

(7) Number of telephone calls received at a particular telephone exchange 
in some unit of time or connections to wrong numbers in a telephone exchange. 

(8) Number of cars passing a crossing per minute during the busy hours of 
a day. \ 

(9) The number of fragments received by a surface area ‘¢’ froma fragment 
atom bomb. 

(10) The emission of radioactive (alpha) particles. 

7-3-1. The Poisson Process. The Poisson distribution may also be ob- 
tained independently (i.e., without considering it as a limiting form of the Binomial 
distribution) as follows : 

Let X; be the number of telephone calls received in time interval ‘t’ on a 
telephone switch board. Consider the following experimental conditions : 

(1) The probability of getting a call in small time interval (¢, t + df) is 
d dt, where i is a positive constant and dt denotes a small increment in time ‘?’. 

(2) The probability of getting more than one call in this time interval is very 
small, i.., is of the order of (dt}” i.e., 0 [(dt)"] such that 

ict 2 
lini 0 (dt) = 0 
d—+>0 dt 

(3) The probability of any particular call iff the time interval (¢, ¢ + df) is 
independent of the actual time ¢ and also of all previous calls. 

Under these conditions it can be shown that the probability of getting x calls 
in time ‘t’, say, Py (¢) is given by 
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ov ay 


P,(t) = ———_; x = 0, 1,2,. 


which is a Poisson distribution ae parameter A ¢. 


Proof: Let Px(‘) = P {of getting x calls ina time interval of length ‘t’}. 
Also P” {of at least one call during (¢,¢ + dt)}=Adt + 0 [(at)’ ] 


and P {of more than one call during (t,¢ + dt)} =O[ (dt)* }. 


The event of getting exactly x calls in time ¢ + dt can materialise in the 
following two mutually exclusive ways : 


(i) x calls in (0, ¢t) and none during (¢,¢ + dt) and the probability of this 
event is P, (t) {1 -[(Adt + 0 (dt) II, 


(ii) exactly (x — 1) calls during (0, 2) and one call in (¢, ¢ + dt) and the 
probability of this event is P, _ 1 (¢){A dé). 


Hence by the addition theorem of probability, we get 
P,(t + dt) = Py(0){1 — Xdt — 0 (dt)’} + Peg (t) ddd. 
= Pz (t)(1 — Adt) + Pe-y(t)Adt + O(dt)’ Pr(t) ...(1) 


: : 
wo Pe APO AP) + RPO + aay 


Proceeding to the limit as dt — 0, we get 


lim Pe + dt) - PO py 4 AP (t) 


dt - 0 dt 

: P(t) = - AP) +A Prii()x 21 .W.Q) 
ore ) denotes differentiation w.r. to ‘t’. 

Forx = 0, Pr-1(¢) = P-1(t) = P{(- 1) calls in time | = 0 

Hence from (1), we get 

Pot + dt) = Bo(t) {1 — Ade} + 0 (dt) 
which on taking the limit dt —> 0, gives 


Po (t) = — Po(t) = tk ; 


P, (t) 


Integrating w.r. to. ‘t’, we get 
log Po(t) = -— At + C, 
where C is an arbitrary constant to be detérmined from the condition © 
Po(0) = 1 
Hence log 1=C => C=O 
: log Po(t) =-At => Po(t) = &* 
Substituting this value of Po (¢) in (2), we get, withxy = 1 
Py' (t) = — APy(t) + Ae*! 
=> Py’ (t) + APi(t): = Aew*! 
This is an ordinary linear differential equation whose integrating factor is 
e*' Hence its solution is 
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MPrwQernfe er de+C=d0+C, 


‘where C, is an arbitrary constant to be determined from P; (0) = 0, which gives 
C, = 0. 


Pi(t) =e nt 
Again substituting this in (2) with x = 2, we get 
P2(t) + Po) = AE Kt 
Integrating factor of this equation is e*' and its solution is 
2 
Ped! = fte e* Mase Xf ac, 
where C2 is an arbitrary constant to be determined from P2 (0) = 0, which gives 


‘C2 = 0. Hence 
-ar AD? 
P2(t) = € 5 
Proceeding similagly step by step, we shall get 


e*! 
P,(t) = coer. x = 0, 1,2,..., © 
7-3-2. Moments of the Poisson Distribution 


Wi’ = E(X) = od ha 


o —-K 4x @ x-1 
= Ex-S* we 2 Gomi 


x=0 x! rei - 1)! 
neo {rene mame J enehd ad 
= tAtaytagyten|o = 


Hence the mean of the Poisson distribution is A. 


2’ = E(X°) = = 2pG,h) = E(xG-1 +x} 


@ eo e* x 
-ExG-nXeExt us 
x=0 x! 
«9 x~-2 
a Ve > traavetteru Waa 
xr=2(%—2)! 


ps’ = EY) = z x'p (x, A) . 
> st | 


= E \=6 -1)@ -2)+3x @- 1) ?- 


= Exe - Ne - 2) & P43 Z2@-1) 
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aes \g 5 43e°)2 5 +r 
xo3(x — 3)! 7 eal 


en Me + 367 hy2 Qh tre 434A 


w' = E(X') = Ex + p(x) 
x20 


A. 


a -h az 
= E (x(e — 1) (x— 2)(x- 3) + Gx(x—- 1) 2) + Tx(x- 1) 4 x}= x 


@ x-4 ad x-3 
se | a + 6e8n| = ca 


x-4(X-4)! xa3 (x — 3)! 
2 02 ad 7-2 
+7e°R 2 ats 
es (ere rye O (ere 47 (Cre) +A = atdunis 
The four central moments are now obtained as follows : 
yo = po — m2 = (+ A- Ved 
Thus the mean and the variance of the Poisson distribution are each equal to 


33 —3 py pe +2 wi = (0 +37 4d)-3A (A+ A+ 2A wd. 
pe = pa’ — 4s’ pay’ + 6 pr’ i’? — 3 

= (A‘+ 62+ TAZ4 A)— 4A(AP + BAM A+ 6A (A724 A)— BAS m BA7HA 
Co-efficients of skewness and kurtosis are given by 


2 2 
[Co 6 = VB, « 
‘i w pe ee a, 


(7+ 15) 
ry 
Hence the Poisson distribution is always a skewed distribution. 
Proceeding to the limit as A -+ ©, we get 
fi = 0 and B2 = 3 
7-3-3. Mode of the Poisson Distribution 


Also Br = 3+ = and yp = fp -3 => 
2 


p&) ~ x! A (7-16) 
x 


(x ~ 1)! 
We discuss the following cases : 
CaseI. When i is not an integer. 


Let us suppose that S is the intengral part of A. 
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pi) oe eee p(S-1) , 1, _ pS) > 1, 
J) "p(S-2)) * pS=1) 
S +1 (S + 2) 
and pis + 1) < 1, oy 1 ee 
p(S) p(S + 1) 
Combining the above expressions into a single expression, we get 
p (0) < p(1) < p(2)... < p(S — 2) < p(S — 1) < p(S)>p(S + 1)> 
p(S + 2)>..., which shows that p (S) is the maximum vaive. Hence in this case 
the distribution i is unimodal and the integral part of A is the unique modal value. 


Case II. When id = k (say) is an integer. Here we have 
PM), PQ, pk-),, 


p(0\)” ’p(i)” ’”? p(k - 2) 
ne _plh) _, pk +1) neeon. 1... 
P(k-1) ’ pk) p(k + 1) 


“ p(0)<p(1)<p(2)<...<p(k—- 2)<p(k- 1)= p(k)> p(k+ 1)> p(k+ 2)... 

In this case ‘ve have two maximum values, viz., p(k — 1) and p (k) and thus 
the distribution is bimodal and two modes are at (k — 1) and k, t.e., at,(A ~— 1) 
and A, (since k = A). 

7-3-4. Recurrence Relation for the Moments of the Poisson Distribu- 
tion. By def., 


wu, = E{X -E(Q)}’ = 2 hE EOD) 


Sy u-w SE 


xe 
Differentiating with respect to ie we get 
Tee Se wre jp E. yew Ax gerd gd Ne] 


x20 
oO 


--r¥ 2 pyrt SX 2 >) (x - BM yr &*(% =)| 


xa0 
. rie" 1 a 
ont (e - ay 7?) (x - A) = 
d; 1 
ax” — TUr-i ve Pre 
=> rx + a Zhe 
pee on abe (717) 
Putting = 1,2 and 3 successively, we get 
ener ery es ae (Bo 1, pr = 0) 


dx 
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du du 2 
= 2hp +A = 1 us = 3Au2 + 2 = 3 + +A 


7-3-5. Moment Generating Function of the Poisson Distribution 


-r 4x SS Be tx 
My (t) = » oe EN y eer 
: 0 


—, x! “ x! 
2 
z~eJi4re4 KE) + ice | ee he a eh lemD) 
2! ...(7> 18) 


7-3-6. Characteristic Function of the Poisson Distribution 


00 @ —-A 4x 00 it 
; XV es Le 
x(t) = = p(x) = - e 7 ae = wer 
=e phe _ Ae 1) (7-19) 


7-3-7. Cumulants of the Poisson Distribution 
Kx (0) = log M(t) = log [e*-] = A(¢ - 1) 


Ky = rth cumulant = co-efficient of | in Ky (t) = A 


=> Kw 2A; r= 1,2,3.,.. ..(7> 19a) 

Hence ail the cumulants of the Poisson distribution are equal, each being 
equal to A. In particular, we have 

Mean«= x; = Ame = As] K3= A and ye Ky 3x3 @eA+ 32? 


2 2 2 
ws oA 1 uM A+3n 1 
=“-——=-— = — and ~2 = =—+3 
Bi re 43 ry B us 42 rt 
Remark. Ifm isthe mean and o is the s.d. of Poisson distribution with 
parameter A, then 


mov Y2= 4° VA - VB: (B2 - 3) 


11 
=A: VA oe , al 


7-3-3. Additive or Reproductive Property of Independent Poisson 
Variates. Sum of independent Poisson variates is also a Poisson variate. More 
elaborately, ifX;, (i = 1, 2,...,) are independent Poisson variates with param- 
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” 


ters i; i = 1,2,...,n respectively, then < X; ts also a Potsson variate with 
i=l 


Rn 
parameter Xk. 
jel 


f 
Proof. My, (t) =e 6-1). § 2 1,200 
My, +4Nx4+..4.7, (t) a Mx, (t) My, (t) eee Mx, (z), 
[since X;;t = 1, 2, ..., mare independent] 
= M(e-1) le-1y | dnle=1) 
aM thet. th)(e-? 


which is the m.g.f. of a Poisson variate with parameterA,; + Az + ... + An. Hence 


n 
by uniqueness theorem of m.g.f.’s, 2 X; is also a Poisson variate with parameter 
inl 


Remarks 1. In fact, the eonverss of the above result is also true i.e., if 
X, X2,...,Xn are independent and z a has a nessen distribution, then each 


of the random variables X\, X2 ... x, Ws a Poisson ‘ier Bild 
Let X; and X2 be independent r.v.’s so that X; ~P(A,) and X; + X2 
~P (Ai + Ao). Then we want to prove that X2 ~P (A2) . 


Proof. Since X; and X>2 are independent, we have 
My, +X? (2) = My, (t) Mx, (t) 
= heel M(e-1). yy Wy) 


= Mx,() = &2©~1) 
=> X_ ~P(d2), by uniqueness theorem of m.g.f. 


2. The difference of two independent Poisson variates is not a Poisson 
Variate. 


My, - 2X2 (t) = Mx, + (- X2) (0) S My, (t) : M ~*2) (¢), 
(since X; and X2 are independent). 
My, - 2; (0) = Mx, (0) My, (- t) [ *- Mex (t) = My (ct) ] 


oo (é-1). oe (e'=1) _ oM (e-1)+A(e'-1) 


which cannot be put in the form | ier ) Hence eS — X2) is not a Poisson 
variate. 
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Moreover the difference (X; — X2) cannot be a Poisson variate is evident 
from the fact that it may have positive as well as negative values, while a Poisson 
variate is always non-negative. 


7-3-9. Probability Generating Function of Poisson Distribution 


o oh sad k 
PGF.ofX¥ = Dy = he y er AD ordre OU-D 


...(7> 20) 
Example 7-24. Acar lure firm has two cars which t fires out day by day. 
The number of demands for a car on each day ts distributed as Poisson vartate with 
mean 1-5., Calculate the proportion of days on which (i) neither car is used, and 
(ii) some demand is refused. [Meerut Univ. B:Sé, 1993] 
Solution. The proportion of days on which there are x demands fora car 

= P {ofx demands in a day} 

7 es (1- 5)" 
x! 


? 


since the number of demands for a car on any day is a Poisson variate with mean 
4-5. Thus 


S15 44. 
P(X =x) = 20, x =.0,1,2,... 


(t) Proportion of days on which neither car is used is given by 
PW=0)=e¢"° 


(1-5)? (1-5)> . (1-5)* 
-[1-15+ a1 31 41 - | 


= 0: 2231 
(ii) Proportion of days on which some demand is refused is 
P(X > 2)=1-PQQ@ s 2) 
1-[P@& = 0) + P(X = 1) + P& = 2)] 


Roam seeeheae 
6 


a! 
= 1-0-2231 x 3-625 = 0- 1912 

Example 7-25. A manufacturer of cotter pins knows that 5% of his 
product is defective. If he sells cotter pins in boxes of 100 and guarantees that not 
more than 10 pins will be defective, what ts the approximate probability that a box 
will fail to meet the guaranteed quality ? [Kanpur Univ. B.Sc. 1993] 

Solution. We are givenn = 100. 

Let p = Probability of'a defective pin = 5% = 0: 05 

‘ dX = Mean number of defective pins in a box of 100 
=«np=100x 0-05 = 5 
Since ‘p’ is small, we may use Poisson distribution. 
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Probability of x defective pins in a box of 100 is 


—-r 4x e? 
PX¥=x-5* 28 Spe O11 
Probability that a box will fail to meet the guaranteed quality is 
10 Sg 10 53 
P(X > 10) = 1 —P(X s 10) =1- Yap ei >> 
x=0 ° x=0° ° 


Exaimple 7: 26. Six coins are tossed 6,400 times. Using the Poisson dis. 
tribution, find the approximate probability of getting six heads r times. 

Solution. The probability: of obtaining six heads in one throw of six coing 
(a single trial), is p = (12)°, assuming that head and tail are equally probable, 

- A = np = 6400 x (%)° = 100. 

Hence, using Poisson probability law, the required probability of getting ¢ 
heads r times is given by : 


P(X =nr)= 


ot r e” 100. 
— 2 -(100)" = 0,1,2,. 


r! 

Example 7:27. |. In a book of 520 pages, 390 typo-graphical errors 
occur. Assuming Poisson law for the number of errors per page, find the probability 
that a random sample of 5 pages will contain no error. 

[Patna Univ. B.Sc. (Hons.), 1988] 

Solution. The average number of typographical errors per page in the 
book is given by A = (390/520) = 0-75 

Hence using Poisson probability law, the probability of x errors per page is 

—-K 4x - 0- 
given by: P(X = x) = ex ~ oO = 0,1,2,... 
The required probability that a random sample of 5 pages will contain no 


5 Sry 
error is given by: [ P(X¥ = 0) ] “(e"” = oo 


Example 7-28. Suppose thatthe number of telephone calls coming into 
a telephone exchange between 10 A.M. and 11 A.M. say, X\ is a random variable 
with Poisson distribution with parameter 2. Sumilarly the number of calls arriving 
between 11 A.M. and 12 noon say, X2 has a Poisson distribution with parameter 
6. If X; and X2 are independent, what is the probability that more than 5 calls 
come in between 10 A.M. and 12 noon ? [Calicut U. B. Sc. Oct: 1992] 


Solution. Let X = X, + X2. By the additive property of Poisson dis- 
tribution, X is also a Poisson variate with parameter (say) A = 2 + 6 = 8 
Hence the probably of x calls in-between 10.A.M. and 12 noon is given 
z 
ty PW any 2 ee, x =0,1,2,. 
Probability that more than 5 calls come in between 10 A.M. and 12 noon is 
given by 
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5 ~8 gs 
P(X >5)=1-PW%s5)=1-) ae 
x=0 : 


= 1 — 00-1912 = 0- 8088 


Example 7-29. A Poisson distribution has a double mode atx = land 
y = 2. What is the probability that x will have one or the other of these two values ? 


Solution. We have proved that if the Poisson distribution is bimodal, then 
the two modes are at the points x = A — 1 and x = A. Since we are given that 
the two modes are at the pointsx = 1 and x = 2, wefind that A = 2. 


-rA 4x -2 
P(X =x) = <i = aa x = 0, 1,2,... 


=> P(X = 1) 2672 
and P(X =2)= 


: Th aie 
Required probability = P(X = 1) + P(X = 2) =2e°° + 2e°* = 0-542 
Example 7-30. if X is a Poisson variate such that 
P(X = 2) = 9P(X = 4) + 90P(X = 6) ..(*) 
Find (i) X, the mean of X, (ii) 8B, the coefficient of skewness. 


{Delhi Univ. B. Sc. (Maths. Hons.) 1992, ’87] 


Solution. IfX is a Poisson variate with parameter A, then 
-r 4x ; ° 
P(X =x) = <—* x= 0,1,2,...;4 > 0 


Hence (*) gives 
< , 2 ' 4 6 
e*.n? 4 f ad 
21 « joan + 55 
x42 
oy = [302 + Aa] 
=> 4 4+3227-4 = 0 


Solving as a quadratic in r?, we get 
n2 oe DOE VI+ 16 3 £5 
2 2 
SinceeX > 0, weget? 21 => Ae=#l 
Hence mean = 2 = 1, and v2 = Variance = A = 1 


Also 6, = Coefficient of skewness = 5 = 1. 


Example 7-31. If X and Y are independent Poisson variates such that 
P(X = 1) = P(X = 2) 
and P(Y = 2) = P(Y = 3) a(*) 
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Find the varaince of X — 2Y. 

Solution. Let X ~P(A) and Y ~P(y). 
Then we have 

-i x 


PW <5) 3 e012 ks 0 
x! 


rH, 
and PY =y) = SE, 0 0,1,2,..5 4 > 0 


Using ,*), we get 
2-2 
Ae = A 
27 3 -u -..(**) 
and ee 
2 3! 


Solving (**), we get 
re (A —-2] =O and we"[p -3] =0 
=> AX = 2 and w = 3, since A > 0, p > 0. 
Now Var (X) = A = 2, and Var (Y) = wp = 3 -..(***) 
Var (X — 2Y) = 1” Var(X) + (— 2)? - Var, 
covariance term vanishes since X and Y are independent. 
Hence, on using (***), we get 
Var (X - 2¥V) =2+4x3=14 
Example 7:32. If X and Y are independent Poisson variates with means 
Ai and A2 respectively, find the probability that 
(i) X+/Y =k, (it) X = Y [Delhi Univ. B. Sc. (Stat. Hons.), 1991] 
Solution. We have 


—Ar x 


P(X = x)= aE = 0,1,2,3,...;41 > 0 


-h2 . 
and PY = y) = <8, y 0 0,1,2,3..5 A2 > 0 


k 
(i) PX+Ye2h=t PWerNVYe=k-r) 
r=0 
k 


= £P(X =r) P(Y =k - r) 
r=0 


[-- X and Y are independent | 


k -_ = = 
ere eh. abr 
xs ee 
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7-53 
ki * 1!k-p!- Tk at eal =| 
abil Oa k k-2 42 k 
are A2 + Ci Ad + A1 + C2°An “Ai +... + a | 
on G1 +2) 


-—— «it Ao, k = 0, 1,2,... 
which is the probability function of Poisson distribution with parameter 
ht A2 

Aliter. Since ¥ ~P (Ai) and Y ~ P(A) are independent, by the addi- 
tive property of Poisson distribution X¥ + Y ~P (A, + A). Hence 


: Pm Oe 
PWXW+Y-h = _ ee 0,1, 2, .. 


(ii) Peed PXerNYen 
"., re 


= 5 P(X = r)P(Y =r) 
r=0 


[XX and 'Y are independent] 
eo + da) > (uA 
(ry 


Example 7-33. Showthat ina Poisson distribution with unit mean, mean 
deviation about mean is (2/e) times the standard deviation. 
(Patna' Univ. B. Sc. (Stat. Hons.) 1992; Delhi Univ. B.Sc. (Stat. Hons.), 1993] 
Solution. Here we are givendA = 1. 
P(X = x) = “* 2 sie = ¢. x = 0,1,2,... 
Mean deviation about mean 1 is 


E(\X-1|).= = [x-11p@) =e" 5 boat 


xo0 
se! ee 

2! 3! 4) °°" 
We have —2— 2 @+U-1 1 1 


(n+1)! (n+1)! ant (n+)! 
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.. Mean deviation about mean 


aye 1 1 1 1 a Ties 

oe t+ (1-35) *(s3-a7]*(sr- aa) + | 
1 2 2 7 

vse (1+ 1) = 3 1 = mae standard deviation, 


since for the Poisson distribution, variance = mean = 1 (given). 
Example 7-34. Let X,,X2, ....,Xn be identically and independently dis. 
“RK 


tributed Bin (1, p) variates..Let Sn = X Xj; and My, (t) be the m.gf. of Sn Find 
s je 1 


lim M, (t), using np = X(const.) [Delhi Univ. B. Sc. (Maths Hons.), 1989] 


n—-> 0 
Solution. Since X;, i = 1,2, ...n are iid. binomial variates B (1, P), 
n 

Sn = & Xj, isa binomial-B (, p). variate. 

jel 

n n 

Mat) = Mg f.of Sa = (4 + pe’) = [1 + (e - 1)p | 
If wetakenp = X} => p = A/n and letn — ©; we get 


n> eo 


n 
lim ; lim (e -1)A t 
M, (t) eo | exp[A(e-1)], 
which is the m.g.f. of Poisson‘distribution with parameter A. Hence by uniqueness 
n 
theorem of m.g.f.,S, = = X; ~ P(A), asm > ©, with np = 2 (fixed). 
j=l 


Example. 7-35. (a) IfX tsa Potsson variate with mean m, show tha 


the expectation of e-** is exp [=m(1-e-') ]. [Nagpur Univ: B.Sc. 1993] 


Hence show that, if X is the arithmetic mean of n independent random 
variables X,X2,..., Xn, \each having Potsson distribution with parameter m, ' 


then e~* as an estimate of e~™ is biased, although X is an unbaised estimate of m.: 
(b) If X is a Poisson variate with mean m, what would be the expectation 
of e** kX, k being a constant. 


Solution. 
as 2 —-m Xx ss —k 
E(e**)-= > e* p(x) = > ima oie > im 
x=0 x=0 ° x=0 
~k\2 
me F + me * + me +... 
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—-m me orm(1-e") -(*) 


We have 
e&)-£(4 ¥x|-1 3 ea 
n iol a iol 


Since Xj; ¢ = 1, 2, ..., a is a Poisson variate with parameter m, 
E(Xi) = m. 


> E(X¥ j= Sia diner 
Riel n 
Hence X is an unbaised estimate of m. 


jel 


a E(e —XiAn e Xn... 


ss E ( Xn Xin .E ( oN 
(since X1, X2, ..., Xn are independent) 


.(**) 


Now E(e*) = E | exp =* x Xi ) 


E(e*) = 1 E(e*") 
Using (*) withk = ia we get 


E (e*¥ Yoel . “os (since X; is a Poisson variate with seer m) 


E (¢*) = n [exp { —m (1 - ee ) | = ‘exp mS e /ny| 


ae, eer -e vey “e@™ 


Hence Bas 


estimate of m. 


is not an unbaised estimated of e", though Y is an unbiasex 


(b) E(e kX) = » eke - p(x) =k e ** x 


x=0 rel 


= mk mh. me = mk exp { m(e* -1)}- k 
Example 7-36. IfX and Y are independent Poisson variates with means 
m, and m2 respectively, prove that the probability thatX — Y has the value ‘r’ 


ts the co-efficient of f in 
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exp { mit + mt —m —- m2 | 
[Delhi Univ. B.Sc. (Stat. Hons.), 1991, ‘89] 


Solution. Since X and Y 4re independent Poisson variates with means 
m, and mp2 respectively, 


e™ ‘x 
P(X =x) =: ary ee = 0, 1,2;...0 
- 2 ey (1) 
and P(Y = y) = y! ;y = 0, 1,2,...% 
P(X-Y=)r) = a r+sNiY=s) = poe r+-s)P(Y= s) 
Sea $a / 
ems mi** m** em} 
- Ore: sy hones ae ...[From (1)] 
s=0 
mi **m 
e™ — m2 
Too ET ...(2) 
We have e mts mae el! x gmt” 
(mt) (mt) ** 
ca 21 (r+s)! 
1 \2 -1ly\s 
anal ) + Cnt ) 
sa mii** m3 
.. Co-efficient of f ine gmteme = »> = 
4 (rts)is! 


Hence from (2), we get 
re | 
P(X-~Yer)=e™-™ x Coefficient of in &™'*™! 


-1 
g i ’ = is at 
= Coefficient of ti in @ ™>™*™'*+™ 


which is the required result. 


? 


Example 7-37. Jf X ‘is a Poisson variate with mean m, show that 
X—-—m 


m 


is a variable with mean zero and variance unity. Find the M.G.F. for this 


variable and show that tt approaches ef as m — &.Also interpret the result. 
[Delhi- Univ. B. Sc. (Stat. Hons.), 1937] 


x—m 
Solution. Let Y = 
xX—m 1 
E = E — E(X —-m)=0 
1) | =” | = EW m) 
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2 
X-—m 1 1 
VYN=E fm eee 
MG. of Y = My(t) = - E[ aia 
oo tvm cE aa 
oo tm am 
es 
Vm \x 
en tvm | em > (me (me 
x=2Q 
Nim Vim \2 
emmy ar at os | 
= ot ™ oxy (me’™) = exp [—m—1V¥m + me” ] 
2 3 
t t t 
= exp| -m—-tV¥m+m|1 + Sat oa rT | 


1> 1 ’ 
= €xp >! tT ay rg + 
Now proceeding to limit asm —> ©, we get 
lim _ tf 
paar My (t) = e ..:{*) 
Interpretation. (*) is the m.g.f. of Standard Normal Variate [c.f. 
Remark to § 8-2-5]. Hence by uniqueness theorem of m.g.f.’s, standard Poisson 
variate tends to standard normal variate as m— ©. Hence Poisson distribution 
tends to Norma! distribution for large values of parameter m. 
Example 7:38. Deduce the first four moments about the mean of the 
Poisson distribution from those of the Binomial distribution. 
Solution. The first four central moments of the binomial distribution-are 
i = 0, Mean = np 
M2 = npq, M3 = npq(q — p) and .(*) 
4 = npg (1 — 6pq) + 3n’ pq’ 
Poisson distribution ts a limiting form of the pincaua distribution under the 
following conditions : 


(i) n>, (ii) p—0,ie, g—>1, and (iii) np = A,(ay), is finite. 
Using these conditions, we get fron (*) the moments of the Poisson distribu- 
tion as 
Hi = 0 
Mean = lim (np) = 

2 = lim (npq) = on (np) - lm(q)=A°1=A 

v3 = lim [mpq(q —-p)] =A4:1(1-O0) RA 

Hs = lim [npq(1 - 6pq) + 3(mp)'q’] 
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=[A°1(1 —-601)+ 307-1] 24+ 32? 
Example 7:39. If X is a Poisson variate with parameter m and Y is 
another discrete variable whose conditional distribution for a given X is given by 


PY =r|X=x)= p(i- py"; 0<p<i1,r=0,1,2,..,x 


then show that the unconditional distribution of Y is a Poisson distribution with 
parameter mp. 


(Delhi Univ. B.Sc. (Stat. Hons.), 1993, Shivaji U. B.Sc. Nov. 1992} 
Solution. We are given that 


P(X =x) = * 


e™ x 


;x = 0,1,2,. 


and rorerixe) «(2 (1-py ‘srs 
P(X =xNYeur) = P(X =x)P(Ye=ar|X =x) 
m rp" spy’ 


em 
*. P(Y =r) = The unconditional distribution of Y. 


x! 
- > |<. (p)e” a - 


x! 
xr 


i 
®, 
2 
M4) s 
By 
ya 
; . 
> 
3, 
pat 
- 
a 
= 
: 
| 


e™ (mpy mm" (1 - py” 
7 ) (x-r)! 


(m(1-p)} ] 


(x-r)! 


= (mp ¥ aii-e) se r = 0, 1, 2.,... 
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Hence Y is a Poisson variate with parameter mp. 


Example 7:40. If X.and Y are independent Poisson-variates, show.that 
the conditional distribution of X given X + Y, is binomial. 
(Madras Univ. B.Sc. Main 1992; Delhi Univ. B. Sc. (Maths Hons.), 1988] 
Solution. LetX end Y be independent Poisson variates with parameters 
1 and p respectively. Then X + Y is also a Poisson variate with parameter 
+ B- 
P[X«r|(X+Yen)] © P(X= rNX+Ye n) _ PX= rNYeon-r) 


P(X+ Yan) P(X+ Yen) 
_ P(X =r)P(Y¥=n-r) 
P&®+Yean) [since X and Y are indepdent] 


or ey ae 

eT! © (ee)! 
eth) (Deny? 
n! 


P[X=r|(X+Yen)] = 


~ nm! f/_k ) (_w_) 
ri(n—ryt-lX+ep] [XR 4p 


n r at eee Xr 
) (rp qd’ ", where p Kap an i-P 
Hence the conditional distribution of X¥ given X¥ + Y = n, is a binomial 
distribution with parametersm and p = A/(A + p). 


Example 7:41. Jf Xisa Poisson variate with parameter m and pr, ts the 
rth central moment, prove that 


m['Ci br-1 + “C2 Wr-2 +... +'Cr Ho] = brat. 
[Delhi Univ. B.Sc. (Stat. Hons.) 1990) 
Solution Since X ~P(m), its probability function is given by 
— 


p(x) = en: mt 


x! ,x = 0,1,2,...;m>0 


By definition, 
Meet @ E[X-EQ)Y*) = E[X-my*! 


: E(x ~ mY * p(x) 


xe 


@ wae | 0 
= E (x- mY (x -m) 
= 5 z(e- myer 5 (ley eo” mf 
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~ (y- m+ 1 -e" + m* 
“=, y — Mir, (x-1=y) 


‘=m: Ey - m+) ply) — mtr 
a 


sms (y-my + Cily — myo! + 'Co(y - my”? 
y=0 


+... + Cri (y — mm) + 1] p(y) —- mu, 
=m pe + Cr py-1 + "Copr-2 +... ¢ 'Cepe] — mp, 
= m{'Cy Met + 7C2 pr-2 +... + "Cr po |}. 
Example 7-42... If X has a Poisson distribution with parameter d, show 
that the distribution function of X is given:by 
1 a —f x ° 3 
F(x) = ris ns e't*dt; x = 0,1,2,... 


[Delhi Univ. M. Sc. (Stat) 1986) 
Solution. I[fX is a Poisson variate, then 


-h 42 
P(Xex)=* Kx =0,1,2... (*) 


Consider the anno gamma integral; 


1 s ; tec. <6 
I, = ae } é't dt; (x is a positive integer ) 


7 etl aime bE L ay 
* - =¥T 
—A5x 
aw 
: Spt et (") 
which is a reduction formula for /,. 
Repeated applications of (**) gives 
i,2 enn + et ie + + e*n +h 
, 7! (x-1)! °°" 611 
But = fe ‘d= |-e* [ = ¢* 
~~, We* wm 4 


2% Ree +he + + Pre aes 


21 nea ha 
= P(X¥=0) + P(X¥.©1)+...+ P(X=x) [From(*)] 
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= P(X sx) = F(x) 
where F (-) is the eae function of the r.v. X. 


=> raed fe am an leite 


(T(x + 1) = x!, since x is a positive integer.) 
Remark. This result is of great practical utility. It enables us to represent 
the cumulative Poisson probabilities (which are generally tedious to compute 
numerically) in tertns of incomplete gamma integral, the values of which are 
tabulated for different values of A by Karl Pearson in his Tables of — 
{-Functions. 
7-3-1060. Recurrence Formula for the Probabilities of Poisson Distribu- 
tion. (Fitting of Poisson Distribution). For a Poisson distribution with 
parameter A, we have 


--A 92 
p(x) = : K ;x = 0,1,2,...,2 
Bae 
and P(x# 1) Tap et ULB 


p(x +1 rs a 
p(x) bet iad ieee amas 
which is the required recurrence formula. 

This formula provides us a very convement method of graduating the given 
data by a Poisson distribution. The only probability we need to calculate is p(0) 
which is given by p(0) = e~*, where A is estimated from the given data. The 
other probabilities, viz., p (1), p (2).... can now be easily obtained as explained 
below: 


7P (x) ...(17°20) 


P(1) = [p(x +1)]s-0= Fray p (0), 


x«9 


p(2) = [p(x +1)jx1* frst p (1), 
rel 


p3) = [p( + 1)] 2° |— | p (2), 
xe2 


and so on. 

Example 7-43, After correcting 50 pages of the proof of a book, the 
proof reader finds that there are, on the average; 2 errors per 5 pages. How many 
pages would one expect to find with 0, 1, 2, 3 and 4 errors, in 1000 pages of the 
first print of the book ? (Given that e- 04 = 0:6703) 


[Nagpur Univ. M.A. (Eco.), 1989} 
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Solution. Let the random variable X denote the number of errors per page, 
Then the mean number of errors per page is given by : 


AX = 2/5 = 0-4 
Using Poisson probability law, probability of x errors per page is given by: 


e* x eo 7 
P(X =x) = p(x) = <* a SOS, 01,2. 


Expected number of pages with x errors per page in a book of 1000 pages 
are : 


eo 
1000 x P(X =x) = 1000 x <P 4Y 5 2 9,1,2,.. 


Using the recurrence formula ( 17-20 ), various probabilities can bai casily 


calculated as shown in the following table. 


No. of errors Probability _ Expected number 
per page (X) p(x) of pages 
1000 p(x) 
0 a" - _ = 06703 670-3 = 670 
J p(l) = ot p(0) = 0:26812 268-12 = 268 
2 p(2) = 7 p(1) = 0053624 53-624 = 54 
3 p(3) = van = 00071298 7128 * 7 
4 p(4) = =o Za 7 P(3) = 000071298 0-71298 = 1 


Example 7:44. Fita —— distribution to the following data which 
gives the number of doddens in a sample of clover seeds. 


No. of doddens: 0 I 2 3 4 5 6 7 8 
(x) 


Observed frequency: 56 156 132 92 37 22 4 0 1 
(/) 


Solution. 
1 986 
Mean = 7 >) & = app 7 1972 
Taking the mean of the given distribution as the mean of the Poisson 


distribution we want to fit, we get A = 1-972, 


Pe 
and sell ciation 


p(0)=¢* = zim 
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-, logiop (0) = —1:972 logioe = — 1-972 x 0-43429 
= — 0-856419 = T-143581 
p(0) = 0-1392 


Using the recurrence formula (17-20) the various probabilities, 
yiz.p(1),p(2 ),-.-, Can be easily calculated as shown in the following table : 


Expected frequency | 
N. p(x) 


0 0-13920 69-6000 
1 0-27455 137-2512 
2 0-27006 135-3296 
‘3 0-17793 88-9566 
4 0-10964 43-8556 
5 0-03459 17-2966 
6 001137 56846 
7 0-00320 16013 
8 0-00078 0-3942 


Since frequencies are always integers, therefore by converting them -to 
nearest integers, we get 


Observed frequency : 56 156 132 92 37 22 4 #90 1 


Expected frequency : 70 137 135 89 44 17 6 2 0 


Remark. In rounding the figures to the nearest integer it has to be kept in 
mind that the total of the observed and the expected frequencies should be same. 


EXERCISE 7 (b) 
1. (a) Derrive Poisson distribution as a limiting form. of a binomial 
distribution. (Madras Univ. B. E., Dec..1991] 


Hence find B; and B2 of the distribution. 

Give some examples of the occurrence of Poisson distrhution in different 
fields. 

(b) State and prove the reproductive property of the Poisson distribution. 
Show that the mean and variance of the Poisson distribution.are equal. 

Find the mode of the Poisson distribution with mean value 5. 

(c) Prove that under certain conditions to be stated by you, the number of 
telephone calls on 4 trunkline in a.given interval of time has a Poisson distribution. 

(Calcutta Univ. B.Sc. (Maths Hons.), 1989] 


(d). Show that for a Poisson.distribution, the coefficient of variation is the 
reciprocal of the standard deviation. 
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2. (a) Iftwo independent variables X; and X2 have Poisson distribution 
with means A, and Az respectively, then show that their sum % + X2. is q 
Poisson variate with mean Ay + Az. 

Does the difference of two independent Poisson variates follow a Poisson 
distribution ? Give reasons. [Sri Venketeswara Univ. B.Sc., 1991} 

(b) Prove that the sum of two independent Poisson variates is a Poisson 
variate. Is the result true for the difference also ? Give reasons. 


[Delhf Univ. B.Sc. (Stat. Hons.) 1989) 
(c) IfX1, X2,...,X_ are independent random variables following.the Pois- 
k 


son law with parameter mm, m,.... me respectively, show that 2 X; follows the 
F tol 
Poisson law-with parameter ©. mi; 
iol (Madras Univ. B. E., 1993} 
3. (a) Prove the recurrence relation between the moments of Poisson 
distribution 


du, z 
Ure 2A [rv + rat where wu, = Fe 


(j-aAy 


where p, is the rth moment about the mean A. foe ‘site the skewness and 
kurtosis of Poisson distribution. 


(Delhi Univ. B. Sc. Gtat. Hons.) 1989,’ 86; Utkal Univ. B. Sc. 1993] 

(b) Let X have a Poisson distribution with parameter X > 0. If r is a 
non-negative integer and if p,’ = E (X" ), prove that 

Mret = A [u: + cer 

(Madras Univ. B. Sc. Nov. 1988] 

4. What do you understand by (c) cumulants, (#) cumulative function. 

Obtain the cumulative function of a Poisson distribution with parameter A. Hence 


or otherwise show that for a Poisson distribution with parameter A, all the 
cumulants are A. 


5. For the Poisson distribution with parameter A, show that the 7th 
factorial moment 1) is given by 1(r) = MA 


Show further that py = A, (3) = — 2A and way = 3A(A + 2) 
(a) If X and Y are independent r.v. s.’ so that ¥ ~P(A) and 
X+V-~P(A + wp), find the distribution of Y. [Ans. Y ~P()] 


(6) IfX ~P(A), find 


(i) Karl Pearson’s coefficient of skewness 
(uz) Moment measure of skewness. 
Is Poisson distribution positively skewed or negatively skewed ? 
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7. (a) It is known that the probability that an item produced by a certain 
machine will be defective is 0-01. By applying Poisson’s approximation, show 
that the probability that random sample of 100 items selected at random from 
the total output will contain no more than one defective item is 2/e. 


(b) The probability of success in a trail is known to be 10~. It is possible 
to repeat the trial independently any desired number of times. Do you think that 
the number of successes in a series of trials, if the number of trials in the series 
increases indefinitely, will tend to follow a Poisson distribution ? Give your 
reasons. 


(c) The probability of getting no misprint in a page of a book is e*. What 
is the probability that a page contains more than 2 misprints ? [State the 
assumptions you make in solving this problem.] [Bombay Univ. B.Sc., 1989] 


8. In a certain factory turning out optical lenses, there is a small chance 
{/500 for any lens to be defective. The lenses are supplied in a packet of 10. Use 
Poisson distribution to calculate the: approximate number of packets containing 
no defective, one defective, two defective and three detective lenses in a 
consignment of 20,000 packets. 


Ans. 19604, 392, 4 and 0 packets. 


9, Red blood cell deficiency may be determined by ex»mining a specimen of 
the blood under a microscope. Suppose a certain small fixed volume contains on 
the average 20 red cells for normal persons. Using Poisson distribution, obtain 
the probability that a specimen from a normal person will contain less than 15 
red cells. 


14 
Ans. X { e-29 (20)*/x !} 
ts 


10. Assuming that the chance of a traffic accident in a day in a street of 
Delhi is 0-001, on how many days out of a trial of 1,000 days can we expect : 


(i) no accident 

(ii) more than three accidents, if there are 1,000 such streets in the whole 
city ? 

11. Patients arrive randomly and independently at a doctor’s surgery from 
8-0 A.M. at an average rate of one in five minutes.The waiting room holds 2 


persons. What is the probability that the room will be full when the doctor 
arrives at 9-0 A.M. (Estimate the probability to an accuracy of 5 per cent.) 


Ans. 53-84 % 


12. An office switchboard receives telephone calls at,the rate of 3 calls per 
minute on an average. What is the probability of receiving (4) no calls in a one- 
minute interval, (i) at the most 3 calls in a 5-minuté interval ? 


Ans. (i) 0-0323, (ii) 0 


13. A hospital switchboard receives an average of 4 emergency calls in a 10- 
minute interval. What is the probability that (/) there are at the most 2 
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emergency calls in a 10-minute interval, (ii) there are exactly 3 emergency calls 
in a 10-minute interval ? 


Ans. (i) 1374, (ii) (32/3) e° 


14. (ay A distributor of bean seeds determines from extensive tests that 
5% of latge batch of seeds will not germinate. He sells the seeds in packets of 200 
and guarantees 90% germination. Determine the pene that a particular packet 
will violate the gpirence: 


Ans. 1 — = (eo !° 10'/r!) 


(6) Inan siceatt telephone exchange the probability that any one call is 
wrongly connected is 0-001. What is the minimum number of independent calls 
required to ensure a probability of 0-90, that at least one call is wrongly connected? 

15. (2) Fit a Poisson distribution to the following data with respect to 
the number of red blood corpuscles (x) per cell : 

xX: 0 1 2 3 4 5 
Number of cells f : 142 156 69 27 5 1 


(b) Data was collected overa period of 10 years, showing number of deaths 
from horse kicks in each of the 20 army corps. From the 200 corps-years, the 
distribution of deaths was as follows : 


No. of deaths : 0 1 2 3 4 
Frequency : 122 60 15 2 1 
Graduate the data by Poisson distribution and calculate the theoretical 
frequencies. 
Given e™: 0:6703 0-6065 05488 0-4966 


m: 0-4 0:5 0-6 0-7 
(c) Fit a Poisson distribution to the following data and calculate the expected 
frequencies :- 
x: 0 1 2 3 4 5 6 7 8 
f: 71102 «112, 17s S—éD? 11 3 1 1 
16. (a) IfX is the number of occurrences of the Poisson variate with mean 
X;show that: P(X 2n)-P(X2n+1)= P(X = n) 
(b) Suppose that X has a Poisson distribution. If 
P(X = 2) =i P(X= 1). 
Evaluate (i) P(X = 0) and (i) P(X = 3) [Ans. (i 0-264] 
(c) IfX hasa Poisson distribution such that 
P(X = 1) = P(X = 2), find P(X = 4). [Ans 0-09 ] 
(c) Ifa Poisson variate XY is such that 
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P(X =1)=#2 P(X =2), 
find P (X = 0), mean and the variance. 
(d) If fora Poisson variate X, E(X*) = 6, what is E (X) ? 
(e) If X and Y are independent Poisson variates having means 1 and 3 
sespectively, find the variance of 3X + Y. 


17. Show that for a Poisson distribution 
@ Moynn=1, (i) BY? (B2-3)mi'o =1 
18. Show that the function which generates the central moments. of the 
Poisson distribution with parameter A is 
M(t) = exp| A(eé -1- | 
Show that it satisfies the equation 


dM(t) dM(t) 
ae AtM(rt) +A qh 
19.(a) |The random variable X has p.d.f. 
-¢ OF 
f(x) =e riod 0, 1, 23... 


= Q, elsewhere 
Find the m.g.f. of Y = 2¥ — 1 and Var (Y). 
(b) Identify the distribution with the following mgf’s : 
Mx(t) = (0:3 + 0-7 e)* 
My(t) = exp [3(e - 1)] 
Ans. X ~B (10,0-7), Y ~P(3). 
20. IfX has Poisson distribution with parameter A, then 
1 ~22 
P [X ts even ] 5 [1+ 


‘(Delhi Univ. B. Sc. (Stat. Hons.) 1991] 
21. (a) Themg.f. of arv. isX is exp 4 (e —1) ] . Show that 
P(wp—20<X <p + 20) = 0-931 
Hint. X~P(A = 4); 
Required Probability.= P(0 < X <8) = P(1<X s7) = 0-931 
(b) If X ~P(A = 100), use Chebychev’s inequality to determine a 
lower bound for P (75 < X < 125)  [Ans, 0°84] 

22. IfX ~P(m), showthatE |X —-1| =m-—1+2e” 

[Delhi Univ. B. Sc. (Maths. Hons.), 1983] 


Hint. E|X-1J= 2 |[x-1le"m'/xt=e™+ BLD). ley? 
x=0 xu2 *:> 


a et ee ee 
= ae fesse x! 


1.68 Fundamentals of Mathematical Statistics 


23. If X ~P(A) and Y[X = x ~ (B (x, p), then prove that 
Y ~P(Ap). 

24. If the chances of 0, 1, 2, 3... events from one source are given by a 
Poisson distribution of mean m, and the chances of 0, 1, 2, 3,... events from another 
source by a Poisson distribution of mean m2, show that the chances of 0, 1, 2, 3.,... 
events from either source are given by 
(my +m) 

a, al jee 
Show that the sum of any finite number of Poisson variates is itself a Poisson variate 
with mean equal to the sum of separate means. 


25. X isa Poisson variate with mean A. 
Show that E (X°) = XE(X = 1). 


If A = 1, show that E |X - 1] = = 


eo (mtm) {1 (m, + m2), 


26. Show that the mean deviation about mean for Poisson distribution 
— mm x 


em 
x! x = 0,1, 2.;;. 


p(x) = 

e” - m* 

is (2p). at 
where wz is the greatest integer contained in( m + 1). 


[Delhi Univ. B. Sc. (Stat. Hons.), 1988,’ 84] 


27. LetX, Y be independent Poisson variates. The variance of X + Y is9 
and 


P(X =3|X + Y =6) = 5/54 
Find the mean of X. [Ams. > (9 + 3 V3) ie. 1902 or 7-098 ] 


28. IfX is a Poisson variate with paramter m, show that 
r 
P(X <r)< ote r= 0,1,2,... 


Deduce that E (X) < e”. [Delhi Univ. B.Sc. (Maths. Hons.), 1989] 
29. (a) The characteristic function of a variate X is 


6 
1 2 it) it . 
py (t) E + 3° [exp {— 3(1-e )}]. 
Recognise the variate. ~ 

[Burdwan Univ. B. Sc. (Maths. Hons.) 1989) 


Hint. X = U + V, where U ~B (63) and V ~P(3) are indepen- 
dent r.v.’s 


(b) Identify the variates X¥ and Y where : 
Mx(t) = (1/27) (1 42e) - exp|3(e! a 1)| 
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My(t) = (1/32) (1+ e&)- exp | -2 (1 - e)| 


[Delhi Univ. B. Sc. (Stat. Hons.), 1987, 84] 
Ans. X =U+V;U-B (n=3,p=2/3) and V -P(=3 ) are independent. 
Y= U,+V1;U, ~B(n& 5,p= 1/2) andV,; ~P( X= 2) are independent. 
30. If X and Y are correlated variates each having Poissen distribution, 
show that X + Y cannot be a Poisson variate 
[Delhi Univ. B. Sc. (Maths Hons.), 1988; Poona Univ. B.Sc., 1989] 
Hint. Note that for Poisson variate mean and variance are equal. Let 
X¥~P(A), Y ~P( pe); ‘CX, Y) correlated. 
ow E(X + Y) = E(X)+E(Y) sX +p 
Var (X + Y) = VarX + VarY + 2Cov( X,Y) 
=A+n+2pvau,(p #0) 
Since E(X + Y) » Var (X + Y); X + Y cannot be a Poisson variate. 


31. Let X,Y,Z be independent Poisson variates with parameters 
a,b and c respectively. Obtain : 


(i) mg.f.of X + 2Y + 3Z, 
(ti) Conditional expectation of X given X + Y+ Zen 
| . (Indian Civil Services, 1985) 
Hint. My . 2y + 3z(t) = My(t) > My (2) > Mz(3¢) 
v= exp|a(e'— 1)+ b(e*- 1)+ c(e"- 1) ] 
P(Xex|X+ Y+Zen)«= P(X= xNX+ V+ Zan) Fixe, Zap id 


Pi Xox) P (V+ Zeon-x (-.-X, Y, Z are indep.) 


P(X+V+Z=n) 
e°-a@ ? et). (b+ cl 
x'! (n—x)! 


ms oo ; 


2 mt fia) i f_b+e 
x!(n-x)! lat+be+ec a+b+e 
=> X |(X¥+Y+Z=n) ~B{n, p = a/atb+c)| 
We na 
> EA Se SS oe arc 
32. The joint density of r.v.’s X and Y is: 
f(xy) = e? / (x! Cy —x)!];5 y = 0,1,2,...; x = 0,1,2,...,y. 


Find the m.g.f. M(t,.t2): of (X,Y). and correlation coefficient between 
X and Y. Show that the marginal distributions of X and Y are Poisson. 
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co y a 
; ie Axthy e 
Hint. M (11, 2) nea x Free 


rs) hy y 
ee? x |—) E'C,-(et¥ 
y=0| Y> | x=0 


ever z { é(1+e)] / ys} 


y=0 
=? - exp {e? (1 + e)} 
M(n,0)= exp 2(e"- 1) => X ~P (A= 1) 
M(0,t2) = exp 2(e*-1)] = Y ~P(p = 2) 


Observe M (f1, f2) # M(t,0) x M(0,t) => X and Y are not inde. 
pendent. 


E(X) =1, Var(X)=#21; E(Y) =2 = VarY. 
a” M (ti, t2) 


. ae os 3 
E(XY) Ot, 0i2 \ 
Gov(X%¥) | 3-1 x2 
0(X,Y) ee 1x v2. 1/v2. 


33. The joint p.g.f. of the r.v.’s X and Y is given by: 
P (51, s2) = exp [aes —-1)+ b(s2-1)+c(5; ~1) (9 - 1), 
a,b,c, are all positive. Find p (X, ¥ ) 
Hint. Py(s1) = P(s1,1) = exp [a(s; —1)] > X ~P(a) 
Py (s2) = P(1,s2) = exp [b(s2 —-1)] = Y ~P(b) 


a P(s1,52) | 
e(x¥) =( dd } =c + ab. 


" Cov(X,Y) _ (c+ ab)- abc 
Pry Oy Oy Va Vb Vab 


34. An insurance company issues only two types of policy, household and 
motor. It has carried out an investigation into the experience of a gtoug of 
policyholders who held one of each type of policy over a particular period and il 
has discovered that within that group and over that period the mcan number of 
claims per household policy was 0-3 and the mean number of claims per motor 
poticy was 0-8. Assume that the number of claims under each type of policy is 
independent of the number of claims under the other type of policy and that each 
can be represented by a Poisson distribution. 

(a) If the number of claims per policyholder is the sum of the number of 
claims undef each of his two policies, state with reasons how the number of claims 
per policyholder, within that group and over that period is distributed, and 
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(b) Calculate to the nearest whole number, the percentage of policyholders 
within that group and over that period who made more household claims than motor 
claims. 


Hint. Household claim, X¥ ~P(-3) and Motor claim, Y ~P(8) 


Required Probability= P(X > Y) = = | ZP(YerNXerts) 
r@(0/ s=0 


“5S [Pvene(teres)] - as = 


7=0 380 720 320 


eves Sgt 5 fat 


r=a0 


aaa 


Nate ee 


‘ 3 
manete |15+S y (1+ 3)+ [1+ 3+) 


(-8)* | 09 , 027 
ar 1+ 3 + 3 + —— 31 + ... 

35. (i) Amnevent occurs instantaneously and is equally likely to occur at 
any instant. There is no limit on the number of occurmeces that may happen in any 
interval of time, but the expected number in a given time interval is 7. Prove that 
the probability of the event occurring exactly 7 times in an interval of the same 
duration is (T” e")/r'!. 

(i) An insurance company which writes only fire and accident business 
defines a major claim as one which costs at least Rs. 50,000 for an accident claim 
or Rs. 100,000 fora fire claim. Any excess over these amounts is paid by reinsurers 
and hence every major claim is recorded at a cost of Rs. 50,000 or Rs. 100,000 
respectively. The company divides the year into equal monthly accounting periods 
and a report is produced of the recorded cost of major claims. The expected number 
of major accident claims is 0-2 per month-and of major fire claims 0:5 per month. 
Calculate the probability that in a particular month the recorded cost of major 
claims is Rs. 2,00,000 or more. 


36. (a) The number of aeroplanes arriving at an airport in a 30 minute 
interval obeys the Poisson law with mean 25. Use Chebychev’s inequality to find 
the least chance, that the number of planes to arrive within a given 30 minutes 
interval will be between 15 and 35. [Sri Venketeswara U. B.Sc. 1992] 


(6) Suppose that the number of motor cars arriving in a certain parking lot 
in any 15 minutes period obeys a Poisson probability law with mean 80. Use 
Chebychev’s inequality to determine a Jower bound for the probability that the 
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number of motor cars arriving ina given 15 minute period will be between 60 ang 
100. (Madras U. B.Sc. Nov. 1993) 

7-4. Negative Binomial Distribution. The equality of the mean and 
varaince is an important characteristic of the Poisson distribution, whereas for the 
binomial distribution the mean is always greater than the variance. Occasionally 
however, observable phenomena give rise to empirical discrete distnbutions which 
show a variance larger than the mean. Some of the commonest examples of such 
behaviour are the frequency distributions of plant density obtained by quadrant 
sampling when the clustering of plants makes the simple Poisson model inap- 
plicable. It has been shown by different investigators that insuch Cases the negative 
binomial distribution provides an excellent model because this distribution has a 
variance larger than the méan. Bacterial clustering (or contagion), e.g., deaths of 
insects, number of insect bites leads to the negative binomial distribution and the 
distribution also arises in inverse sampling from a binomial population or as a 
weighted average of Poisson distribution. This important probability distribution 
is Sometimes also referred to as the Pascal distribution after the French mathe. 
matician Blaise Pascal (1623-1662), but there seems to be no historical justifica- 
tion. The negative binomial] distribution can be derived from empirical 
considerations in many ways. Here we consider the Binomia] probability situation 
with some modifications. 

Suppose we have a succession of n Bernou!li trails. We assume that (é) the 
trials are independent, (i) the probability of success ‘p’ ina trial remains constant 
from trial to trial. 

Let f(x; r, p ) denote the probability that there are x failures preceding the 
rth success inx + r trials. 

Now, the last trial must be a success, whose probability i is p. In the i femaining 
(x +-r — 1) trials we must have (r — 1) successes whose probability is given 
by 

x+r-—1 po 
r—i1 q¢ 
Therefore by compound probability theorem, f(x; 7,7) is given by the product 
of these. two probabilities, i.e., 
—_ — rT— 
| rte baer 1 ere 1 rd 

Definition. A random variable X ts said to follow a negative, binomial 

distribution if its probability mass function is given by 


Ul ted r 
p(x) = P(X =x)= [" er ‘| pd ;x = 0,1, 2... 


; ‘= 0, otherwise " 1(721) 


t 


Also 


r t 


(an Gan ee UO) 
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o (etr-i(etr~-2)..(r+ ir 
| xf! 
~ (1 (rr) (-r-1)... (rx 4+2) (7-441) 
x! 


—f 
-(-1F (5 
-—f 
Pp (-q) 3x = 0, 1, es 
p(x) = (: 


0, otherwise ( 7214) 


which is the (x + 1 )* term in the expansion of p’ ( 1 -q)', a binomial 
expansion with a negative index. Hence the distribution is known as negative. 
binomial distribution. Also 


r =F r -? 
2 Ps) PY (y} a px(1-q)y =1 

Therefore p(x) represents the probability function and the discrete vari- 
able which follows this probability function is called the negative binomial variate. 


If p= Zand q = 5 so that Q-P=1, (-.-p+qz=1) 
SF\ cof PY, . 
then p(x) = (‘y} 2 (- 3] pr manna 
0, otherwise o( 7°21 5) 


This is the general term in the negative binomial expansion(Q — P)’. 

Remarks. 1. p(x) in(7-21) or (7:21a) is also'sometimes written as 
f(x5np). , 

2. Some Important Deductions. 

(a2) Geometric Distribution. If we taker = 1 in (7-21), we have 

p(x) = dp; x = 0,1,2... 
which is the probability function of geometric distribution (cf. § 7-5 page 7°83). 

Hence negative binomial distribution may be regarded as the generalisation 
of geometric distribution. 

(b) Pascal’s Distribution. The negative binomial distribution (7-21 a) 
when regarded as one having two parameteis p and r is known as Pascal’s 
distribution. 

(c) Polya’s Distribution. If we take 

r -4°P = a qexi-p eT in (7°21 a) , we get 
r(r+1)(r+2)...(r+x-1) ph mg 


p(x) = = 
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_ (1+ B)(1+ 2B)...[14 al 1 \ “ } 
x! 1+ Bu 1+ By 
(x =0,1,2,...) ...(7-21¢) 
which is known as Polya’s distribution with two parameters, B and uw. 


(qd) Second Form of Gewmetric Distribution. Taking 6 = 1 


in 
Polya’s distribution (7-21 c), we get 
x 
_ {_} U eee 
_ oe en (3 culo toe = Aa 
which is geometric distribution (c.f. § 7-5) with 
ee eo eee eee ee 


. 7-41. Moment Generating Function of Negative Binomial Distribu. 
tion. 


Mx (i) = E(é*) = z e p(x) 


Ate 


=(Q-Pé)y’ (7-22) 
' a se 
wi’ = (4 my) on [Er PEO =P], g 
= rP 
. Mean of the negative birumial distribution is rP. 
, (@& 
"2 = (< M( t ) (7°22 a) 


=Q ; 
as (rPd(Q- Péy'-'s (- r- 1)rPé(Q -Pely'"*(—Pe')) 
=rP+r(rt 1) P? 


wo = pe — pte (r+ 1) P+ Prep? = PQ ..(7-226) 
AsQ > 1, rP < rPQ,i.e., Mean < Variance, which is a distinguishing 
feature of this distribution. 


7-42. Cumulants of Negative Binomial Distribution. 
Kx(t) = log My(t) = — r log (Q.- Pe’) 


. ft 2 € 
~-rglo-Pliscereae te ]| 
2 4 
ee ee 
w-riog[1-P(te estat || 


(-- Q-P=1) 
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Proceeding as in § 7-2-8, we will get (on replacing n with —r and p with 


-P): 


Mean= Ki = rP 
M2 =K2=rP(1+P)= rPQ | wo 7°23): 
ponte (1432 + 2°?) «1B (14P)(1 92°) =1F0(0+P) 
Ka = 7P(1+ P)(1 + 6P + 6P") = rPQ(1+ 6PQ) 
spa = ka + 33 = rPQ{1+ 3PQ(r+2)] 
Since Q = 1/p, P = qQ = @/p, we have in terms of p and q, 
Mean = rq/p, Variance = rq/p” M3 = rq(1 + q)/p° 
a = rq [p’ + 3q(r + 2) |/p* 
5, - 8, Uta 


B3 rq 
beets * + 3q(r +2) (7-23 a) 


2 rq 
VB = (1 + q)/Vrq 
= Br -3 = (p’ + 6q)/rq 
7:43. Poisson Distribution as a Limiting Case of the Negative Binomiai 
Distribution. Negative binomial distribution tends to Poisson distribution as 
p 0, r ~ © such thatrP = A‘(finite). Proceeding to the limits, we get 


lim p(x) = lim ween, pd 


| 


yl 


r 
x 


lim (* ae ‘| om (5) 


_ lim commen | Sammars ie | P 


ra © x! +P 
_ lim Fi leer | ara, 
r—>o)}y! r r 
1 x 
: 1 ay P , 
" ; (t+ -}: Lo (1+ PY)’ (3 PF) | 
1 lim nvr rP 
3 tm fase a] 
— lim (1 al lim (1 +) ee 
x! ro r r—>@ r 
x -h pew 
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which is the probability function of the Poisson distribution with parameter ‘)’, 
7-44. Probability Generating Function of Negative Binomial Distriby. 
tion. LetX bea random variable following negative binomial distribution, then 


Pr(s) = E(s*) = Ee p(s) 


@ a ; 
~ J ( . \p (- qs)" 
‘220 [Using 7-214) | 
=p (1-—qs)’ = [p/1-a4s)] »-(7-24) 
Example 7:45. An item ts produced in large numbers. The machine is 
known to produce 5% defectives. A quality control inspector is examining the 
items by taking them at random. What is the probability that at least 4 items are to 
be examined in order to get 2 defettives ? 


Solution. _ If 2 defectives are to be obtained then it can happen in 2 or more 
trials. The probability of-success is 0-05 for every trial. It.is a negative binomial 
Situation and the required probability is 


= P(X =4)4P(X*5) +... 
- (321) (0-05)? (0-95 J°~2 


2 
= 1-—[(005)? + 2(005 )* (0-95 ) ] 
= 0-995 
Example 7-46. If X ~B(n,p) andy has negative binomial distribu- 
tion with parameters r and p, prove'that 
Fx(r-1)=1-Fy(n-r) 
[Dethi Univ. Sp!. Course (Statistics Hons.), 1987] 


3 
~1-5 (3 = 1] (0-05 )? (0-95 JF~? 
2 


Solution. 
1-Fy(n-—r)= a la aa ea n-r) 


: y (@sulae [z= y—-(n-r+1)] 


Ro-rel 


-ea (Ett) g 


z=0 


re 318) Lal} 
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(i) Ba ig 2) 


er ZN) 3 (tale 


zer-1~k 


Changing the order of summation and noting that © ne O;a<r 
[ g ng that | 


pq"! y (2) > (' : ie ate 


keo0 te 


tez—(r-1-k) 
r= 


“it 2 {(%) a -(1- aye} 


r-1 

n\ & oe 

Puan 
= P(X <(r-—1)) = Fx(r-1) 

Example 7:47. (Banach’s Match-box Problem). A certain mathe- 
matician always carries two match boxes (initially containing N match sticks). 
Each time he wants a match-stick he selects a box at random, inevitably a moment 
comes when he finds a box empty. Show that the probability that there are exactly 
r match-sticks in one box when the other box is found empty is 


=P 
2N -7r 1 
n ae le) 
Solution. Let the two match boxes be numbered 1 and 2. Let the choice 
of the Ist box be regarded as failure and that of second box be regarded as a success. 
Since the mathematician selects the match box at random, 
p = Probability of selecting second match box = 2 
= q = 1 _ Pp = 14 
The second box will be found empty if it is selected forthe (NV + 1 )sttime. 
At this stage, the first box will containexactly r matches if( N — r) matches have 
already been drawn from it. Hence the second box will be found empty at the stage 
when the first box contains exactly r matches if and only if (N — r) failures 
precede the (N+ 1.)st success. Thus in a total of N + 1+ (N—-7r) 
= 2 N —r + 1, trials the last one must be success and: out of the remaining 
(2N — r) trials we should have (N — r) failures and N successes. 


Probability that second box is found empty when there are exactly r 
matches in first box is 
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3N— + ay (ay 1- oN — ch ce 
("w) (2) (2) 2° Ew} (2) 
Similarly, the probability that first box is found empty, when the second box 
contains exactly r matches is given by 


ON - r\ 1 Were+l 
N }\2 
Hence the required probability that one match box is found empty when the 
‘other contains exactly r matches is 
se en i. NF 1)" 
: N 2 N 2 
Remark. The statement that ‘he finds the box empty’ implies that when 
he used the last match.:in this box, he did not throw it away, but instead put it back 
in his pocket. Thus there is a difference between ‘the box i is empty’ and ‘the box 
is found empty’. 


The box becomes empty when the Nth match was taken from it but it is found 
to be empty only when it is selected for the (N + 1 )st time. 


Example 7-48. X is a negative binomial variate with pf. 


k 1 
f(x) = ar e |e. ee 
- 0 


, otherwise 


Show that the moment recurrence formula is 
; dup rk 
Mre+l1 = q dq’ pi 
State how the moments of negatiye binomial variate can be written from the 
corresponding formulas for binomial variate. 
[Punjab Univ. B. Se. (Maths Hons.) 1990] 
Solution. For Negative Binomial Distribution with parameter x and p, 
Mean = k + g/p = y, (say). 


My = E (2 —p)' f(x) 


Differentiating w.r.to % = get 


ola) tale" 


Pr 


ee] 
“ales beaeoeai 
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dq p’ P 
kq) k+x-1\ x1 k kq 
+ x - x- 
r+1 

1 kq ; 

=m — — Meer + — x- - f(x 
--- Wend + Bret 
“> “q r+ 


=e, Mr+1 ™ q -- pe ;r = 1,2,3,... 
7-4-5. Deduction of Moments of Negative Binomial Distribution From 
Those of Binomial Distribution. If we write 
p=1/2, q = P/Q such that Q - P = 1, 
then the m.g.f. of negative binomial variate X is given by [c.f. § 7-4-1): 
Mx(t)=(Q-Pey" ..(*) 


This is analogous to the m.g.f. of binomial variate Y with‘parameters n and 
p’, vi2z., 


My(t) = (q' 4 p' ey; q’ = 1 - p' s(**) 
Comparing (*) and (**), we get 
q=Q,p =-Pandne«-—k ei?” *) 


Using the formulae for moments of binomial distribution, the moments of 
negative binomial distribution are given by 


Mean = np’ = (-k) (- P) = kP 
Variance = np’q' = (-k) (- P)Q = k PQ 
3 =np'q'(q'~ p’)= (-k)(-P)Q(Q+P)=kPQ(Q+P) 
ug snp d [1+ 3p'q'(n - 2)]) 
= (-k)(- P)Q[1 + 3(- P)QO(-k-2)] 
= kPQ[1+3 PQO(k +2)] 
Example: 7-49. Prove that the recurrence formula for negative binomial 
distributionis: f(x +1;47,p).= ~<— qf (x; 7, p) 


x+1 
(Utkal Univ. M.A., 1990) 


Solution. We have 
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f(x3rp) = (* en ‘|e a 


r-—1 
f(x +1377) = (ri )ee” 
f(xt+1srp) _ (x+r)!(r-1)!x! _x+r 


f(xsr,p) (r- Seen 1y14 “x41? 


=> 


= ‘a f(x3np) 


This recurrence relation is useful for fitting of the negative binomial distribu- 
tion to the given data as discussed in the following example. 
Example 7-50. Given the hypothetical distribution : 


No. of cells : 0 I 2 3 4 5 Total 
(x) 

Frequency : 213 128 37 18 3 1 400 
(f) 


Fit a negative binomial distribution and calculate the expected frequencies. 


a sheen BE ate rg 
Solution. ;' = Mean Sf 7 200 6825 = (4) 


S11 
2’ = 400 ~ 1-2775, 


po = we — pi? = 12775 — (-6825)° = 0-8117 
: rq 
Variance = 08117 = 
p ..(**) 


Solving equations (*) and (**), we get ‘ 


0-6825 
P* 08117 7 08408, gq = 1 — p = 0-1592 
~ Px 06825 _ 0-5738 ; 
r ; 0-1592 = 3-60454 


fo= p= & 8408 POS = 5352 


fi = r+ gfe rq fo = 0°5738 x 05352 = 0:3071 


f= ie ; ge fy = TOORS , 0.1592 x 0:3071 = 0-1126 

f=5 =. ofr = aes x 0-1592 x 0:1126 = 0:0335 

feo + ofa = core x 0:1592 x 0:0335 = 0-0088 
r r+ 4 7-60456 


fs = +1 ‘QC fs = —s~C 0-1592 x 00088 = 0-000213 
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.. Expected frequencies are : 


Nfo Nf Np Nfs Nfs Nfs 
2140992 122:8596 45-0308 13-3928 3-5204 0-8524 
. Observed Frequency: 213 128 37 18 3 1 
Expected Frequency: 214 123 45 13 4 1 
EXERCISE 7 (c) 


1. (a) Define negative binomial distribution.,Give an example in which 
jt occurs: Obtain its moment generating function. Hence or otherwise obtain its 
mean, variance and third central moment. {Gujarat Univ. B.Sc. 1992} 

(b) IfX denotes the number of failures preceding the rth success in an 
infinite series of independent trials with constant probability p of success for each 
trial, then identify the distribution of X and obtain E (X ). What is the distribution 
whenr = 1? _ [Delhi Univ. B. Sc. (Stat. Hons.), 1985] 

2. (a), A well known baseball player has a lifetime batting average of 
0:3. He. needs 32 more hits to make up his lifetime total to 3000. What is, the 
probability that 100 or fewer times at bat are required for him to achieve his goal? 

(b) Ascientist needs three diseased rabbits for an experiment. He has 20 
rabbits availlable and inoculates them one at a time with a serum, quitting if and 
when he gets 3 positive reactions. If the probability is0-25 that a rabbit can contract 
the disease from the serum, what is the probability that the scientist is able to get 
3 diseased rabbits from 20 ? 

3. Astudent has taken a 5 answer multiple choice examination orally. He 
continues to answer questions until he gets five correct answers. What is the 
probability that he gets them on or before the twenty-fifth question if he guesses at 
each answer ?° 

4. Ifa boy is throwing stones at a target, what is the probability that his 
10th throw is his Sth hit, if the probability of hitting the target at any trial is 0:5 ? 

5. Inaseries of independent trials with constant probability p of success 
in each tnal, show that the number of successes ina fixed numbern of independent 
trials follows a binomial distribution. Show further that the number of the trials 
required for a specified number r of successes follows a negative binomial 
distribution. Obtain the mean and the variance of this distribution. 

6. (a) Obtain the Poisson-distribution as a limiting case ofthe negative 
binomial distribution. {(Delhi Univ. B.Sc. (Stat Hons.) 1988] 

(b}) Show how the moments of negative binomial variate can be written 
from the corresponding formulae for the bibomiaf variate. 

, {Delhi Univ. B.Sc. (Maths Hons.), 1991] 

7. Consider a sequence of Bernoulli trials with constant probability p of 
Success ina single trial. Find P (x, k), the probability that exactly x + k trials are 
required to get kK successes, x-= 0, 1, 2,.... Show that P ( x. -k) defines the prob- 
ability function of the discrete random variable A. Find the moment generating 
function of X. Hence find EX) and V(X). 
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8. (a) Derive moment generating function of negative binomial distriby. 
tion and hence show that mean < variance 


(6b) Derive — binomial distribution in the a form : 
-k— oF . = 0, 1, 25% 
f(x) = (~*\(-P¥@ Sea p 


Obtain (i) moment ee function, mean <b variance of this distriby. 
tion. 


(ii) Coefficient of skewness 61. 
(iii) Give-an example of its occurrence. [Gujarat Univ. B.Sc. Oct. 1999) 


9. Obtain the characteristic funciion of the negative binomial distribution 
given in the form: 


1+a l+a 
and hence Poe its first two moments. 


10. (a) Show that for the negative binomial distribution (Q -—~ Py" 
where Q-P=1, cumulant genetating function K(t) = 


x x 
f(x; a, A) = ate af - x = 0,1, 2... 


~rilog[1—P (e —1) 1}. Hence deduce thatx, =r P, x2 =r PO Also obtain «3 and 
K4. 

: [Delhi Univ. B.Sc. (Stat. Hons.) 1986] 

(6) Show that the mean deviation about mean for the negative binomial 
distribution is 
Rte) wel -(n+ yu) 
2(n + 1) naenie 64 

where pt is the greatest integer contained inp + [. 


11. The number of accidents among 414 machine operators was inves- 
tigated for three successive months. The following table gives the distribution of 
the operators according to the number k, of accidents which happened to the same 
operator. Fit the distribution of the type 


P(Xzk)=(- (y led k=0Q, 1, 2,..,v>0,g21—-p,0<p<1l1 


0 1 2 3 4 5 6 7 8 
‘Observed frequency ... 296 74 26 8 4 4 1 0 1 
12. If X has negative binomial distribution with parameters (7, P), 
prove that My(t)=(Q—-Pé)". Hence find m.g.f. of 
Z = (X — nP)/VnPO and deduce that Z is asymptotically nonnal asn —> © 
Hint. Prove thatMz(1t) — exp (°/2) as n — © [c.f. Example 7°15]. 
13. LetY have the negative binomial distribution: Let X; be the number of 


; 

failures between the (j ~ 1 )th and jth success. Then & Xj = Y. Find E(Y), 
jel 

oy by obtaining the means and variances of the X;’s. 
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14. Assume that the riutually'independent random variables X;, each have 
the negative binomial distribution with parameters'r;( i = 1, 2,..., 2), where 7; 
are all positive integers, i.e., 


P(X; =x) = ws ‘|e q°; x = 0,1,2,... 


Then show that the probability density function of = X; is the negative binomial 


i=} 


a 
distribution with r = 7; i.e,, the negative binomial distribution (with fixed p) 

je 
is reproductive with respect to r. (Sagar Univ. M.A., 1991) 
15. Suppose that a radio tube is inserted into a socket and tested. Assume 
that the probability that it tests positive equals P and the probability that it tests 
negative is (1 — P). Assume furthermore that we are testing large supply of such 
tubes. The testing continues until the first positive tube appears. IfX is the number 
of tests required to terminate the experiment, what is the probability distribution 
of X ? ((Aligarh U. B.Sc. (Hons.) 1993)] 
16. A man buys two boxes of matches, each containing N matches 
initially and places one match box in his right pocket and one in his left pocket. 
Every time when he wants a match, he selects a pocket at random. Show that the 
probability that at the moment when the first box is emptied (not found empty), the 

other box contain exactly 7 matches (r = 1, 2,...,N) is 
wN-r-1 


2N-1-r)\/1 
r-1 2 

Using this result, show that the probability that the box first emptied is not 

the one first found to be empty is 
2N OU 
| a {2N-1l-r 
(2) 2 (Pw-1’): 
: rel 

: 2N 
which reduces to ( N 9 

7-5. Geometric Distribution. Suppose we have aseries of independent 
trials or repetitions and on each repetition or trial the probability of success ‘p’ 
remains the same. Then the probability that theré are x failures preceding the first 
success is given by ¢* p. 

Definition. A random variable X is said to have a geometric distribution 
if it assumes only non-negative values and its probability mass function ts given by 

dp;x=0,1,2,.,0<ps1 
P(X =x) = 
0, otherwise -+( 7°25) 

Remarks. 1. Since the various probabilities forx = 0,1, 2,..., are the 

various terms of geometric progression, hence the name geometric distribution. 


11 2N + 1 1 iz 
: ors (Nx) approximately. 
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2. Clearly, assignment of probabilities in (7-25) is permissible, since 


EP(X=x)=ldpep(i+qtrtYet palletes ed 
x=0 x=0 ~q 

7-5-1. LackofMemory. The geometric distribution is said to lack 
memory in a certain sense. Suppose an event E can occur at one of the times 
t = 0,1, 2,... and the occurrence (waiting) time X has a geometric distnibution. 
Thus P(X =t)= d-p;t=0,1,2,... 

Suppose we know that the event E has not occurred before k, t.e., X 2 k, 
Let Y = X — k. ThusY is the amount of additional time needed for E: to occur, 
We can show that 

P(Y=t|\X2k) = P(X =) = pd (7°26) 

which implies that the additional time to wait has the:'same distribution as initia] 
time to wait. 

Since the distribution does not depend upon k, it, in a sense, ‘lacks 
memory’ of how much we shifted the time ongin. If ‘B’ were waiting for the event 


E and is relieved by ‘C’ immediately before time k, then the waiting time 
distribution of ‘C’ is the same as that of ‘B’. 


Proof of (7:26). We have 


P(X2zr)= pa pf = pd + q*'+...) = GPL = 


P(Y2tnXzk)_ P(X-k2ztNXek) 
P(Xzk) P(X2k) 


(« Y=X-—k) 


P(Y2rt|X2k)= 


_P(X2zk+1) ee heer 
P(X 2k) qd 
« P(Y=t|X2k)=P(Yet|X2k)-P(Yet+1| X2zk) 
=d-q*'=4(1-q4)=pd = P(X=¢) 
7-5-2. Moments of Geometric Distribution. 


Wy! = ci cos 2 x P= Pq E xq! =pq(1-qy =F 


V(X)=E(X’)-[E(X) PoE [X(X-1)] +E (X)-[E(X)P 


E[(X -1)X] = Ex(x-1)P(Xex) = Bx (2 -1)p¢ 
_xel x=2 


= 2 
= 2p > ert |: 2pq’ (1-q)° = = 


< 
TY 
be 
wea” 
i" 
= 
[ S) 
il 
mm [& 
IS 
+ 
S [OQ 
l 
q 
i 
ae 
+ 
| 
[2 
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7-5-3. Moment Generating Pinctive of Geometric Distribution. 
Mr(t) = E(€%) = Ed pep ¥(eq¥=p(1-aey 

= p/(1 — ge’) (7-27) 
lok) ed 


to 


= pq (1 - ge)? Jeno = pq(1-qy? = 4 


- P 
ya’ = F mio]. 24,24 
. e (On simplification) 


ait ta 4 A _ FF *PI LD 
i 2 i ns ae 
Hence the mean and vanance of the geometric distribution are q/p and 
q/P respectively. 
Remark. The p.g.f. of the geometric distribution is obtained on replacing 
e' bys in (7-27) and is given by : 
Px(s) = p/(1 - qs) (7-27 @) 
Example 7-51. Let the two independent random variables X, and X2 
have the same geometric distribution. Show that the conditional distribution of 
Xi| (X11 + X2 = n+) is uniform. 
[Gujarat Univ. B.Sc. 1992; Calicut U. B.Sc. (Main Stat), Oct. 1990] 
Solution. We are given z 
P(X =k) = P(X) =k) = p¢'sk = 0,1, 2... 
P[X, = r|[(X% + X2 = n)] = ay Te eS =e - ") 
P(X = rt X.=n-r) 
7 P(X + X2 =n) if 
. P(X = 7 NX =n—-7) 


| 


5 [P (mM ahmed) 
OPO er) ent) . 
5 [P(X = s)-P(X, =e n-'s)] 
- [Since X; and X2 are independent) 

P[Mie r|(Xi+Xown)]=- PL PL pg 
= [pg pd") 5 (p’q") 
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% 1 
" (n+el)pg nei’ r= G,1,2,..0 
Hence the conditional distribution of 1 |(X1 + X2 = n) is discrete uniform, 
(cf. § 7:8). ) 

Example 7-52. Suppose X is a non-negative integral valued random 
variable. Show that the distribution of X is geometric if it ‘lacks memory’, ie., if 
for eachk = 0 and Y = X — k one has 

P(Y =t|X2k)= P(X =t), fort 20 
[Madras Univ. BSc. (Main), 1988) 
Souution. Let us suppose 
P(X =r) =p,;r = 0,1,2,... 


Define 
a = P(X 2k) = pet presi t ... -+(*) 
We are given 
PCY =1|X2k)=P(X=t) =p, --.(**) 
We have 


P(Y=#tfN\ X2k) s P(X~katNX2k) 
P(X2k) P(X2k) 
PX ak +t) _ Phat 
P(X 2 k) qk 
: es [Using (**) 
foreveryt 2 0 and all k 2 O. In particular, taking k = 1, we get 
Pest = Qiepr=( pit prt...) pr=(1—po) pr (From (*)| 


P(Y=t|X2k) = 


=> pr=(1—po)pe-1=(1—po)° pr-2=...=(1—po)' po 
Hence pr= P(X =,t)=po(1—po)';t=0, 1,2... 
=> X has a geometric distribution. 

EXERCISE 7 (d) 


1. (a) If the probability that a target is destroyed on any one shot is 
0-5, what is the probaility that it would be destroyed on 6th attempt ? 

Ans. (0:5)° 

(6) Acouple decides to have children until they have a male child. What is 
the probability distribution of the number of children they would have ? If the 
probability of a male child in their community is 1/3, how many children are they 
expected to have before the first male child is born ? 

(Sardar Patel U.B.Sc. Nov. 1991) 
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(c) Let X be a discrete random variable having geometric distribution with 


parameter p. Obtain its mean and variance. Also, show that for any two positive 
integers s and ¢, 


P[X>s+t|X>s]=P[X >t] 
2. The following distribution relates to the number of accidents to 650 
women working on highly explosive shells during 5-week period. Show that a 


negative binomial distribution, rather than a geometric distribution, gives a very 
good fit to the data. How would you explaint this ? 


Number of accidents: 9 1 2 3 4 5 
Frequency: 450 132 41 22 3 2 


(South Gujarat Univ, B.Se 1991)- 

3. (a) Show that the mean and variance of the geometric distribution 

p(x) = qdp;x = 01,2.,... 
are respectively gp ', gp? (Allahabad Univ. B.Sc., 1989) 
(b) Show that the mode of the distribution. 
p(x) = (S)'sx = 1,2, 3,... 
is 1. 

4. Find (i) the probability generating function, (ii) the moment generat- 
ing function, and (i) the cumulant generating function for discrete random 
variable X following the geanietric distribution 

P(X =r)«<«(1 ~p)p'; fw. 2. 3c; 

5. X1 and X2 are independent random variables with the same distribution 
¢' pk ='0,1... Let Y be defined as the largest of X; and X2, ie., Y = max 
(Xi, X2 ). Obtain the joint distribution of Y and X; and the distribution of Y. 

6. Identify the distributions with the following M.G.F. 

| (5-4), 
Ans. Geometric Distribution, p = 5, 
7. Prove the recurrence formula for Ceometric Distribution, viz., 
p(x +1) = qp(x) 
cet X and Y be independent random variables such that 
P(Xer)=P(Y«r)=q'p;r = 0,1,2,... 

p and q are positive numbers such thatp + q = 1. Find (2) the distribu- 
sf{X + Y and (i) the conditional distribution of X given X + Y = 3. 


9. Adie is cast until 6 appears. What is the probability that it must be cast 
ore than five times. 


Ss. 
Ans; P(X >5)=1-P(X¥ s5)=1- 2 (5/6)'~'-(14) 
xef 
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10. -Forthe geometric distribution with p.m.f. 
f(x) = 2*3x = 1,2,3,.:. 
show that Chebychev’s inequality gives 
P(|X -2|s 2) > 3 
while the actual probability is'15/16. 
[Rajasthan Univ. B.Sc. (Hons.) 1992] 
11. The conditional distribution of random variable X given Y = y is 


-Y 
<r and the marginal probability density of Y is e”, where X is a discrete 


variable, i.e.. x = 0,1,2,....and Y is continuous, y 2 0. 


Show that the marginal distribution of ¥ is geometric. 


Hint. g(x,y) = f(xly) A(y) = 2 - 


x! "peel 


12. IfX and Y be two independent random variables, each representing 
the number of failures preceding the first success in a sequence of Bernoulli trials 
with p as probability of success in.a single trial and q as probability of failure, 


show that P(X = Y) = aa 


@ _»y ao’ 
on yt ae 1 ox! 
f(x) =f aw et fery gy - 
0 : ° 0 


[Delhi Univ. B.Sc. (Stat. Hons.) 1993, ’87) 
Hint. Were (xs r)= sa ae =qd-p;(r = 0,1,2,...) 


P(X=Y)«= EP(Xer =r) z iP (Xer)-P(Y=r) 


- X and Y are sii ena dae ’s | 


= 2 7 = 2 + Pp. 27 =, 
Prd P(l+¢ gq’ +...)= iG 1+4 
76. © Hypergeometric Distribution. When the population is finite and 
the sampling is done without replacement, so that the events are stochastically 
dependent, although random, we obtain hypergeometric distribution. Consider an 
urn with N balls, M of whichare white and N — M are red. Suppose that we draw 
a sample of n_ balls at random ( without replacement ) from the urn, then the 
probability of getting-&k white balls out of 2, (k < n) is 
M\ (N-M)\ _(N)- 
k)}{ n-kj]. |an ‘ 
Definition. A discrete random variable X is said to follow the hyper- 
geometric distribution if it assumes only non-negative values and its probability 
mass function ts given by 
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M\ (N= M 
k n—k- 
P(X#k)=h(k3N,M,n) = 7 ;k = 0, 1, 2...., min(n, M). 
n 
= 0, otherwise .»( 7-28). 
Remarks. 1. N, M and n are knownas the three parameters of hyper- 


geometric distribution. 
2. As it can be shown that 


— (M\(N-M)_(N 
B(e} (n=#)*(n)- 1 
k=0 
this assignment of probabilities is permissible. 

76:1. Mean and Varaince of the Hypergeometric Distribution. 


E(x) = Seperate 3 ef (HN (N Me} 


a) 2 (f=) (2-4) 
Fe migsei 
“fs # 
£[X(X-1)]= J k(k-1) (+) Che |* (a) 
ign (3) a] 


(n) n—-—k 
n 
, Mi -1 (252° M(M -1)n(n—-1) 


a n-2]  ~N(N -1) 
n 


*Since k white balls can be drawn from ‘M’ white balls in i ways and out of the remaining 
aes M 


N - M ted balls, ( = &) can be chosen in 


(A) * (no 


k ] ways, the total number of favourable cases is 
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E(X*)=E[X(X-1)] +E (x)= OID) aM 


2 
‘Hence V(x) Ey | 


. NM(N —~ M)(N ~ 2) 
N(N-1) | (On simplification) 


7-6-2. Factorial Moments of Hypergeometric Distribution. The rth 
factorial saaliiay is 


eit?) -§ eran «Se {(H) (oH) o() 


a ker 


50 (Ee) (aC 
09 06 (Goose 


= EOS. (mor) (Gon) (Roe) 


je9 
Mi) Af) Mr) A?) 
= ary 2G: ;N-r,M-r,n-r)= rome 
r) Ar) 
E(x") = Ls ...(7-28 a) 
nM 
= x = E(X) oe N 


2) 4 = M.(M ~1)n(n- 1) 
E(x J soy - 1) 
is 2) ss ip Gee ee eee 
of = E{[X?)] 4+. E(X) -[E(X) Pf =n NON Nod 
(On simplification) ...(7-28 b) 
Remark. Ifwesample the 7 balls with replacement and denote by Y the 


number of white balls in the sample, then Y is a binomial variate with parameters 
n and p where 


ee ger) (@)- k(k-1) (K-2)...(k-r+1)M! 
ki(M-k)! 
_M(M-1)(M-2).. .(M-r+1)(M-r)!, at”) M-r 
(=r)! (Mk) (e | 


= 49(t)- (02) 


R-f 
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p = M/N,q=1-p=(N-M)/N 
E(Y) = np = ™ _ ex) 


N 
oy ean ere aye ial. Peay 
y= Pq NN > (From (7-28 b)] 


equality holding only ifn = 1. 
7-6-3. Approximation to Binomial Distribution. Hypergeometric dis- 


-- 


tribution tends to binomial distributionas N — © and * —> p. 


prsmaind« (2) (oH) «(8 
—— (NM)! ni(N-n)! 
" k!(M-—k)! (n—k)!(N-M-—n+k)! N! 
_ M(M — 1)(M -2)...(M-k+ 1) 
—— ea eee 
. (N= M)(N = M =1)...(N-~M-~n+k+1) 
(n-k)! | 
n!} 
“'N(N —1)(N-2)...(N—-n +1) 


rag a) 8) 


Proceeding to the limit as N > © and putting 4 = p, we get 


lim _(*\ Pp e...p (1 -p)(1-p)...(1 - p) 
N—» wo HC RS NLM, #) k} & times (n — k) times 


. A pi(1-p)'* = b(k;p, 1 -p) 
76-4. Recurrence Relation for the Hypergeometric Distribution. We 


h(k;N,M,n) = (i | (v2 |*(3] 


have 


’ N - ‘ 
wee ermine = (M4) (NG) (8) 
h(k+1;N,Mn)__ . (n—k)(M—k) 


h(k;N,M,n) (k+1)(N-M—n+k+1)’ 
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which is the required recurrence relation. 
Example 7-53. Explain how you will use hypergeometric model «9 
estimate the number of fish in a lake. 


Solution. Let us suppose that in a lake there are N fish, N unknown. The 
problem is to.estimate N. A catch of ‘r’ fish ( all at the same time) is made and 
these fish are returned alive into the lake after marking each with a red spot. Aftey 
a-reasonable period-of time, during which these marked’ fish are assumed to have 
distributed themselves ‘at random’ in the lake, another catch of ‘s’ fish (again, al} 
at once) is made. Here r ands are regarded as fixed predetermined constants, 
Among theses fish caught, there will be, (say), X marked fish where X is a random 
variable following, discrete probability function given by hypergeometric model: 

r\(N-r N a 
fx(x|N) = 4 readt s PCN) aay -+-(*) 
where x jis an integer such that max (0,s-N+r)<xsmin(r,5s) 
and fy(x/[N)=0 otherwise. _ 

The-value of N is estimated vy, the principle of Maximum Likelihood (c/f. 
Chapter 15), i.e., we find the value N = N(x) of N which maximises p (N), 
Since Nis a discrete r.v., the principle of maxima and minima in calculus cannot 
be uséd here. Here we proceed as follows : 

~(N) = P(N) | (N - r)(N- 5S) ae. ; 

P(N-1) N(N-r—-s+x)- (On simplification) 


 ACN)>1 iff N>— = p(N)>p(N-1) iff N>= wai) 
rs us 
and A(N)<1 iff N<> => p(N)<p(N-1) iff N <= (ii) 


From (é) and (ii) we see that p(N) = fy (x|N,) reaches the maximum 
value - (as ‘a function of N ) when N is approximately equal to rs/x. Hence 
maximum likelihood estimate of N is given by 


A LAY 
f= = => N(X) = > 


EXERCISE 7 (e) 
1.(a@) What is a hypergeometric distribution ? Find the mean and 
variance of this distribution. How is this distribution related to the binomial ? 
(Nagarjuna Univ. M.Sc. 1991; Delhi Univ. B.Sc. (Stat. Hons.), 1989] 
. (b) Obtain binomial distribution as a limiting case of byper-geometric 
distribution. (Delhi Univ. B.Sc. (Stat. Hons.), 1989,’ 87] 
2. Suppose that rockets of a certain type have, by many tests, been estab- 
lished as 90% reliable. Nowa modification of the rocket design is being considered. 
Which of the following sets of evidence throws more doubt on the hypothesis that 
the modified rocket is only 90% reliable : 
(i) Of 100 modified rockets tested, 96 performed satisfactorily. 
(ii) Of 64 modified rockets tested; 62 performed satisfactorily ? 


Theoretical Discrete Probability Distributions 7-93 


3. A taxi cab company has 12 Ambassadors and 8 Fiats. If 5 of these taxi 
cabs are in the shop for repairs and"Ambassador is as likely to be in for repairs as 
a Fiat, what is the probbaility that- 

(t) 3ofthem are Ambassadors and 2 are Fats ? 

(tu) at least 3 of them are Ambassadors ? and 

(wt) all 5 of them are of the same make ? 


. (12\ (8) . (20 = (12\( 8 20 

mo (8) (0) (2): 802) (<8) (2 

4. (a) Show how the hypergeometric disrtribution arises, by giving an 
example. Obtain the frequency function of a random variable X following the 
above Jaw: Derive E(X) and V(X). Show that under certain conditions to be 
stated, the Binomial and Poisson distributions are special cases of the hyper- 
geometric distribution. (Dibrugarh Univ. B.Sc. 1992] 

(b) Find the factorial moments of the hypergeometric distribution. 

[Delhi Univ. B.Sc. (Stat Hons.), 1993] 

5. (a) Suppose that from a population of N elements of which M are 
defective and (N — M) are non-defective, a sample of size n_ is drawn without 
replacement.-What is the probability that the sample contains exactlyx defectives? 
Name this probability distribution. 

(b) Show that, for the distribution derived in (a), 

E(X) =" and (i) V(X) = Mt (1 - B) (1-24 

(c) Show that, under certain conditions to be stated, the binomial distribu- 
tion may be looked upon as a lirniting form of the probability distribution as derived 
in (a). 

6. (a) 200students of the F.Y. B.Sc. ciass ina certain College are divided 
at random into 20 batches of 10 each for the annual practical examination in 
Statistics. Suppose the class consists of 40 resident students and 160 non-resident 
students; and let R denote the number of resident students in the first batch. Use 
the binomial approximation to find the probability that R 2 3. 

Hint. The probability distribution of R_ is hyper-geometric with 
parameters: N = 200, n = 10, M = 40 


Since N (= 200) is large, the hypergeometric distribution ,*) can be 
approximated by binomial distribution with parameters n = 10, p = M/N = 
40/200 = 0-2 


P(R«=r)= - (0-2)f (08 )°-": r = 0,1,...,10, 


and required probability is : 
P(R23)=1-[P(R=0)+P(R=1)+P(R=2)]=0:323 
(6) Find the probability that the income-tax official will catch 3 income-tax 


returns with illegitimate’ deductions, if he randomly selects 5 returns from among 
12 returns of which 6 contain illegitimate deductions. 
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6\ (6 12 
ans. ($) (3)-(3 ) = 25/5, 
(c) If X and are independent binomial variates with parameters 
(m, p) and (72, p) respectively, find P(X = r|X + Y =n) 
Ans. m mo )4( ei 
r n-r n 
7. Froma finite population of N animals ina given region, W are caught, 
marked and then released again. The animals are caught again one by one until m 
(pre-assigned) marked animals are caught. The total number of animals caught is 
‘a random variable X. Find P (X =n), forms ns N-W+m 
(Shivaji Univ. B.Sc., 1987) 
Hint. P(X =n) = P {Catching(n-—1) animals of whom(m-—1 )are 
marked}x P {Catching the marked animal 
from the remaining N — (n — 1) animals}. 


(m= 1} (=m) Iw (m0) 


o 


. N “TN - (n—1)} 
n-1 

: (i —n\(n-1)\ (N 

W-m)|\|m-1 W 
8. Anurn contains M balls numbered 1 to M, where the first k balls are 
defective and the remaining (M — K) are non-defective. Asample of n balls is drawn 
from the um. Let A; be the event that the sample of n balls contains exactly k 
defectives. Find P {Ax) when the sample is drawn (i) with replacement and (ii) 
without replacement. [Delhi Univ. B.Sc. (Maths Hons.), 1989] 
Hint. If sampling is done without replacement, we get hyper-geometric 


probability model. 
K\ (M-K M 
co (: n—k +(a 
If sampling is done with replacement, then X ~B(n, p = K/M) 


a-k 


: n-k 
*. P(At) = () (K/M) - (1 ~—-) % 7) K (u- Ky" 


9. X isa random variable distributed according to hyper-geometric law: 
x|}|n-x 
P(X = x) =h(x;n,a,b) = —_— 


{a+b 
n 


-x «= 0,1, 2,... 


Obtain the recurrence formula : 


a. ; (n —-x)(a-x) : , 
h(x + 13;n,a,b)s Cre Lee rer 1b es Ty M(x 40,5) 
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10. For the hypergeometric distribution 


N N 
‘tibciciztia A) eed. 


N 
ea 
Prove that LL)’ = np and b= n(N-n) pq ; 


11. Explain how you will use hypergeometric model to estimate the 
number of wild animals in a dense forest. 

12. A box contains N items of which ‘a’ items are defective and ‘b’ are 
non-defective, (a + b=WN). A saniple of n items is drawn at random. Let X be 
number of defective items in the sample. Obtain the probability distribution of 
X and obtain the mean, of the distribution. 


ee —* Ol ee 


7-7, Multinomial Distribution. This distribution c~ ~*~ regarded as a 
generalisation of Binomial distribution. 

When there are more than two mutually exclusive out trial, the 
observations Icad to multinomial distribution. Suppose » 5, are k 
mutually exclusive and exhaustive outcomes of a trial with ive pro- 


babilities Pi> Pa. «+99 Pk 
The probability that E, occurs x, times, E, occurs x2 tt. ... and E,, 
occurs x; times in n independent observations, is given by 


rt | Dy 
P (X41, XQ, 00 XH) = Cpt PT... Pk 


where >x; = n and c is the number of permutation of the events Fi, E>, ..., 
To determine c, we have to find the number of permutations of n objects c 
which x, are of one kind, x2 of another kind, ..., and x, of the Ath kind, which is 


given by 
n 


x) 1x5! coe Ny ! 


Hence 
P (X41, XQ) «205 Xp) = 


so-called since (7-29) is the general term in the multinomial expansion 
k 
(P| + Po Pisce Py)"; LP; =i’ 


Since, total probability is 1, we have 


n\ : 
LP@M=>r —— pe ... Pit| = (py + pz... + py) = 1. 
: BSE EEN = ..(7-29(a)] 
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7:71. Moments of Multinomial Distribution. The moment generating 
function is given by 


k 
Mx(¢) = Mx, x, ...,.%, (4, 02, «5 tk) = E [em {2 ux! 
k 
= ») Foe mit pr ... Pe exo 2 nx | 
"2 Pres (piety (me | 


x1!x2! 
= (pie! + pre? +... + prety” --.(7:30) 
where x = (X1,X2,...,X%k) ° {On using (7-29(a) j, 


Now My, (f1 ) = Myx (1, 0, 9, ...,0) = (pie + Pp2 + p3t+... + pe)" 
-[(-p)ere] (-° 2 pi = 1) 


=> Xi ~B(n,pi) 

Similarly, we shall get: 

X; ~B(n, pi); i = 1,2,..., k. 
> E(X;) = np; and Var X;,= npi(1— pi); i = 1,2,...,k 


aM —— 
a nas, oe 


. [ mie" (n—-1)(pi et +... + prety~ pie], 
=n (n-1)pip; 
Cov (X;,X;) = E (X;:X;)-E (X;) E(Xj) =n (a-1) pipj— n° pip; = apip; 
Cov (Xi, X; ) — Di Pj 
X;,X;) = Paeedintdl Sinled Ldeoks! SLAP 
OMe Ox; OY, vnpi(1 — pi) npj(1 — H) 


Pi Pj | 


[By uniqueness theorem of m.g,f.] 


|G - pi) (1-7) 
Example 7: 54. The trinomial distribution of two r.v.’s X and Y ts 
given by: ° 
fay (Y) © eq (1-p-qyt 
XY (X,Y yitn-x-y)! P-~4q 
for x, ents. ee oe 
where0 s p,0 <q and p+qs 1. 
(i) Find the marginal distributions of X and Y 
(ii) Find the conditional distributions »f X and Y and obtain 
E(Y|X=x) andE (X|Y = y) 
(ii) Find the correlation coefficient between X and Y. 
[Delhi Univ. B.Sc. (Maths Hons.) 1988; Sp! Course-Statistics 1989,’ 85] 
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Solution. The joint m.g.f.ofX and Y is given by : 


n Rn=-xX 


Mxy(tula mE (AN = SY (very (get (1-p-ay 
7 x=0 y=0 
= [pet + ge? + (1-p-a)] (2) 


Mx(t:) = M(u,0) = {1-p) + pe’ => X ~B(np) (Ei) 


iv (iti) 
My(t2) = M(0,t2) = {(1-4@) + ge} = Y ~B(n,q) 
Observe that Af (t1,t2) # M(m,0) x M(0,t2) = X and Yare not in- 
dependent. 
(ii) The conditional distribition of X given Y = y is given by : 


ay) — Ley) _ __ fir (4 Y) nafs } 
(XY y) ; fr(y) "Cy q’(1-q)"” [. Y B(n,q)| 


eon (oe (ESS 


“xl(n-y-x)! 1-q 1-q 


(12) (ets) (1-385) 


7x = 0, 1, eeey n 


x Ji l—@q 1-q 
=> X|(¥=y) ~B(n-y, pA1 —q)) iv) 
=> E(X|(Y=y))=(n-y) -pH1-4q) ...(v) 
Similarly, we shall get 
f(Y[X=x) = Fey) __ fey) bx (np) 


x(x) "Cypi(1-py™ 


y R-x—y 
‘on N-X\(_ |G __ _-4_ - ys 
. 1-p 1 i 3; y=0,1,.,.2. 
Y|(X =x) ~B(n -— x, q/(1-p) »(Vi) 
=> ELY|(X =x)] =(n-x) q/(1-p) (Vii) 


(ui) Correlation Coefficient pyy : 
Since X ~B(n,p), E(X) = np, Var X = np (1 —- p) 
Y ~B (n,q), E(Y) = nq, Var Y = nq (1-q) 


a M(t, t2) 


E (XY) = 0% 0 f2 


a n(n — 1) pq 


fh, of2 20 


Cov (X,Y) =E(XY)AE(X)E(Y) =n(n-1) pq—-n? pq ==npq 
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YY = Cov (X, Y). — "Pq a Pq 
oxoy  Vap(i-p)nq(1-q) (1-p)(1-q) 
Note. Here p + q « 1. 


Example 7-55. If Xi, X2, ....X~ are k independent Poisson variates 
with prameters 1, 2, ..., Ae respectively, prove that the conditional distribution 
P(X, N X20... OA X_| X), whereX = X, + X2 + .... + Xp is fixed, is mul- 


tinomial. [Lucknow U. B.Sc. (Hons.), 1992] 
Solution. P[X1 A X20... NXe|X = 1] 


=P[(Xx=enNXenn.. AX n| X =n] 
P[M=anAN Xen. AM en AX =n 


P(X = n) ’ 
P(X; =n 7... WXe-1 = reins Xe =n -— 12... — Pk] 
P(X =n) 
_ P(X =r )P(X2=12)...P (Xe-1 = rk-1 )P (Xp en—ri—... —rk-1) 
| P(X=n) 


(+. X), X2, ...,X~ are independent ) 
Further, since X;,(i = 1,2,...,k4) are independent Poisson variates with 
parameters A; respectively, X = X; + X2 + ... + Xx is also a Poisson variate 
with parameter A; + Az + ... + Ae = A (say). 


Hence P(Xi NX2N NM |X =n] 


et eo hte1 eee eh 7 ed a 
- ryt ope {n—rj—...-%-1)! 
en" 
n! 


n' 
~ ts 
rylra! oc. mail (m—-n—-...—re-1)! 


Toad Reet ial 4 | 
Ls 
dh 
k k 


k 
where ) tein 2 > tS]* ; yee 


rt 


lB) Ce) 


n! ry SF. Tk 
= —________ J) -++ Dk 
trator PPA P 


tel ‘ ie] 


tes 


Thus the ‘conditional distribution P(X; N X e oe AN) X a 


multinomial with probabilities p; = (Ai/A.); ¢ = 1, 2,...,& ink classes. 
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Remark. If X;’s are identically distributed independent Poisson variates 
J 


k 
with parameter 7m (say), thend; = m; t= 1,2,..,k andA = =X i = hon. 


iol 
M1 
Pim Nk 
Hence in this case the conditional distribution of X;, X2, ..., Xx, given that 


their sum X1 + X2 + ... + X, =n, iS a multinomial distribution with index 
and the probability in each class being equal to 1/k. 


EXERCISE 7(f) 

1. IfX), Xo, ...,X~ bave a multinomial distribution with the parameters n 

and pi (i = 1,2,..,4) with Zp; = 1, obtain the joint probability 
P(X, = x N X2 =x2N 2... NX = XE) 

Obtain the corresponding moment generating function. Hence, or otherwise 
show that E (X;) = mpi, V(X;) = np; (1 — pi) 
and Cov (Xj, Xj) = — mpi pj, (i j). 

2. Discuss the marginal and conditional distributions, associated with the 
multinomial distribution. If (71, 72, ..., 1) have a multinomial distribution with 
parameters (7, pi, p2,..,pk) and if ci,dj,i = 1,2,...k are constants, find the 

k k k 
variance of £ cjnj and co-variance between £ cj mi and = djnj. 
ie1 inl iol 

3. Ifthe random variables Xi, X2, ..., Xz have a multinomial distribution, 
show that the marginal distribution of X; is a binomial distribution with the 
parameters n and p;, with ¢ = 1,2,...,k. 

4. For the trinomial distribution of two r.v.’s X and Y given by: 

n! 

f(x,y) = Ziyt@ oxy) Pi PP 

where x and y are non-negative integers with x + y sn and pi, p2, p3 
are proper positive fractions with p1 + p2 + p3 = 1, and 

f(x,y) = 0, otherwise 

Show that (i) X ~B (, p:) and Y ~B (n, p2) 


(ut) X|(Y=y)-B (n-y,pi/(1-pr)) and Y|(X =x) -B(n-x, mA(1-pi)) 


(ii) @ (X,Y) = - [pipet - pr)(4- pad] 


4. If ‘n’. dice, each of which has 6 faces marked 1 to 6 are thrown, find the 
probability of yetting a sum ‘s’ on them. 


Hint. The exhaustive number of ways in which” dice can fall is 6”. 

Since the total number of permutations in which six numbers, 
viz., 1,2, ...,6 taken ‘n’ ata time can add to s is the coefficient of x in the 
multinomial expansion -of-(x + x? +... +x°)", the required number of 


-~x-y 
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favourable cases for getting a sam ‘s’ ona dice is the co-efficient of x in the 
expansion of (x + x7 + 2. + x°)% 


*. Required Probability = = [co-efficient ofx’ in (x +x° 4+... x°)"] 


Now identically, we have : 


- 46 
xe ree. +x ax (1 treater (FoF) 


and by binomial expansion 


v(1—x°)" = = (1S "CG." 2 


and (1-x) "= z | Onl -x)= a 'C,.x’ = ttre Cau. .x 
720 
; 1-x° kon ne+r-1 n+6ker 
x] 7 >) FS (-1) e Cre Ch-1-X 
k=0 r=0 


To find the co-efficient of x’, we putn + 6k+r=sie,n+r=s~6k 
Thus the co-efficient of x* in (x + v+... + x8 )" 
(s-n V6 


a 2 be Ly: Cp. $-6k- LY ene i. 
0 


summation being extended over the integral values of k not exceeding 
(s,— n)/6. 
(s-n V6 
Hence required probability = = (1 Cp C6" 


Remarks. 1. The eantaie of getting asum ‘s’ with a throw of n dice, 
each having ‘f faces marked 1 to f is the co-efficient of x* in 


Af (xe2e ute) | 


2. Ifn dice have faces f;, fo, ..., fr respectively, then the required prob- 
ability : getting a sum ‘s’ is the co-efficient of x’ in 


2 


xe tet) (xen?t ct?) (rere ee 
iho | zS alii | 

6. Whatis the probability of obtaining a sum of 15 points by throwing five 
dice toegther? 


Hint. The number of exhaustive cases in throwing of 5 dice is 6. 
The number of ways in which the 5 dice thrown will give 15 points is the 
co-efficient of x> in the expansion of (x! +x7 42° +... 42°). 
Favc urable number of cases 
= Coefficient of x? in (x +x? +... +x° y 
= | coefficient of x!° in ( 1+x+... +x y 
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= coefficient of x" in(1-x° )P(1-x)° 
(2x5 Pw (1-790, 0 C2 !? = 1) (1 5x + 10x'? - ... — 7) 
§$x6 9 Sx6x7 3 5x6x7x8 4 
a1~* ar * tay 

5x6x7x...x 14 19 
10! = 

= (1 + 5x + 15x72 + 35x + 70x* + ... + 1001r + we) 
.. Favourable number of cases 
co-efficient of x” in (1 —5x° + 10x!” — ... -x°°) 

x (1+ Sx +... + 70x" +... + 1001r"? + |...) 

(1001 - 5 x 70) = 651 
651 651 
6 ~ 77796 


7. Four dice, each marked 1 to 6, are thrown together. Find the probability 
of a total count being 

(1) Exactly 12 and 
or (ti) More than or equal to 20. 

8. Four tickets marked 00,01,10, 11 respectively are placed ina bag. A 


ticket is drawn at random five times, being replaced cach time. Find the probability 
that the sum of the numbers on the tickets thus drawn is 23. 


9. Sbow that the mode of the multinomial distribution is given by 
X1, X2, «.. Xk, Satisfying 
mpi-1<xis(n +k - 1) pis t= 1, 2, .., 4% 
{In order to establish this, show that 
Pix; s pj (x; +1)forlsi,js k] 


(1-x)° =1+5x+— 


Hence tbe required probability = 


7%. Discrete Uniform Distribution. A random variable X is said to 
have uniform distribution.on” points { x1, eee if its p.m.f. is given by : 


P(X = xi) = 25 i= Lun 


s(7°31) 

For example, ee ‘bas a uniform distribution on the points 0, 1, 2. Wh, 
then P (X = -— i = 0, 1,2, 5 2. : 

=) +1’ (7:31 a) 


Such edibles can be conceived in practice if under the given experimen- 
tal conditions, the different values of the random variable become eugally likely. 
Thus for a die experiment, and for an experiment with a deck of cards such 
distribution is appropriate. 


7-9. Power Series Distribution. A discrete r.v. X is said to ‘follow a 


generalized power series distribution (g.p.s.d ; if its probability mass function is 
given by-— 


, i 
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ay &* 


;x = 0,1,2,...; a, 2 0 


" 0, elsewhere .».(7°32) 
where f(@) is a generating function, i.e., 
f(®) = La,6, 620 
xéES »«(7°32a) 
so that f (8) is positive, finite and differentiable and S is a non-empty countable 
sub-set of non-negative integers. 


- Remarks 1. By taking proper choice of S and f( 6), the g.p.s.d. can be 
reduced to binomial, Poisson and logarithmic series distribution and their truncated 
forms. 


2. Aninflated powerseries distribution (p.s.d.), inflated at zero is given 
by 


P(X =x)«= f(6) 


a, 
a f(0)’ x | eps oe (7-33) 


where a (0 < a s 1), is the inflation parameter. 
3. The truncated p.s.d. is given by: 
oe 
f (8) 
= 0, otherwise 
a, 6 
FO) (0) 
= 0, otherwise »(7°34) 
7-9-1. Moment Generating Function of p.s.d. 


P(X =x|S)-= [is.xes 


=> P(X =x |S) = ~~;x ES, where f;(0) = = a, & 
xE€S$ 


Mxy(t) = 5 e* P(X =x) = 5 e* | a 6°/f(8)| 
xr=0 ; x=0 


"Fie 2 a, (0¢V 7(®) (7°35) 


7-9-2. Recurrence Relation for Cumulants of p.s.d. The cumulant 
generating function is given by 


Kx(t) = log Mx(t) = log aa 


>) Kr — = log f( 6 d) - log f (8) (1) 
ral 


Diffcrentiating (1) partially w.r.to 6 and ¢ respectively, we get 
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5 2 HSL!) _ £(0) 
gr! 
real 


sae) — f(@) (2) 

a {-' ae f'(0e') ) 

d oe ae = seo 3 
= 2 He) ©) 


Subtracting (3) from 8 times (2), we get 


oa ft _f7 o-F(0) 
oh ger Lay FO) 


o1 
Comparing like powers of ¢ on both sides, we get 
6f'(9) Of’ (6) ..(7°36) 
0 = Kk); - => Ki = = 
1 f(8)  f(@) 
and Kr+i = 0; “ ky; r = 1,2,... (Comparing co-efficient of #7 !) 
Remark. We have 
Of’ (8) 
Mean = x oo : 
me #8) (7-36 a) 
Alternatively 
- @ + -1_ of"(@) 
Mean = x la, O/f (6)} = —— xa, 0! = 


7-9-3. Particular Cases of g.p.s.d.1.- Binomial Distribution. Let us 
take 6 = p/(1-p), f(8) = (1+ 6) and S = {0, 1, 2, .... 7}, a set of 
(n+1) non-negative integers then 


f(9)=fta48 = (1+ 0)" = 5 a, OF 
xGS$ x=0 
x 
n\{__p 7] 
Vx) | (t+ p) 


_ py) 
- | 


. P(X =x) = 


(oP 1-py*;x =0,1,..,2 
0, otherwise 


which is the probability mass function of the binomial distribution with parameters 
n and p. 
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2. Negative Binomial Distribution. Let us take 9 = p/(1 +.p), 
f(®) = (1 - 6) "and S = {0,1,2,... ad infinity |, e s90<1i,n-> 0. 


Now (0) = 24,0 = (1-86)"= 5 a, 8" 


xES x=0 

=> am (-1¥ (| fe ")acay eS) 
P(X=x)= yy ws ‘\{oa+er]/ [1-jp(1+p)}T" 
x=0 
> ("ro P(l+p)i"**?; x=0, 1,2... 
x@0\ , . 
= S Fy (l+p)t"* (py; x =0; 1, 2.... 
xu 


which is the probability mass function of the negative binomial distribution. 
3. oe Series Distribution. Let f(@) = - log(1 - 8) 
and S = { 1, 2, 3,.. 


Then f(0) = 2 a,0° = — log (1-6) = E a4 0%, ie, a= = 


x€ES 
a, & Q* 
Ney 
“P(X = x) = f(6) x[-log(1 = Q)] ? 1253.46 
0, otherwise 
4. Poisson Distribution. ‘Let f (6) = e's S = (0,1,2,...}. Then 
f(8) = 5 asl => oa 50,0" ie., dr = -+ 
x€S x=a@(@ x: 
a, 0° e* ee. 
EE Oy ie ee x! po eens 


which is the probability mass function of the Poisson distribution with parameter 
6. 


ADDITIONAL EXERCISES ON CHAPTER VII 


1. Show that the necessary and sufficient conditions for two given numbers 


a,b to be respectively the mean and the variance of some binomial distribution 
2 


a 
are thata > b > O and , 


5 is an integer. 


Show further that when these conditions are satisfied, the binomial] distribu- 
tion is uniquely detemuned. 
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2. Ina game of taking a chance, a contestant has to give correct answers to 
4 out of 5 questions to win the contest. Questions are given with 3 answers each, 
out of which one is a correct answer. If a contestant answers the questions by 
selecting the answers at random, what is the probability that he will win the contest? 

Ans. 10/3” = 0-0412 

3. Suppose the automatic machines of a plant fail with probability q, the 
machine failure is independent from machine to machine and the plant stays in 
operation, if at'least half of the machines run. Consider a two-machine plant and a 
four-machine plant. Show that the value of g for uninterrupted operations, 

(4) when the value of q is same in both plants is 3, 
(ii) when a two-machine plant is perferred is q > 5, and 
(iii) when a four-machine plant is preferred is q < 5 


4. Ifb(r 3n,p) = "C, p’-q'~’ is the binomial probability in the usual 
k 


° 


notation and ifB (k3;n,p) = = 1b(r;n, p), prove the following results for the 
0 


rev) 


‘seails’’ of the binomial distribution. 


() 1-B(k-1;n,p)s—— 


k — np 


si . n_- . 
(it) Bék;n,p)s por eR es 


P 
(iii) 1- Bkimp) am (Meh) feed — ee dt 
0 


b(k;n,p), kK > np +1 


5. ifacoin is tossed times where n is very large even number, show that 
the probability of getting exactly (5 — p) heads and (5 n + p) tails is ap- 
proximately 

1 
2 2 
= onze / 
un 
6. IfX ~B(n,p), show that 
P(Xs2)=P[X 2 (n - 2)], ifand only ifp = 5. 


[Calcutta Univ. B.Sc. (Hons.), 1989] 
7. ‘SfX ~B (n,p), show that X is symmetrically distributed about c if 
and only ifp = 1/2 and c = n/2. (Madurai Univ. M.A., 1991) 
8. IfX ~B (n,p), andY = X°, find corr. (X,Y) 
(Delhi Univ. (Stat Hons.) Spl. Course; 1989] 
9. A and B have equal chances of winning a single game, A wants 7 
games and B, n + 1 games to win a match. Show that the odds in favour of 
A are 1+ P to 1 ~ P, whereP = acai 
nin!2 
Hint. The probability that A wins at least'n games is 
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Crd’ p+ "Cnsiq’ | p"*'+ ae" Cup 

Now "Co + "Cit 20. Cai "Cnt "Cara ty. + "Con = 2” 

.. ax ore + soi ere +...¢ "Cr =i [ oF as OP 

.. Probability of A’s win = - (3 ( 2. )) = = ( 1-P) 

.. Probability of A’s losing = 1 -i(1-P) =5(1 +P) 

Hence the result. 

10. (a) The chance of success in each Béinoulli trial is p. If px is the 
probability that there are even number of successes ink trials, prove that 

Pr = p + pk-1 (1 - 2p) --o(*) 
Deduce that px = 5 f 2 +(1- 2p | 


(b) Also obtain the probability generating function of (*) and hence obtain 
an explicit expression for px 

(c) Obtain an expression for px directly without using (a) or (b). 

11. A spider and a fly are situated at the comers (0, 0) and (1,7) of a 
rectangular grid. The spider walks only north or east, the fly only south or west, 
They take their st¢ps simultanequsly to an adjacent vertex of the grid. Show that, 
if the successive steps are independent and equally likely to go in each of the two 


2n 
possible directions, the probability that they will meet is . 5 


(Delhi Univ. B.Sc. (Statistics Hons.) Spl. Course, 1988] 
12. For the binomial distribution, show that thé probability that the 
number of successes in n trials should not exceed x is given by 


a a 
ive syn 


Aa na gel 
—— dy 
Sato 


where p is the probability of success. 
13. Prove the identity 


pe(t)oae(3) ete a+(f)ote _ (ta) 


P 
fx (1 xy dx 
0 


go 
fet -xhds 
0 
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Hint. For Questions 12 and 13, see Example 7-23. 

14. Let X be a random variable whose probability function is 
b(x3n,p).LetY = X/n bea new random variable. Show that the expected value 
of Y is p andthe variance of Y is pq/n. If p(y ) is the probability function for 
Y, show that p(y) = 6 (ny; n, p). What are the possible values that y can take 
on? 

15. Suppose that the number of telephone calls that an operator receives 
from 9-00 to 9-05 hours ina day follows a Poisson distribution with mean 3. Find 
the probability that (i) the operator will receive no calls in that time interval 
tomorrow, (i) In the next three days the operator will receive a total of 1 call in 
that time interval. 

Ans. (é) e? (ii) 3x (e°P (1 - e? ). 

16. A large number of observations on a given solution which contained 
bacteria were made taking samples 1 ml. each, noting down the number of bacteria 
present in each sample. Assuming the Poisson distnbution, and given that 10% 
samples contained no bacteria, find the average number of bacteria per ml. 

Ans. log. 10 or 2°3026 

17. The number of oil tankers, say N, arriving at a certain refinery each day 
has a Poisson distribution with parameter 2. Present port facilities can service three 
tankers a day. If more than three tankers arrive in a day, the tankers in excess of 
three must be sent to another port. 

(i) Ona given day, what is the probability of having to send tankers away? 

(i) How much must present facilities be increased to permit handling all 
tankers on approximately 90 per cent of days ? 

(tii) What is the expected number of tankers arriving per day ? 

(tv) What is the expected number of tankers serviced daily? and 

(v) -What is the expected number of tankers turned away daily? 

Ans. (i) 0-145, (u) 4, (i) 2, = (ev) :1°785 and ~—s (v) 0-215. 

18. IfX is any non-negative integer valued variate and a is any positive 
number, show that 

P(X2a)s°.E(t*);t>1 

Verify the inequality 

P(X = 22) s (e/4)* when X¥ ~P(A). 

19. IfX is any non-negative integer valued and a is any positive number, 

show that 
P(X sa)sf°E(t*),0<ts1. 
Verify the inequality: 


P(X s 5 m) < (2/e)"”, when X ~P(m) 
20. Suppose (X, Y) have the joint p.m-f. 


flny) = Lee 


RiGee ee 
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Show that the correlation coefficient between X and Y is 
[a/(a+b)]”. 
Also obtain the distribution of Y — X. 
(Delhi Univ. (Spl. Course. Statistics Hons.), 1988] 


~ = -~(a+b) -x a 
Hint. Myy(u, 02) = » » eft thy aria 


ar ire x!(y-x)! 
x z 
fh of f 
o (ae'e’ rm) be? 
=e (a+b) > aaclle anil & a 
7 x! z! s(yax=z) 
xe@ zraQ 
= exp [aen*® + be® -a - b| a | 


Mx(t) = M(t,0) = exp [a(e* ~ 1)] => X~P(a) 
My (t1)=M (0,12) =exp[(a+b){e*-1} ] => Y~P(at+b) 


a°M (ti, t2) 
0% 0b 


E(XY)= =a°+ab+a 


hehe 


Cov (X,Y)=E(XY)-E(X)E(Y)=a°+abt+a-a(a+b)=a 


Distribution of Y—X. Taking 4 + 2 = 0 = t% =—- 1 in (**), 
M(~t2,2)=E(E ENE") SE (eh O") & exp| b(e*-1)| | 
=> Y~X ~P(b) 


21. IfX is Negative Binomial variate with parameters (k and Q° ), 


prove that 
P 
x" de 


1 . 
Bim, bys (1+x)t™ 


HintP(X =m) = » | g*-"(-py 


P(X 2m) = O-P=1 


= k+r-1 -k-r pr 
x | : |2 P 
d i k+r-1)\ ,p-1 ,-k-r 
gplP(x zm] = ¥ (Tr~ Tord Trar-| r | Q 
1 m-1 p-k-m 
""Bimky  & 


Integrating, we get 
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P 
1 wt dx 7 
lata ae 5) Tees eu a dia 


(b) IfX is N.B.(k,p), show that’ 


1 
1 2 me 
P(Xzm)= ary Sy yt 
79 P 


22. Ina sequence of independent trials, the probability of a success on 
each trial is ‘p’. By considering the outcome of the first trial, show that G, (2), the 
p. gt of the number of trials required to achieve the mh success, satisfies : 


G,(t) = pt Gy-\(t) + qtG,(t) 

and hence obtain G, (+). (Delhi Univ. (Spl. Course Statistics Hons.)y1987] 

Ans. G,(t) = [pt/(1 - gt) ] 

23. Let X and Y be independent random variables with the same 
(geometric) distribution given by P (X = k) = pq‘; k = 0,1;2,... 

Let Z= max (X,Y) 

(i) Find the probability distnbution of Z. 

(ii) Find the joint probability distribution of X and Z. 


(iti) Find the conditional probability distribution of X given Z = I, 1e., 
compute P (X = k| Z = 1) for all k,l = 0,1, 72.... 


(iv) Find the conditional probability distribution of Z given X =k, i.e. 
compute P(Z = /| X = k) forallk, | = 0, 1,2... 


Ans. (i) P(Z = 1) = pq [2 a ae ¢*'); t= 0,1, 2,... 


0,ifl<k 
(ii) P(Z=INX=k) =) pd (1-d¢*') ifl=k=0,1, 2... 
pd * if l>k=0,1,2y.. 
Oifl<k 
(iii) P(Xek|Zel)= (1- -F"'W(2~4 * qg*!)ifl=k=0, 1,2... 
p¢/(2-q'- Pane are 
Oifl<k 


(iv) P(Zal|X=k)=d1-d*!ifl=k=0, 1,2, ... 
p@ifl>k=0, 1,2, ... 
24. Suppose that.X}, X2, ..., Xn are mutually independent indicator random 
variables, with P(X; = 1) = p, 0 < p < 1.Showthatfor1 s M s N, 
M f -M 


M (t na=k 
P| i Xj =k Exen] 


— 2 
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25. Suppose one makes (m +n) independent trials of an experiment 
whose probability of success at each trial is p. Letq = (1 - p). Show that for 
any k = 0,1, 2, ... , the conditional probability that exactly (m + k) tials will 
result in success, given that the first m trials result in success, is equal to 


(i) p qd’ -* Show further that the conditional probability that exactly 


m+ k) trials will result in success. given-that at least m trials result in success 


is equal to 
CO /2 mtn p\ 
m+k}\q r=0 bees ta) 

26. Let Xi, Xo, ee be independent Bernoulli variates with common 
parameterp = P(X, = 1). Let Sj =X1 + X2 +... + Xj forlsjsn. 
Show that P (S; = r| S, = s) does not depend onp(0 < p < 1) and takes 
the form of a hypergeometric probability forl sy sn,Osrsssn. 

Hint. S, ~B (np) 


P(Sj=rNS,=s)= ce rs ee ee 


P(S;=r|[S,=s)=P(S;2=rNS, =s)/P(S, = 5) 


rjls-—r S 
a result, which is independent of p. 
27. Anum contains w white balls and b black balls. Balls are drawn one 
at a time from the urn, without replacement. Find the distribution of the number 
X of draws needed to obtain the k th black ball. Find also the factorial moments 


E [ r) (Dethi Univ. B.Sc. Statistics Hons. (Spl. Course), 1989] 


Peed b-k+1 


b+w eo prssy 
x-1 


t-1}( oe | +(°5""}:(Onsimptieation 


ut Mr Sle (Pl, asjsnOsrsssn 


Ans, P(X =x)= 


“(k-1]| b-k b 
For E [x] , proceed as in § 7°6-2. 


? 


E[ XO) =k) (b+ wt 17 (b + 1) 
28. The joint p.m.f. of two discrete r.v.’s X1 and X2 is: 


X1,X2) = 1 — 
p (1, x2) (z:] fn) (1-p) 
withx) < x2 Ss m2 + X13 0 s xX, Sn. 


Find the marginal distributions ¢{ Xi and X2. 
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Ans. X, ~B (m,p') and X2 ~B (mn, + m2, p) 


29. Two discrete random variables X and Y have the joint probability 
distribution : 


fy!(9 -x-y)! 


Osxs9, 0s ys 9andOs(x+y)s 9 


(<) Show that the marginal distribution of X is binomial with parameters 9 
and 4. 


(ti) Show that the conditional distribution of Y given X = 3 is also 
binomial with parametérs 6 and ¥2. 


30. A Polya process is defined by the quantities : 

rp 0k 11 +b)... {1 + (K-1)8)| 
fob|~*~=C~S Po(t) 
wherePy(t) = (1+ barry 


and A, b are parameters, f is a continuous variable and & may take zero or poisitive 
integral values only. Verify that the distribution satisfies the requirements for a 
probability distribution in K and find the expectation of K and its variance. 


91 1) 
p(x,y) = yivi(Ors ea wt : (5): where 


Py(t) = | 


Hint. Let K be a random variable with the distribution, 


P(K =k) = Pk(t); K=0,1,2,...,%. 
ef 1]fi 1 oye 
o[2][bor] [Bo] 
k! 


(1/b)+k-1 
Hee) 


z, Py(t)= <(1+bAy >» Aperre 


-Vb 
=(14bAt) PAbessy 


ae 
| 


=(1+bary'” 


1 
wm*zl 


isc bat 
1+bAt 


. Py (2) represents a probability distribution for every fixed b, A and ¢. 


M.G.F. of K = E(M*) = 5 ¢ m* P.(t) 


s(1—x)"= S(*) he y (“eo *)# 
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aE | E al \3 +k- tle ae) 


=(1+bary y Fa Pre 
k=0 


aye ee C2 ee 
(1+bAr) 2 e qona 


1+bAt k 
1 


Fed 


=(1+baAry’” og o (1 tbar—ebary” 


he ‘14+bAt 


= g(u), (say ) _ (1/b) - 1 
g(u) = 5 [1+ bare eit] » (e“bAt) 
E(K) = wp,’ (about origin) = Mean = [g’ (u) Juno 
1 -—(1/b)-1 . 
=|] [1 oar bar] ae aa 
Similarly 2 =[g''(u)lu-0 = (b+ vee Kne 
Variance = p'2 — wi"? = At (1 + DAL). 


OBJECTIVE TYPE QUESTIONS 
1. (i) Match the correct parts to make a valid statement : 


(a) Binomial distribution applies to 1. rare events 
(b) Poisson distribution applies to 2. repeated two alternatives. 
(c) The mean of a Hypergeometric 3. 1-—- 6pq 
distribution npq 
(d) The moment generating function 4. M 1 M\(N-n 
of negative binomial distribution a n nll N-1 
(e)  Thecoefficient cf kurtosis of a 5. (Q-pey’ 
binomial distribution i 
n 
(f) The variance of geometric 6. W 
distribution 
(g) Variance of Hypergeometric he mci 
distribution P 


II. Under what conditions binomial distribution tends to (i) Poisson 
distribution, (2) Norma] distribution, (4) Geometric distribution. Give practical 
examples (one each) where you would expect binomial, Poisson, negative binomial 
and geometric distribution. 
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III. 
(i) 
(it) 
(iii) 
(iv) 
(v) 
IV. 
(!) 
(it) 
v. 


(i) 
(i) 
(iii) 


(iv) 


(v) 


(vi) 
(vil) 


(viii) 


VI. 
() 
(tt) 
(iii) 
(iv) 
(v) 
(vi) 
(vit) 
(viit) 


(ix) 


(x) 


State the relationship between : 

Mean and variance of Poisson distribution. 

Mean and variance of negative binomial distribution. 
Mean and variance of geometric distribution. 

Poisson distribution and binomial distribution. 
Hypergeometric distribution and binomial distribution. 
Name the discrete distribution for which 

Mean and variance have the same value. 

Mean is greater than the variance. 


State which of the following statements are True and which are False. 
In case of the false statement, give the correct statement : 


Mean of binomial distribution ts 3 and variance is 5. 
Mean of Poisson distribution is 2 and variance is 3. 


The sum of two independent Poisson variates is also a Poisson variate. 
The result holds for the difference also. 


For a binomial! distribution, 
Mean = Mode = Median 


The Poisson distribution is a limiting case of bincgmial distribution 
when n — ©, p > 0, np > m. 


Nearly all the distributions are particular cases of Poisson distribution. 


The sum of two binomial variates is a binomial variate if the variables 
are independent and have the different probabilities of success. 


Negative binomial distribution may be regarded as the generalisation of 
geometric distribution. 


Fill in the blanks : 

The variance of a binomial distribution is ......... 

The B-coefficient of skewness of the binomial distribution is ......... 
The moment generating function of Poisson distribution is ......... 
The characteristic function of negative binomial distribution is ......... 
The coefficient of skewness of a Poisson distribution is ......... 


Poisson distribution is a limiting case of binomial distribution under 
the conditions ......... 


For Poisson distribution al] cumulants ............ 
Mean > variance for ......... distribution. 
For the Poisson distribution, the variance and the third central moment 


Mean < variance for ......... distribution. 
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VII. 
(1) 


(i) 


(iit) 


(iv) 


(v) 
(vi) 


(vit) 


(viii) 


(tx) 


(x) 


VOT. 


Ans, 
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Give the correct answer to each of the following : 

The skewness in a binomial distribution will be zero, if 
1 ! 1 

(a) p <3 (b)p=7'(c)p>z° QAp<gq. 

The mean and variance of negative binomial distribution : 


(a) are same, (b) cannot:be same, (c) are sometimes equal in limiting 
case, aS n — co, 


The characteristic function of Poisson distribution P (m) is 
(a) em(it-) | (b) gintet (c) e™, (d) none of these. 
The coefficient of variation of Poisson distribution with mean 4 is 
(a)z, (b)a° (c) 4, (a) 2 
The coefficient of kurtosis of a Poisson distribution with mean i is 
(a) 1/m, (b) —\/m, (c) m, (d) 3 + (1/m) 
The mean of a Hypergeometric distribution is 

M M(M- NM (M 
OWN ON Won © WUD’ @ None of 
these 
In a Poisson distribution, the second moment about the origin is 12, 
Then its third moment about mean is (a) 2, (b)3, (c) 5, (d) 10. 

x 10 -x 

The mean of the binomial distribution "°c, (2) (2) sx = 0,1 
2, ..., 10 is (a) 4, (6) 6, (c) 5, (d) 0. 
The mean of Poisson variate is 
(a) greater than, (b) less than, (c) equal to, (d) twice, its variance. 
The moment generating function of Geometric distribution is 
(a) p (1 — ge"), (b) pA - ge), (c) pel(1 — ge'), (d) None of these. 


By using the uniqueness property of m.g.f.’s, determine the 
distribution if the M.G.F. is as follows : 


6 
(a) M(d=(L+ det) OM (n=O 


’ 


(6) M(t) =A A2EY, (ay M(H = eX -», 

(e) M(t)= ele!- 1/4 (f) M (f)-= ; en! (<- = 2) | 
(g) M (t) = 4 (3e* - 1), (A) M (f) = Bet - 2)°73 
(a) Binomial, n= 6, p = 2 ; (b) Binomial, n= 5, p =7; 
(c) Binomial, sa pat; ; (da) Poisson, rn = 3, 

(ce) Poisson, A= a , (f) Geometric with we 1/3, 


(g) Negative binomial with r= 2, p= ; ; 
(h) Negative binomial with r= 3, p= 1/3. 


CHAPTER EIGHT 
Theoretical Continuous Distributions 


8-1. Rectangular (or Uniform Distribution. A random variable X is 
said to have a continuous uniform distribution over an interval (a, b) if its prob- 
ability density function is constant =k (say), over the entire range of X, i.e., 

k,a<x<b 
f@ = 


0, otnerwise 


Since total probability is always unity, we have 
b b 


J f@de=1 = kJ de=1 ie, k= 


-—a 
a a 
l 
poe 
f= 
0, otherwise ..(8-1) 


Remarks. 1. a and b, (a <b) are the two parameters of the uniform dis- 
tribution on (a, b) . 

2. The distribution is also known as rectangular distribution, since the 
curve y =f (x) describes a rectangle over the x-aixs and between the ordinates at 
x=a and x=b. 

3. Thedistribution function F(x) is given by 

0, if - o<x<a 


F(x) = amet asx<b (8-1 a) 


1 b<x<co 
Since F (x) is not continuous at x=a and x=), it is not differentiable at 
these points. Thus < F (x) =f (x) => #0, exists everywhere except at the 
points x=a and x=b. and consequently p.d.f. f(x) is given by (8-1) . 
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4. The graphs of uniform p.d.f. f(x) and the corresponding distribution 
function F (x) are given on page &1: 
5. Fora rectangular or uniform variate X in (—a, a) , p.d.f. is given by 


fo) - A i a 
0, otherwise. 
8-1-1. womens of Rectangular Distribution. 
| 1 bt l _ qt | 
ia fran anes [rdeagta [ee | 
b-a)’ (b -a) r+] (8-2) 
In seed lak 


2 2 
Mem =n'=Gho| * g |- b+a 


(b-a)} 2 2 
ee b> -a° _1 2 2 
and w=at5| 5 |-ko + ab+a’°) 
1 b+a an 
maggie? gle ee, Ce 2) ee | ee 
2 = ple — py 5 (0 +ab+a’) ie 7 19 a) 

8-1-2. ene Generating Function is given by 

-| ol! acpi! 

Mx (t)=J e "f@)de=7G a 

8-1-3. Characteristic Function is given by 

a el? _ et 

@x () = e f(x) ea 
8-1-4. Mean Deviation about Mean, 1) is given by 
b 
T= E 1X —Mean! = | | x — Mean | f (x) dx 
l f b 
at 
~ (ba) ae wall 
, (b-«a)/2 [tas 25° | 
= lel de 2 
~ ( - -(b-a)f2 
(b -a)/2 


| b-—a 
“o-a 7? 4 
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Example 8:1 If X is uniformly distributed with meatil andvariance V, 
jad P(X <0). ‘[Delhi Univ. B.A. (Hons. Spl. Course-Statistics), 1989] 


Solution. LetX ~U[a.b|. sothatp (x) = j radeau<h. Weare given: 
I- a e 


b+a 
Mean = m =. => ht+a=2 ‘ 


Var (X) = (b-a)?= 5 => (b-ay = 16 => b-a=t4 ; 
- ow, tee aye? 


i 
Solving, we get : — and b=3; (a<b). 


p@=zs -1<x<3 i ee 
1, 7.1 
P(X <0) -| p@)dx=7 | xl sq 


Example 8-2. Subway trains ona certain line run every half hour between 
mid-night and six in the morning. What is the probability that a man entering the 
gation at a random time during this period will have to -wait at least twenty 
minutes? 

Solution. Let the r.v. X denote the waiting time (in minutes) for the next 


rain. Under the assumption that a man arrives at the station at random, X is 
distributed uniformly on (0, 30), with p.df., ° : 

l 
AA? O<x < 30 y 
f=} 0 


é 


0, otherwise 
The probability ae he‘has to wait . ee 20 minutes is 


P(X 220) = i f(x)de = 55 ri 1 de =35 30 20) = ; : 
: : 20,14, 2, ae 
Example 8-3. “if X has a uniform distribution a 0. 11, find the distribu 
tion (p.a.f.) of —2 log X. Identify the distribution also. 


_ [Delhi Univ. B.Sc. (Stat: Hons.), 1989,’ 86] 
Solution. Let Y=~2 log x, Then the distribution function,G of Y is 


Gy (i= P(YS))= ‘P(-2 log X<y) th sycall 7" 
=a les gx2- y/2)= P(X2 62) =1-P(K Se). 
at “y/2 : Pig de 
€ 


wi freydeet f ioieastee 


eg ..(*) 


[-.: as X ranges in (0, 1), Y=—2 log X rangés‘from 0 to © | 


ee 


roe teialor 0< yee 


X 
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Remark. This example illustrates that if X ~U [ 0, I. then Ya_ 
log X, has an exponential distribution with parameter 9=-. [cf.§ 8-6 ] Or 
=-2logX has chi-square distribution with 2=2 degrees of freedon, 
[ cf. Chapter 13, § 13-2 ]. 


Example 8-4. Show that for the er ee distribution : 


fase -acxca 


the m.g.f. about origin is > ( sinh at) . Also show that moments of even order are 
a?" 

(2n + 1) 

Solution. M.G.F. about origin is given by 


given by — Hon = 


Mx (1) =E(e") = =| ety eayar = af ef dx 


_ tw ai, _ Sinh at 
7 at °° )= at 


Since there are no terms with odd powers of't in M (t) , all moments of odd 
order about origin vanish, i.e., 


-Won+) (about origin) =0 
In particular 1)" (about origin) =0, i.e., mean =0 
Thus ul,’ (about origin) = p, (since mean is origin) 
Hence Han+1=0;2=0, 1, 2,... 


e., all moments of odd order about mean vanish. The moments of even 
order are given by 
2n 


rn Qa 
ant "“O=Gnan 


Example 8-5. if X; and X2 are independent Pecan eian variates on 
(0, 1], find the distributions of 


(i) X\7X2, (ii) X; X2, (iii) X; +X2, and (iv) Xi - 
Solution. We are given 
Fx, (x1) = fx, (x2) =1;0<x,<1, O< x2 <1 
Since X; and X2 are independent, their joint p.d.f. is 
Ff (x1, x2) =f (x1) f (x2) = | 


(i) Let us transform to 


L2n = coefficient of ——; 
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Wt VHX £2, Kp=uvix=V 


td , X2) 
O (ut, Vv) 


~ 
Ni 
lt 
=< 
- 
it] 


x; =0 maps tou=0, v=0 
xt = 1 maps to uv = 1 (Rectangular hyperbola) 
x2 = 0 maps to v=Q and x2= 1 mapstov=1. 
The joint p.d.f. of U and V becomes 
g (u,v) =f 1, x2) IJ 1 =v; 0<u<0,0<v<oo 
To obtain the marginal distribution of U, we have to integrate out v. 


In region (/) , 

| vy? |! \ 
giuy=] vdve|S| =L0sus) 

0 o 

In region (//), 

1/u 2 Wu 

v | 

1 (u) = vdv=| — =—~ ,Il <u<oo 

an 2/0 2 


Hétice the distribution of U= x is given by 


g(u)at,Osusl 


c 


l 
=—7,h4u<oco 
2u° 


xf oes a uu 
(ii) Letu=x) x2, v=Ex, Le., XSV,X2= 7 


0 l 

J= | a@js-- 
T 7% Vv 
Vv y 


x1=Q0 maps to v=0,x,=1 maps tov=I | 
x2=0 maps to «=0, and 12=1/ maps tou=v 


\ 
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it 
Moreover, v=— => vou (since 0 <x2< 1) 
NX? 9 


The joint p.d.f. of U and V ts 


g (uv) =f (sine) 1S—50<urs LO<v<} 


| 
(=| 7 dvs [log ¥ | =-—logu,O<u<l 


ut 


ul, 


(iit) and (i). Let t=x +25, 

V=X1-X2 
xy =O>utv=0 
U+tV| Leyvs—u 


i.é., X1 = 
: 2 Wex22=O0Omu-v=0 
pee e.,V=u 
7 2 Jx=lmut+v=2 
x2=1=pu-v=2 
L i 
eax |[52 2 ce 2 es 
and J = 1 1| 772 
2 2 


2 glu, v) =f 1, 2) IJl=1,0<u<22,-1< <1 


2 
In region (J), (see figure below) 


£1 (u) =| -tdv=i 


—u 


and in region (//) , 


- “e 4 
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“w-2 
u,O<u<Il 
2-—u,1<ux<2 es 
For the distribution of V, we split the region as: OAB and OAG 
In region OAB: 

by (v=) 27Y Ldu=1[2-v-v]=1-v,0<v<1 
In region OAC : 

ho(vy=J 2*” Ldu=2[2(1+v)]=14+y,-1<v<0 
Hence the distribution of V= X; — X2 is given by 

| l—v, 0O<v<! 


ew=| 


h(v)= : 
l+v, -lL<v<0 
Example 8-6. If X is.a' random variable with a continuous distribution 
function F, then F (X) has a uniform distribution on (0, 1]. 
(Delhi Univ. B.Sc. (Stat. Hons.), 1992, 1987,’ 85] 
Solution. Since F is a distribution function, it is non-decreasing. Let 
y= F (X) , then the distribution function G-of Y is givén by 
Gy (y)=P(Y Sy) =P[F(X)Sy]=P[XSF'()], 
the inverse exists, since F is non-decreasing and given to be continuous. 
Gy(Q)=F EF '()1, 
since F is the distribution function of X . 
Gy Q)=y 
Therefore the p.d.f. of Y= F (X) is given by: 


ev(y -Z1Gr0)1=1 


Since F is ad.f., Y takes the values in the range [0, 1]. 
Hence gy(y)=1,0S yl 
=> Y is a uniform variate on [0;1.]. 
Remark. Suppose X is arandom variable with p.d_f,, 

e* x20 ; 

fx) = 

-0, otherwise 

0, 1f x <0 
then -F@) = 

l-e*,ifx20 

Then by above result F(X) =1- e* is uniformly distributed on (0, t]. 


Example8-7. /fX and Yare independent rectangular variates for the range 
~a to a each, then show that the sum X + Y = U, has the probability density 
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oat = a9 Wve0 
4a" 


wie" Henri 
4ar° 


Solution. Since X and Y are independent rectangular variates, each in, 
the interval (- a, a), we have * 


0, elsewhere 


—,-a<y<a 
and fh (y= 4 7 
0, .elsewhere 


Hence by compound probability theorem, the joint probability differential of 
X and Y is given by 


: ] 
dP (x, y) =fi (x) f2 Q) de dy = i dx dy, -a<(x,y)<a 


Let us define new variables U and V as follows: 


u=xty, v=x-y 
= pte. cant ne u-v 
2 C- 2 
Jacobian of the transformation J is given by 
Ox Ox 


Fale) eed 
~O(uyv) | oy dy 
du -dv 


Thus the probability differential of U and V becomes 


Nl— vwol— 
N|— 


dG (u,v) =—51J du dv=— du dv 
4a 8a ...(*) 
Integrating w.r.to. v over specified range, we can find the distribution of 
U. 
Let us consider the region to the left of v— axis, i.e., to’ the left of the line 
AC . In this region, the values of v are bounded by the lines x =-— a and y=-a. 
For fixed values of u , 


7 = -@ => =—(u+ 2a) 


and y=-a > 5 Se = v=(u+t 2a) 
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¢a ,0) 


D 
(0,0) (0 0) (a,-G) 
Thus integrating (*) w.r.to. v between the limits — (u + 2a) and (u + 2a) , the 


distribution of U becomes 


_fu+a% 1 oe u+2a . -ut2a 
gi(u) du=J upg gree e2 5 v | ~(u+20) =] 


du 


In the region to the left of v -axis, i.e., below the line AC, u varies from the 
points (x=-a, y=—-a) to the point (x =0, y=0) and since u=x+y, in this 
region u lies betwee (-a-a) and (0+0), i.e., between -—2a to 0. 


g1 (u) du = u+ 2a 
4a 


In the region to the right of v—axis, i.e., above the line AC, the values of v 
are bounded by the lines x=a@ and y=a and for fixed values of u, 


uty 

x=a > > = 4 => V=2a-u 
u—-v 

y=a > > =@ => v=-(2a- 4) 


In this region # varies from the point (x=0, y=0) to the point (x=a, 
y=a), ie.,u=x+y varies from 0 to 2a. Thus integrating (*) w.r.to. v between 
the limits - (2a — u) to (2q— u) , we get the aca ass of U as 


l 
gi (u) du=J 72 - (Qa - “ay gg UWE lv | 5 ee du 


= 2—4 ty, 0<uS2a 
4a 
For an alternative and simpler solution, see Remark 5 to § 8-1-5 , (Triangular 
Distribution). 
Example. 8-8. On the x-axis (n+ 1) points are taken independently be- 
ween the origin and x = I , all positions being equally likely. Show that probability 


+ 6 2 Jae 
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that the (k + I) th of these points, cone from the origin, lies in the interyay 
x~Zdxtoxt dx is- 5. 
a a] XN - -. 
n hk ie 
, | art ihe (1-x)"~* dy 
e a a 
° e ° f ° ° 

Verify that integral of this expression from x =Q to x= 1 ts tanity. 

Solution. Here X 1s given to be a random variable uniformly distribute 
on[ 0,1]. os 


Be Ix(O=1,0Sx81 
Now PRR g OIRe= hs 1 .dx=x (1) 
P(X>x) 2 1-P(XSx)=1-x .-(2) 
de 
Also Pl fexceeS Jel g f (x) dx =dx 
(3) 


Required probability ‘p’ is given by 7 


p=P { out of (n+ 1) points, k points lie in the closed interval 0, x - - 


and out of the remaining (n + 1 — k) points, (nt — k) points lie in 


pre ] ‘and one oint lies in ‘ pa eae’ | 
> / 2 * p fe : a) 1™ 2 


_[ (ntl \4 n+1—k _yynek 

“| k mall oa Je x) |xae, 
on using (1), (2) and (3) respectively. : 

___(n+))! (n+1—-k)! 

—ki(ntl-b'"  6(n-b! 


(n+1)X(1—x)""* dx 


(1 x)" * dx 


To prove that “(i } area of this expression from x =0 oa =] Is unity, use 
Beta- puneee 


m-1¢ n-l oy peg, _Im-n. 
|. (1~ay""! de = B (m,n) = 7 sm > 0,n>0, | 


8-1-5. Triangular Distribution. A random variable X is said to havea 
triangular distribution in the interval (a, 5), if its p.d-f. is given by: 
2(x-a)/| (b-a)(c-a)} 3a<xSe 
f= .-.(8-2a) 
2(b-x)/ (b-a)(b-c)}  3e<x<b 
Remarks. 1. We write X ~Trg. (a, b), with peak atx=c. The graph of the 
p.d.f..is shown in the diagram on page 8-11. 
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2. The distribution is so called because the graph of 1 its p.d.f, is a triangle 
with peak at x =-c. ' 


~7- 


P(x) 


p(x) 


0 A(o,0) (c,0) B(b,0) = x- 


t é 


3. The m, 8- f. of Trg (a, 5) sie with peak atx=c is given by: 


Mx (2) = -| ef (x) dx = fis {| e f(x) de. 


¢ ae & b 
2 - 2 : 
‘7-ae-n OT ap ae) Gale + (b-x) dx 
Oe 
(a—b)(a—c)  (c—a)(c-b). (b—a)(b~a) [°7 ~° *° 
(On integration by parts) ‘ ...(8-25) 


4. In particular, taking a=0,c=1 and b=2, in (8: 2a), the p.d. f. of the 
Trg (0, 2) variate with peak at x= 1 is given by: 


x: O<sxsl 
f(xy ay2-x; 18x52 
0, - otherwise | | ..(8-2 c) 
anditsm.g fis.’ ‘Mr(@j=#(é-1)/", ° + (82d) 


t 


which is left as an exercise to the reader. 
5. In particular, replacing a by — 2a, b by 2g andc by 0, the p.d.f. of 
triangular distribution on the interval (- 2a, 2a) with peak atx =0 is given by: 


(2a+x)/4a’; -2a<x<0 7 
f(Qx)= 3 
(Qa-x)/4a’;  O0<x< 2a ,..(8-2e) 


The m.g.f. of (8:2e) is given by : 
2a 


Mx(t) = f e* f(x) dx | 


20 
¢ 

=<; fe. (2a+ xydx+ f e®(Qa—x) de 
4a" | _>, 0 
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AV? ALL gf 2a-e 17 
ee 4a’ t f 


0 


[On integrating by parts] 


..(8-2 f) 
Aliter. We may obtain (8-2 f) directly from (8-2 6) on replacing a by 
~2a, b by 2a andc by 0. 


Example 8-9. [fX and Y arei.i.d. U[-—a,a] variates, find the p.df. of 
Z=X+Y and identify the distribution. 


Solution. Since X and Y areiid.U[—a,a], wehave: [c/f. § 8-1-2.], 
Mx (1) = My (t) = (e" -e7™ )/(2 at) ...(*) 


e™! cs e at 2 
since X and Y are independent. 
But (**) is the m.g.f. of Trg (~ 2a, 2a) variate with peak at x = 0 
[cf Remark 5, equation (8-2) ] 
Hence by uniqueness theorem of m.g.f.. Z=X + Y~Trg (—2a,2a) with 


p.d.f. as given in (8-2 e), ae 5. 
: 2at -2at 
Aliter Mx.sy(t) = a ra|e —2+e | 


[From (**)] 
7 2 e 2 at ? eet ‘ ett 
(-2a-—0)(-—2a-—2a) (0+2a)(O-—2a) (2a-~0) (2a + 2a) 


which is of the form (8-2 5), [ cf. Remark 3], with a replaced by —2a and b 

replaced by 2a and cc by 0. Hence X + Y ~Trg (— 2a, 2a) withp.d.f.p (x) given in 

(8-2e) . 

Remarks 1. The distribution of X + Y has also béen obtained in Example 

8-7. 
2. Similarly we can find the di.tribution of X — Y. 


t -at 2 
Mx-y(t)=Mx(t).My(-d)= Ss [From (*)] 


=> X ~ Y ~Trg (— 2a, 2a) , with peak atx=0. 
EXERCISE 8 (a) 
1. The bus company A schedules a north bound bus every 30 minutes at a 


certain bus-stop. A man comes to the stop at arandom time. Let the random variable 
' X count the number of minutes he has to wait for the next bus. Assume X has a 
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uniform distribution over the interval (0, 30). This is how we interpret the’statement 
that he enters the station at the random time). 


(i) Foreachk =5, 10, 15, 20, 30 compute the probability that he has to wait 
at least K minutes for the next bus. 


(ii) Acompetitor, the bus company B is allowed to schedule a north bound 
bus every 30 minutes at the samé station but at least 5 minutes must elapse between 
the arrivals of the competitive buses. Assume the passengers come at the bus stop 
at random times and always board the first bus that arrives. Show that the company 
B can arrange its schedule so that it receives five times as many passengers as that 
of its competitor. 


2. (a) A random variable X has a uniform distribution over (— 3, 3) , 
compute 


(i) P(X=2), P(X <2), P(IX1 <2) and, P(IX-21<2) 
(ii) Find k for which P(X>k)=1/3. [Gorakhpur Univ. B.Sc. 1992) 


(b) Suppose that X is uniformly distributed over (— a, + ), where a >Q. 
Determine & so that 


(i) P(X>1)=1/3, (i) P(X<1/2)=0-3 and 

(Gi) PCIXI<I)=P(IXI>1). 

Ans. (i) «=3, (ii) a=5/6, (iii)a=2. 

(c) Calculate the coefficient of variation for the rectangular distribution in 
(0, b) given that the probability law of the-distribution is 


P(XS)= 7 
(d) If X is uniformly distributed over [1, 2], find z so that 
P(X>z+h)=5 (Ans. Z=4). 


3(a). Ifarandom variable X has the density function f(x) , prove that 


y=| f(x) de 


has a rectangular distribution over (0, 1). If 
f@)=F@-1), 1sxs3 
= 0, otherwise 
determine what interval for Y ‘will correspond to the interval 
11i<sX<2-9. 
Ans. y = F(x) = (x - 1)’/4; 1 $x $3 50-0025 < y $ 09025 
(6). Show that whatever be the distribution function F (x) of ar.v. X, 
P[asF(X)sbj=b-a, 90S(a,))S 1. 
[Delhi Univ. B.Sc. (Stat. Hons), 1986] 
Hint. Y=F(X)~U[0,1]. 
4. (a) For the rectantular distribution, 
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J 
fa)=s7 :-ASxXSa ; 


= 0, otherwise, 
show that the moments of odd order are zero, and [42, = a’ / (2r+ 1). 
; , [Madurai Kamraj Univ. B.Sc.. 1992 ] 
(b) A distribution is given by. 
f(s) de=s-dx,—a Sx<a 
Find the first four central moments and obtain B; and B2. 
[Delhi Univ B.Sc. Oct., 1992; Madras Univ. B.Sc., 1991} 
(c) Fora rectangular distribution 
dP =kdx,1<xS€2, 
show that Autuiets mean >-Geometric mean > Harmonic mean. 
: : {Vikram Univ. B.Sc. 1993) 


(d) If the random variable X follows the rectangular distribution with 

p.d.f., 
f(x) =1/0,0<x<6, 

derive the first four moments and the skewness and kurtosis confficients of the 
distribution. ie 

(e) LetX and Y be independent:variates which are uniformly distributed 
over the unit interval (0,1). Find the distribution function and the p.d.f. of random 
variable Z=X+Y. Is Z a uniformly distributed variable ? Give reasons. 

[Delhi Univ. B.Sc. (Maths. Hons.), 1986] 

5. Let Xi: and X2 be independent random: variables unifromly distributed 

over the interval (0, 1). Find i oN 


(@) P(Xi+X2<05), (ii) P(X1 — X2 <.0'5), 

(iti) P(X} +X3.<0-5), (iv) P(&*" <0-5), and (v) P (cost X2 < 0-5). 

Ans. (i) 0-125, (ii) 0-875. (iii) 0-393, (iv) 1—log2, and (v) 2/3. 

6. A random variable X is uniformly distributed over (0, 1), find the 
probability density functions of 

(i) Y=X?+1, and (i) Z=1/(X+1). 

7. (a) If the random variable X is uniformly distributed over (0,5%), 
compute the expectation of the function sin X ., Also find the. distribution 
of Y=sin X, and show that the mean of this distribution is the same as the above 
expectation. 

Ans. 2/n, fy (y)=2/(nV1—-y*), O<y<l. 

(b) If X ~U[-12/2, 2/2 ] distributed, find the p.d.f. of Y=tan X . 

[Delhi Univ. B.A. Hons. (Spl. Course-Statistics), 1989] 


8. (a) Show that for the rectangular distribution, : 
dF=dx,0Sx<'! 


Theoretical Continuous Distributions 8 15, 


’) (about origin) = 1/2, variance = 1/12 and mean deviation about mean 
=1/4. [Madras Univ B.Sg Sept. 1991; Delhi U. B.Sc. Sept. 1992] 
(6) Find the characteristic function of the random varjable Y = log F (X) 


where F (X) is the distribution function of a random variable X . Evaluate ‘the rth 
moment of Y . 


9 IfX ~U[9O, Fj ; find the distribution of Y=1/X . Find E(I/X) , if it 
exists. 


Ans. gy(y)=1/y?; 1Sy<eo; E(Y)=E(1ZX) dogs notexist. 
10. LetxX be uniformly distributed on [-1, 1]. Find the distribution function 
and hence the p.d.f. of Y= x. [Delhi Univ. B.Sc. (Maths. Hons.), 1988] 


11. Let fx (x) = 6x (1 -x); OS x1. Find y as a function of x such that 
Y has p.d.f. 


] 


g(Q)=3(1-Vy); Osysl 
(Dethi Univ. B.A. Hons. (Spl. Course-Statistics), 1988] 


Hint. F(x) =f fe) de =3x2-23 ~ U0, 1] 
0 


y 
Go)=J g-Q)dy=3y-2y*?- UL 0,11 
0 


Setting F(x) =G(y), we get y=x"*4 

12. The variates a and b are independently ana uniformly distributed in 
the intervals [0,6] and [0, 9] respectively. Find, the probability that x -axt+b=0 
has two real roots. 


y a’/4 
Ans. P(bsa@/4y='  { 35 
a=0 6=0 
13. Find the probability that the roots of the equation x°+2bx+c=0 
should be real, given that b ~U[ —@, @] and c ~U[- B, B ] are independent. 


Ans. Probability =P? Be) - -P(p Sc) =1- P(bIs Ve) 


“Il Les ag) |« 


14. Ifa, b,c are randomly chosen between O’and 1, find the probability 
that the quadratic equation ax’ + bx+c=0 has real roots. | 
1 faa deck Fy eA 
Ans. Probability = P(t? 2 4ac).= t~ fo fesfy sdadeadb=kt 81 
DR) ot | Lc 
13. (a) = X oe a rectangular distribution on (—1;‘I). 'Cdnipute:’’ 


dadb= 1/3. 


X 
P arma >2 {and compare it with the upper bound given by Chebyshev’s 


aeakia. 
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(6) Compare the upper bound of the probability, 
P{IX-E(X)l22VV(X)}, 

obtained from Chebyshev’s inequality with exact probability if X is uniformly 
distributed over (— 1, 3). 

Ans. (b) Probability < 1/4, Exact Probability =0 

16. Two independent variates are each uniformly distributed within the 
range —a@ to + a. Show that their sum X has a probability density given by 
2a+x 


f@ = re =>, , ~2asxs0 
= — ae O<x<2a 
Qa 


Verify that the m.g.f. calculated from the value of f(x) is equal to 
2 


- sinh at 
at 
17. The random variables X and Y ‘are independent and both have the 
uniform distribution on {0, 1}. Let Z=1X -— Y1. Prove that. for real 9, 
9(Z,0)=2[ 1+10~e? |/2. 
Hence deduce the general i ca for E(Z'). 


Hint. 9(0;1X-Y1)= ce (Ixy e0y dx dy 
0 0 


J 
=2] [ ean yy dx 
0 | 0 


Ans. 2/| (n +1) (+2) : 


18. If X and Y are independently and uniformly distributed random vari- 
ables in the interval (0, 1), show that the distribution of X + Y is given by the 
density function 

z Osz<!l 
f(@=)2-z2 18282 
0 elsewhere 
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(Hint. See Triangular distribution] 


19. Ship A makes radio signals to the base and the probability of the 
interval between consecutive signals is uniformly distributed between 4 hours and 
24 hours and is zero outside this range. Ship B_ makes radio signals to the base and 
the probability of the interval between consecutive signals is uniformly distributed 
between 10 hours and 15 hours and is zero outside this range. 

(é) Ship A has just signalled. What.is the probability that it will make two 
further signals in the next 12 hours ? 

(ii) Ships'A and B have just signalled at the same time. What is the 
probability that Ship A will make at least two further signals before ship B next. 
signals? {Institute of Actuaries (London), April 1978} 

20. If X~U[0,1], prove that for b<c fixed, Y=(c-—b)X+b is 
uniform on[ b,c]. 

8:2. Normal Distribution. The nornial distribution was first discovered in 
1733 by English mathematician De-Moivre, who obtained this continuous distribu- 
tion as a limiting case of the binomial distribution and applied it to problems arising 
inthe game of chance. It was also known to Laplace, no later than 1774 but through 
a historical error it was credited to Gauss, who first made reference to it in the 
beginning of 19th century (1809), as the distribution of errors in Astronomy. Gauss 
used the normal curve to describe the theory of accidental errors of measurements 
involved in the calculation of orbits of heavenly bodies. Throughout the eighteenth 
and nineteeth centuries, various efforts were made to establish the normal model 
as the underlying law ruling all continuous random variables. Thus, the name 
‘‘normal’’ . These efforts, however, failed because of false premises. The normal 
model has, nevertheless, become the most important probability model in statistical 
analysis. 


Definition. A random variable X is said to ahve anormal distribution with 
parameters 1 (called ‘‘mean"’)and o (called ‘‘variance’’) if its density function 
is given by the probability law : 


F ‘ ya 
eM SR exp fs} 


or =f (3,0) = "SWE ¢ gene 


Tr 


~m<cxKco, —- oc pi<co,g>0_...(8-3) 
Remarks. 1. A random variable X with mean «1 and variance o” and 
following the normal law (8-3): is expressed by X ~N (1, 0’) 


2. If X ~N (pL, © *), then Z= <- , 1s a Standard’ normal variate with 


E(Z)=0 and Var pe | 
and we write Z ~N (0,1) . 


Pa 
3. The p.d.f. of standard normal variate Z ts given by 
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] “12 ; 
9@= pe ~e*f ,- KZ Sw 


and the corresponding distribution. function, denoted by @(z) is given by 
®(=P(Zs2= J oud 


“4 ~ 00 


{ ouw/? 
a J edu 
V2 7 


mo 
We shall prove below. two important results on the distribution function 
&(.) of standard normal variate. 


Result 1. ® (-z)=1-@(z) 
Proof. ® (-z)=P(Zs-z)=P(Z22) (By symmetry) 
= 1-P(Z<z) 
= 1,-@ (z) 
ania (tat gee , where X ~N (p, 0°) 
Proof... P.(a<X<b) = p( eBezs-— |; 208} 


4. The graph of f (x) is a famous ‘bell;shaped’ curve..The top- -of.the bell 
is-directly above the mean ji . For large values of G, the curve tends to flatten out 
and for small values of o, it has a sharp peak. 

8-2-1. Normal Distribution as a Limiting form of Binomial Distribution. 
Normal distribution is another limiting fornyof the binomial distribution under the 
following conditions : 

(i) n, the number of trials‘is indéfinitely large, i.e:, n — o> and 

(ii) neither p nor.q is very small. 

The probability function of the ae distribution with parameters 7 and p 
is given by 


n't 
= ———_—_—_—_—_ ec — | ae ee ee 
P= Bg arce mer ” ee ...(*) 
Let us now consider the standard sage variate: 
Z= lise (X) _ ; X.=0,-1 72, 0, 1 


WX) Ta | .(**) 
W x= = s = J 
hen 0, Z= — Vp q 
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n 
and when X =n ,Z= Ter > =Vnq/p 


Thus in the limit as.r1 4 oo , Z takes the values from — °° to oo. Hence the 
distribution of X will be a continuous distribution over the range — 2 to ce. 


We want the limiting form of (*) under the above two conditions. Using 
Stirling’s approximation to r! for large r, viz., 


- + (172 
im pie Ve tl, 
r —) 00 


we have in the limit as 2 — o° and consequently x > o , 


{ 
—-2  N+e~ XK N-XK 


: eT ee V2 0 eo" n"*? pq 
lim p (x) = lim } 
V2n ex *7V2 7 eo" (n-x)**2 


)" -X ra! 


(npy : ; (ng 2 


. Lo 
= lim 
1 
V2 Vnpq | 2 (nx) 243 


H 


= jj np nq 
= lim A RS Pe = PF re) a (P**) 


From (**), we have 


X=np+ZNnpq => aya ltzZ Vq/(np) 


aioe 
-X=n-np- ~ZNVnpq =nq-Z Vnpq 


== ZN p/({nq) . Also de = a 


Hence ie probability differential of the distribution of Z, in the limit is given 
from (***) by 


d G(z) =g (2) dz= lim | ea | (8-4) 


+i -x+4 
where v=t=] ima 
log N=(x+ 3) log (x/np) + (n-x + ) log'| (n —x)/ nq | | 
= (np +z Vnpq ++) log £1+2V(q/npy) 
+(ng-z Vnpq +4) log [ 1-2 V(p/ng) | 
= (np +2 Vnpq +4) [2.VCq/mp) -42 (q/np) +42 (q/npy? —.. ] 
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+ (ng —zNnpq + 3) [- 2 V(p/ng) ~ 3.2 (p/ng)- 42 (p/ngy”* =...) 


. 2 


=| | ied} 02 + sf? q-iz Ser 


i.é., 


log N= 52 p+Q+2+q)+5 = Zum \V9/P +VP7q}+0|n- “i 


2 
=F tO) oe as n—poo 
li 2 li 2 
mM jogN=> = "™ nae?” 
n —> co 2 nh —)> co 


Substituting in (8-4) , we get 


- 2/7 
ms = 22 < 00 
d G(z) = (2) dz = “Vine d. <2Z ...(8-4 a) 


Hence the probability function of Z is 
l -z/2 
BS pg oe Se (8-4 b) 

This is the. probability density function of the normal distribution with 
inean O and unit variance. 

If X is normal variate with mean p ands.d.o then Z=(X-—U)/o is stand- 
ard normal variate. Jacobian of transformation is 1/o . Hence substituting in 
{8-4 (b)}, the p.d.f. of a normal variate X with E(X) =p, Var (X)=0° is given ; 
by 

I ~Qa-pyo’ 
f=, oV2n © are 
0, otherwise . 

Remark. Normal distribution can also be obtained as a limiting case of - 
Poisson Distribution with the parameter A > © . 

8-2.2. Chief Characteristics of the Normal Distribution and Normal : 
Probability Curve. The normal probability curve with mean p and standard ' 
deviation O is given by the equation 
~(v-py/20° . 


fW=oee 


and has the following properties : 


—-o< K¥ <0 
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(i) The curve is bell shaped and symmetrical about the line x = pL. 
(ii) Mean, median and mode of the distribution coincide. * 
(iii) Asx increases numerically, f(x) decreases rapidly, the maximum 


one bd ° « | 
robability occurring at the point x = pt, and given by [p (x)] max = —=— - 
™ oV2 


(iv) B, = 0 and B2.= 3. 
(V) Horer =9, (r=, I, 2,...), 
and ‘py, = 1.3.5... (2r— lo", (r= 0, 1,2, ..). 
(vi) Since f (x) being the probability, can never be negative, no portion 
of the curve lies below the x-axis. 
(vii) Linear combination of independent normal'variates is also a normal 
variate. : : 
(viii) x-axis is an-asymptote to the curve. 
(ix) The points of inflexion of the curve afe given by 


wot | 
x= nto, F(x) ee] 
F T 


q t . ON2 Tt 


X= Il 
(Normal Probability Cuve) 


(x) Mean deviation va wae . on Or Oi 2 
2/t oO = 5° (approx.)J ~~" 2 3 
We have (approximately) 
LO en 
Q.D. : M.D. : S.D. 3302 (Oi: 3¢5i! 


=> Q. D.: M.D.: S.D. 10: 12: 15 
(xi) Area Property = 
P(p-O<X<p.+0) =0-6826 
P(w-20<X<p+206) =0-9544 
Pq-30<X<p+36) =0-9973 
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The following; table gives tthe, area under the normal probability curve for 
some important values of standard normal variate:Z 4 


——. 


Distances from the‘mean ordinates Area under the curve 
in terms of £6 od + 2 
Z=+0-745 50% = 0-50 
Z=+1-00 68-26% = 0-6826 
Z=+ 1-96 _ 95% = 9-95 
Z=+2.0 | 95-44 % =0-9544 
Z=+2-58 | 99% = 0:99 
Z=+,3-0 a 99:73% = 0-9973. 


(xii) If X and Y are independent standard normal variates, then it can be 
easily proved that U=X+Y and V=X-.Y are independently distributed, 
U~-N(0, 2) and V ~N (0, 2). 

We state (without proof) the converse of this result which is due to D, 
Bernstein. 

Bernstein’s Theorem. If X and Y are independent and identically dis- 
tributed random variables with finite varaince andifU=X+Y and V=X—Y are 
independent, then all r.v.’s X, Y¥; U and V_ are normally distributed. 


(xiii) We state below another result which characterises the normal dis- 
tribution. 


If Xj, Xo, ...,X, are iid. r.v.’s with finite variance, then the common 
distribution is normal if and only if : 


-_ 


K =+ 
n 


n n 
or y) Xi and Y (%-X ¥ 
i=] i=] 

are independent. [For ‘If part’, see Theorem 13.5} 

In the following sequences we shall establish some of these properties. 

3-2-3. Mode of,Normal Distribution. Mode is the value‘of x for which 
f(s) is maximum, i.e., mode is the solution of, 

f'@)\=0 and f” (x) <0 
For normal distribution with mean : and standard deviation 0, 


e jog f(x)=e- elas =p), 


' 
where c = log (1/ V2 & 6), is aconstant. 
aaa W.r.t.x, we get 


Poy 


eee ae 
ao f OF ee W) =f W=-Ge-wFw) 
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" 1 , ro ~u)* 
and f @=—+[ 1: f(x) +@—-pf cy | er alae oe 
Now f'(x)=0 => x-y=0 ie, x=y 
At the point x=, we havé from (86) 


" 1 1 1 
f )=-SLF@) aa Ge mi 


Hence x = p, is the mode of the normal distribution. 
8-2-4. Median of Normal Distribution. If M is the median of the normal 
distribution, we have 
M “4? 4 i ; 1 
f fQ)dras => GVIH f exp | ~ (x —p) /2.0"| de => 
=.) T , 


— ® , 


ie 
~ sain fpr [er wy" | ae N 


a - 
1 2 2 1. 
Wo f exp|—(x —p)- /(2 8) )t dx = ‘ 
Ty “ : ‘ »-(8°7) 

0 


J 1 
T5z J exp (-7/2) dz => 


p 
1 ¢ 7 
But ovin 5 exp > (x =p) /2'0 | dx = 
. From (8:7), we:get 


M. 
2*ovea Jo { -(-w/207 } dx = 2 


MC 7 
1 2 : 
x avin. J om |-&-wi720'}de=0 => u=M . 
Hence for the normal distribution, Mean = Median., 
Remark. From § 8-2-3 and § "8-2'4, we find that for the normal distribu- 
tion mean, median and mode coincide. Hence the distribution is symmetrical. 
8-2-5. M:G.F. of Normal Distribution. * The m.g.f. (about origin) is given 
by 


Mx (¢) =f ES) dem pe J Pew [-e-w)'/20°} ae 


3 


74 nN 
“az J ow {1 (e+ 02) exp (L272) dz, es*s"| 


= et! aaa f exp {+1 (2 - 2102) de~ a 
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2g Sot [-ale —ot)- ot} | de 


2 2 
= gittt! a/2 , 


{-$ nae} de 


2 2 . 
= titho” y aS exp (—u?/2) du 


a | a” 


- » 2 2 
Hence Myx (t) = eh itt We (8° 8) 


Remark. M.G.F. of Standard Normal Variate. IfX ~N (up, o ), then stan- 
dard normal vaniate is given by ; 


Z=(X-)/o 


3 


‘oo 


= exp (t°/2) ..(8°8 a) 
$-2-6. Cumulant Generating Function (c.g.f.) of Normal Distribution. 
The c.g.f. of normal distribution is given by 


2 2 
Now Mz (t)= et’F My (t/c) = exp (+p 1/o), esol K+ — a 


22 2 
Kx (t) = loge My (t) = log. (e*‘* ove = ut ee 


Mean = k, = Coefficient of ¢ in Ky (t) =p 
2 
Variance = k2 = Coefficient of 5 a7 pin Kx(t)= o” 


and K, = Coefficient alc oped 4... 
Thus ae oes and Mamie + Suda 3 5" 


Hence A= 0 and B= Hf #3 : 
yw ... (8-9) 
8-2-7. Moments of Normal Distribution. Odd order moments about 
mean are given by 


Hansa = f @— py" fa) de 


—o 


ze SiR: J (x- uy" : exp,|-(x— My /2 0° dx 


Un+1 = Ag S002? em 27/2) dz aad 
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[> =] 
o"*! 


ee J 2"*! exp (~2’/2) dz =0, ...(8-10) 


<jnce the integrand ge F is an odd function of z. 


Even order Suan about mean are given by 


= J wf de 


=—— | (o2™ exp (27/2) dz 


z exp (- z*/2) dz 


(since integrand is an even function of z) 
n -« dt Zz - 
tn 3 | co" Voy E | 
_ 2" 6 =—{$ 1" Tr 3)- ! 
22 I. a 


=> ar ae a Mn+; 
Changing n to (2-1), we get 


no} g2!'-2 


Hy -2 =p P(n -3) 


Un 9 T(n+4) 


= = 2 Nae ee ee = _ - 
U2n-2 ; T(n cae (n >) T(r) =(r 1) I(r 1)] 


=> Lan = 0° (20 - 1) Un-2 (8-11) 
which gives the recurrence relation: for the moments of. normal distribution. 
From (8-11), we have 
Man = [ (20 — 1) 0°] £ (20 - 3) 07) Han 
=((2n- Io" } [2 -3) 07} [ @n—5) 0" } Hon -6 
=[(2n-1)¢ ee re ek. (30°) (16°). bo 
= 1.3.5... (2n-1)o" (8-12) 
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4 


From(8 10) and (8-12) weconclude that for the normal distribution al] odd 
order moments about mean vanish and the even order moments about mean are 
given by (8-12). : 

Aliter. The above result can also be obtained quite conveniently as follows; 

The m.g.f. (about mean) ts given by 

Efe ae Bl Ee) ae! Mx (1) 
where My (1) is the m.g.f. (about origin). 


: 2 2 24 2 24 
m:g.f. (about mean) =e 4 ett Sse! oO”? 


(P0°7/2 (2 (Ff 0°72)" ge o*/2)" 
ot 31 “, 5 Sareea . | ..-(8-13) 


r 
The co efficient of in(8-13) gives |t,; the #h moment about mean. Since 


+ (1° 07/2)4 


there is no term with odd powers of t in (8-13), all moments of odd order about 
mean vanish. 


ie,  Pne1=03;n=0,1,2,. 


r" 
and 3, ¢ Coefficient of ;in (8- 13) =2- *2n)! asl 
(2n) 2" n! 
oT, 
my tL 2M (2m - 1) (2m 2) (2n - 3)... 5.4.3.2. 1, 
n! 
o” 


| 1.3.5... (2n -1)][2.4.6 ... (Qn-2). 2n | 


o*" x 
2 135 ... (Qn=1)}2 [123.2] 


= 1.3.5... (2n—1) 0” 
Remark. In particular, we have from (8-10) and (8-12) , 
3=0 and o=1.67, 4=1:3 0° 


2 4 
_ Hence B= #3=0 and Bp= M4 =22% =3, 
2 U3 


the results which have already been obtained in (8-9) . 


8-2-8. A linear combination of independent normal variates is also a 
normal variate. Let X;, (i= 1, 2, ...,2) be nm independent normal variates with 


mean pi; and variance oF cspertiveli Then 
Mx,(t) = exp pi t+ (1° 07/2) | (8-14) 


a 


n 
The m.g.f. of their linear combination 2 a; X;, where aj, a2, ...,Qn are con- 
i=] 
Stants, is given by 


My a; X;,(t) = Ma, Xy + ay X2+.. +a, Xp () 
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= Ma, ».¢) (3) ~ Ma, x2 (t) eee M o,X, (t) 


(-. Xs are independent ) 
ae Mx, (a t) . My, (a2 t) re Mx, (a, t) ..(8°15) 
[°° Mey (t) = My (ct) ] 
From (8°14) , we'have 
My,(ajt) = eet 72 
-, (8-15), gives 
Mx2,X; (1) =[ elt ait t a, 04/2 2 girartet a,0%/2 — gin t+ ah on/2 ; 
é 


-eo|{ 3 = amore 2 a /?} 


which is the m.g.f. of a normal variate with mean <2 a; uj; and variance 
ie, 

Ea’. o?. Hence by uniqueness theorem of m.g.f., 

iel 


iol 


> a;X; ~N z aipi, 2 a} oF |. 
i .-(8°15 a) 


Remarks 1. If we take a, = a27-= 1, a3 = a4 =... =0, then 


X, + X2~N (1 + p2, 01 + 03) 
If we take a; = 1, a2 =—1,:43,=a42...=0, then 
X\-X2 ~N(pi—p2, 01 + 03) 
Thus we see that the sum as Well as the difference, of two independent normal 
variates is also a normal variate. This result provides a sharp contrast to the Poisson 


distribution, in which case though the sum of two independent Poisson variates is 
a Poisson variate, the difference is not a Poisson variate. 


2. If we take 
a, =42=...ea,=1, then we get (8°15 b) 


re | tol to] 


Ex -N{ Ew, Eo? 


Le., the sum of independent fidrmal variates is also a normal variaté, which 
establishes the additive property of the normal distribution. 

3. If X;; i=1,2,...,2 are identically and independently distributed as 
N (u, 0”) and if we take a, = a2 =... = Gn =1/n, 


then Peo ea et 
iol Rojel n? jel 
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=> X ~N(y,0°/n), where X = fa Ss X; 
© ia 
This leads to the following important conclusion : 
If X;, (t = 1, 2, ...,m), are identically aud independently distributed normal 


variates with mean and variance o”, then their meanX is also N (p, o7/n) ; 


$-2-9. Points of Inflexion of Normal Curve. At the point of inflexion of 
the normal curve, we should have 


f" (x) =0, and f''' (x) #0 
For normabcurve, we have from (8:6) 
’ (x) |, . &—p)? 
py = FO | 1. ow 
oO 0 ' 
(x-)’ 
f" @=0 = es > x ( = X=uUrzoO 
oO 
It can be easily verified that at the points x = 1 +0, f''' (x) #0. 
Hence the points of inflexion of the norinal curve are given by x = 1 + o and 


f(x) = z a oie. they are equi-distant (at a distance o) from the mean. 


8-2:10. Mean Deviation from the Mean for Normal Distribution. 


M.D. (about mean) = f |x-p | f(x) dé 


- 


@ 
2 2 
“STR J lene OM de 
me fs ; ~z/2 x-U_ 
- 20 f |z | -272 
a e | 
2 
since the integrand|z|e~7’2 is un even function of z. 


Since in[ 0, ],|z|=z, we have 


i 2 
M.D. (about mean) =V27x of ze*” dz 
0 vy 


oe 2 | 
=VIVax Oo. dt a, 
si hg , E 
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—{ 


2/x oO 


2/x oO 
4 
=o? (approx.) 
8-211. Area Property (Nornial Probability Integral). If X ~N (p, 0°), 
then the probability that random value of X will lie between. XY = and X = x1 is 
given by 


1 


P(u<X<x)= J fe) de- sare aS -(x- ~WA20°) ay 
X — 


Put Pa=Z,ie,X-peoZ 


When X = up, Z=0 and when Vex ze! a = Z1, (say) . 


21 


~ P(w<X <x) =P(0<Z <2) = aah “Ade J o()de 


2 
where @ (z) = 5s er? s is the probability function of standard normal variate. 


2 

The definite integral f gy (z) dz_ is known as normal probability integral and 
0 

gives the area under standard normal curve between the ordinates at Z = 0 and 


Z = 2. These areas have been tabulated for different values of.z1 , at intervals of 
0:01 [c.f Appendix, Table IV]. 


X=U-30 X=fl-20 Xs 


+O X=u+2o X=a+3o 
Z=-3. Z:-2 2 


7:0 Z=1 2:2 233 


‘ ; noe 
Q 
x 
1 
- 
»< 
" 


In particular, the probability that a random value of X lies in the interval 
(u—0O, 2 +0) is given by 
B+O 
P(y-o<X<pto)=f f(x) de 


B-o 
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1 


=> P(-1<Z<1)=f (a ace 
| : 
= 2f e@az (By symmetry) 
0 
= 2'x-0-3413"= 0-6826 (From tables) ...(8-17) 
Similarly 
2 
P(w-20<X<p+20)=P(-2<Z<2)=f oad 
23 


2 
= 2 f (2) dz=2x0-4772 =0-9544 _...(8-18) 
0 


and 
3 


P(o-30<X<p"+3 0) =P (-3 <Z<3)=f @ (z) dz 
-3 
3 
=2f 9 (2) dz =2 x 0-49865 =0:9973 _...(8-19) 
0 


Thus the probability that a normal variate XY lies outside the range 1, + 3.0 
is given by 

P(|X—w|>3 0) =P(|Z|>3) «1-P(-—3 +Z <3) =00027 

Thus in all probability, we should expect a normal Variate to lie within the 
range 4 +30, though theoretically, it may range from—© to ©. 

Remarks, 1. The total area-under normal probability curve is unity, 2.e., 


J fa)dc=f o@dz=1 


2. Since in the normal probability tables, we are given the areas under 
Standard normal curve, in numerical problems we shall. deal with the standard 
normal variate Z rather than the variable ¥ itself. 

3. If we want to find area under normal curve, we will,somehow or other 
try to convert the given area to the form P (0 < Z < Z;), since the areas have been 
given in this form in the tables. 

$-2-12. Error Function. If X ~N (0, 0) , then 


1 i242 
f(x) = =e" oT ,-M<x<m 


2 1.. A iP? 
If we take h To then f)=7—e 
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The probability that a random value of the variate fies in the range +x is 
x 


. A 
h ya 
given by p=| fade= 7 | e”* dx 
_ 2h -ne a 2 { sh x os * 
"ae le dx = en" ¥ (nd) *) 
Taking 


\ 
. 2 

yO) = - e~” dy, (*) may be re-written as 
0 


= w (hx) = J e"* (hax A**) 
‘0 


The function y (y) , known as the error function, is‘of fundamental impor- 
tance in the theory ,of errors in Astronomy. 

8:2:3. Importance of Normal Distribution. Normal distribution plays a 
very important role in statistical theory because of the following reasons : 

(i) Most of the distributions occurring in practice, é.g., Binomial, Poisson, 
Hypergeometric. distributions, etc., can be approximated by normal distribution. 
Moreover, many of the sampling distributions, e.g., Student’s ‘? , Snedecor’s F, 
Chi-square distributions, etc., tend to ncrmality for large samples. 

(i?) Even if a variable is not normally distributed, it can sometimes be 
brought to normal form by simple transformation of variable. For example, if the 
distribution of X is skewed, the distribution of VX might come out to be normal 
{c.f. Variate Transformations at the end’of this Chapter]. 


(iit) If X~N(p, 6°), thien 
P(p~30<X<p+3 06) =0-9973 


=> P (-3 <Z <3) =0-9973 
=> P (1Z1 <3) =0-9973 
=> P (Z| > 3) =0-0027 


This property of the normal distribution forms the basis of entire Large 
Samole theory. 

(iv) Many of the distributions of sample statistic (e.g., the distributions of 
sample mean, sample variance, etc.) tend to normality for large samples and as such 
they can best be studied with the help of the normal curves. 


(v) The entire theory of small sample tests, viz., t, F, x tests-etc., is based 
on the fundamental assumption that the parent populations from which the samples 
have been drawn follow normal distribution. 

(vi) Theory of normal curves can be applied to the graduation of the curves 
which are not normal. 

(vii) Normal distribution finds large applications in Statistical Quality Con- 
trol in industry for setting control limits. 
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The following quotation due to Lipman rightly reveals the popularity and 
yMportance of normal distribution . 

"Every, body believes in the law of errors (the noimal cue) the ex. 
perimenters because they think itis a mathematical theorem, the mathematicians 
because they think it ts experimental fact” 

W J Youden of the National Bureau of Standards deseribes the importance 
of the Normal distribution artistically in the following words - 

THE NORMAL 
LAW OF ERRORS 
STANDS OUT IN THE 
EXPERIENCE OF MANKIND 
AS ONE OF THE BROADEST 
GENERALISATIONS OF NATURAL 
PHILOSOPHY IT SERVES AS THE 
GUIDING INSTRUMENT IN RESEARCHES. 
IN THE PHYSICAL AND SOCIAL ‘SCIENCES 
AND IN MEDICINE, AGRICULTURE AND 
ENGINEERING. IT IS AN INDISPENSABLE TOOL. FOR 
THE ANALYSIS AND THE INTERPRETATION OF THE 
BASIC DATA OBTAINED BY OBSERVATION AND EXPERIMENT. 

The above presentation, strikingly enough, giyes the shape of the normal 
probability curve. , 

8-2 14. Fitting of Normal Distribution. In order to fit normal distribu- 
tion to the given data we first calculate the mean LL. (say), and standard devia- 


tion 0, (say), from the given data. Then the normal curve fitted to the given data ts 
given by 


) 4 
fO) = Sax oxP' - @ Hy 72 Or, -~2 <x <o0 


To calculate the expected normal frequencies we first find the standard 
normal variates corresponding to the lower limits of each of the class intervals. 
~H 


Le., we compute 2j= ar where,” is the lower limit of the /th clas’ interval 


Then the areas. undér the normal curve <o the left of the ordinate, at 2 = 7), Say, 
@ (2;) are computed from the tables. F nally, the areas for the successive class 
intervals are obtained by subtrachon, viz.. @ (%+1)=@ (<). @ = [, 2...) and. gn 
multiplying these areas by N, wé get the expected normal frequencies. 

Example 8-10. 9 Obiain the equation of thé normal curve that may be fitted 
to the following data : 


Class. 60-65 65=70 JO-75 75-80 80-85) -85-90- 90-95 «95-100 


Frequency. 3 21 150-3335) 326—s—«=dU'3S * 4 
Also obtain the expected.normal frequenétes 


- 
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Solution. For the given data. we have 
N = 1000. p = 79 945 and o =5 545 
Hence the equation of the normal curve fitted to the given data ts 


rae 1000 =| »{ ’— 79-945 ra 
NS NP ee eee 
Vonx5 545°" | 2 5 545 
Theoretical normal frequencies can be obtained as follows . 
. fame ff. (>) A¢@ (2) Expected 
class class . = Word — We frequency 
hound NAW (2) 
(X") ~, 
Below . 

60 ~ 00 0000112 O0l2= QO 
60-65 60 0002914 | 2914 = 3 
65-70 65 0031044 31044 = 3) 
70-75 70 0 147870 147870 = 148 
75--80 75 0 322050 322.050 = 322 
80-85 80 0 319300 319300 = 319 
85-90 85 0 144072 144072 = 144 
90-95 90) 0 029792 29792 = 30 
95-100 95 0 002733 2733 = 3 
00 and 100 
over 
Total | 1000 


Example 8-11. Fora certain normal distribution, the first moment about 
10 is 40 and the fourth moment about 50 is 48. What is the arithmetic mean and 
standard deviation of the distribution ? . 


{Delhi Univ. B.Sc. (Hons. Subs.), 1987; Allahabad Univ. B.Sc. 1990} 


Solution. We know that if 1’ ts the first moment about the point X =A, 
then arithmetic mean is given by: 
Mean = A + Uy’ « 
We are given 
uy’ (about the point X = 10) = 40 = Mean = 10+ 40= ae 
Also we are given 


Uy’ (about the point X = 50) =48, 1¢, pUy=48 (-.: Mean =50) 


But for a norma! distribution with standard deviation 0, 
Wy=30° = 30°=48 ie, o=2 

Example 8-12. X is normally distributed and the mednof X i812 and S.D. 
is 4. (a) Find out the probability of the following - 

(i) X220, (i) X S20, and (UNOS X S12 

(b) Find x’, when P(X>x)=0-24. 

(c) Find xo and x1’, when P (x0 <X<x/)=0 50 and P(X > xj) =025 

Solution. (@) We have p = 12,6=4, ie., X ~N (12, 16). 
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(i) P(X220)=? 
When X=20, zoo 22 

P(X >20) =P (Z>2)=0-5 - P(OS Z¥ 2) =0-5 - 0-4772 = 0-0228 
(ii) P(X<20)=1-P(X220) 


= 1-0-0228 = 0-9772 


(:.. Total probability = 1) 


z=0 Z=2 
X-12 
(iii) P(O<X<12)=P(-3<Z<0) zani¥ 
=P(0<Z<3) =0-49865 (From symmetry) 


(b) When X=x’,Z= * = = 2) (Say) 


then, we are given 
P(X>x)=0:24 => P(Z>2z)=0-24, ie, PO<Z<2z)=0-26 


d 


0:26 


From normal tables, 
z;= 0°71 (approx .) 
x) -12 
4 


Hence =0-71 = x)'=12+4x0-71 = 14-84 


(c) Weare given 
P (xo < X $x1')=0-50 and P(X > x’) =0-25 a) 
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From (*), obviously the points.vo’ and xy” are located as shown in the figure. 


x)/- 12 
When X =x)’, zZ=~ rae (say) 
ky _ xo 12 | 
aniwhen X=xX0, 25 Qo =— 2 (It is obvious from the figure) 
We have 
P(Z>21)=025 => P(0<Z<27)=0-25 
21 = 0-67 (From tables) 
xyo- 12 
Hence >= 067 => xy = 1244x067 = 14-68 
2 ; 
ee => x9 =12-4x0-67 =9-32 


xXo°— 
and cara 
Example 8:13. X is anormal variate with mean 30 and S.D. 5, Find the 
probabilities that 
(i) 26<X S40, (ii) X245, and (iii) | ¥-304>5. 


Solution. Here p = 30 and o=5., 


=26 X= =60 
Z=-08 Z:0 7=2 
—! 9) a 
() When X=26, Z=+=4 = = 2-08 


oO 
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40-30 _ 
= 


and when X=40,Z= 2 


P (26<SX $40) =P (-0-8 <Z< 2) 
=P(-(:8<5Z<0)+P(0O<SZ<2) 


=P(-0-8$Z<0) +0-4772 (From tables) 
=P(0<Z<0-8) + 0-4772 (From symmetry) 
= 0:2881 + 0-4772 = 0-7653 
P(X>45) =? 
2:0 X=45 
Z=3 
wrenxaas, 2252303 


P(X 245) =P(Z23)=0-5-P(0SZS3) 
= 0-5 — 0-49865 = 0-00)35 
(iii) P(IX-30I1S$5)=P(25sXS35)=P(-1<SZ<¢1) 
=2P(0SZs 1)=2x0-3413 =0-6826 
P(I\X-301>5)=1-P(1X-301¢$5) 
= 1 —0-6826 = 0-3174 
Example 8-14. The mean yield for one-acre plot is 662 kilos with a s.d. 
32 kilos. Assuming normal distribution, how many.one-acre plots in.a batch of 
!,000 plots would you expect to have yield (i) over 700 kilos, (ti) below 650 kilos, 
and (iti) what is the lowest yield of the best 100 plots? 
Solution. Ifther.v. X denotes the yield (in kilos) for one-acre plot, then we 
are given that X ~N (i, 0”), where pp = 662 and o =32. 
(i) The probability that a plot has a yield over 700 kilos is given by 


P(X > 700) = P(Z> 1-19); | 


32 
=0-5-P(O<Z<t-19) 
= 0-5 - 0:3830 
=0-1170 
Hence in a batch of 1.000 plots, the expected number of plots with yield over 
700 kilos is 1,000 x O-117=117. 
(ii) Required number of plots with yield below 650 kilos is given by 
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1000 x P(X <650) = 1000 x P(Z<-0-38) Z = = 
= 1000 x P (Z > 0-38) (By symmetry 
= 1000 x [05-P(0<Z<0-38) ] 
= 1000 x [ 0:5 —0-1480 ] = 1000 x 0-352 
= 352 

(iii) The lowest yield, say, x1 of the best 100 plots is given by 

100 
P (X > x) = — = (0-1 
When Xax,, Ze see! a = 2; (say) -».(*) 
oO 32 
such that P(Z>2)=-01 => P(0sZ<2)=04 
=> 2, = 1-28 (approx.)' [From Normal.Probability Tables] 


Substituting in (*), we get 
X; = 662 + 32,Z = 662 + 32.x 1:28 
= 662 + 40-96 = 702-96 

Hence the best 100 plots have yield over 702-96 kilos. 

Example 8-15. There are six hundred Economics students in the post- 
graduate classes of a university, and the probability for any student to need a copy 
of a particular book from the university library on any day is 0-05. How many 
copies of the book should be kept in the university library so that the probability 
may be greater than 0-90 that none of the students needing a copy from the library 
has to come back disappointed? (Use normal approximation to the binomial 
distribution). [Delhi Univ. M.A. (Eco.), 1989] 

Solution. We are given : 

n= 600, p = 0-05, p = np = 600 x 0-05 = 30 
o” = npq = 600 x 0-05 x 0:95 = 28:5 => o = V285 = 5:3 
We want x; such that 


P(X <x) > 0-90 
=> P(Z<2)>0-90 Coed 
=> P(0<Z<z,)>0°40 
=> Z1 > 1:28 {From Normal Probability Tables] 
=> a= >128 => x, >304+5:3x 1:28 
=> x1>30+6784 => #£x;>36°784~ 37 


Hence the university library should keep at least 37 copies of the book. 
Example 8-16. The marks obtained by a number of students for a certain 
subject are assumed to be approximately normally distributed with mean value 65 
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and with a standard deviation of 5. If 3 students are taken at random from this sey 
what is the probability that exactly 2 of them will have marks over 70? 

Solution. Let the r.v. X denote the marks obtained by the given set of 
students in the given subject. Then we are given that X¥ = N(p, 0’) where 
u=65 and o=5. The probability ‘p’ that a randomly selected student from the 
given set gets marks over 70 is given bv 


p=P(X>70) 
70-65 


When x=70, 7-4. sl. 
p=P(X>70)=P(Z>1) 


=0:5-P(0sZs1) | 
= 0:5 —0:3413 = 0-1587 [From Normal probability tables] 

Since this probability is same for each student of the set, the required 
probability that ‘out of 3 students selected at random from the set, exactly 2 wil| 
have marks over 70, is given by the binomial probability law:, 

3C> p? .(1—p) = 3 x (0°1587)* x (0-8413) = 0-06357 

Example 8-17. (a) IflogioX is normally distributed with meant 4 and 
variance 4, find the probability of 

1:202 <X < 83180000 

(Given logio 1202 = 3-08, logio 8318 = 3-92). 

(b) logio X is normally distributed with mean 7 and variance 3, log;0Y is 
normally distributed with mean 3 and variance unity. If the distributions of X and 
Y are independent, find the probability of 1-202 < (X/Y) < 83180000. 

[Given logio (1202) = 3-08, logio (8318) = 3-92 ] 

Solution. (a) Since logX is a non-decreasing function of X, we have 

P (1-202 <X < 83180000) = P (logio 1-202 < logio X < logio 83180000) 


= P (0-08 < logio X < 7-92) 
= P (0-08 < Y < 7:92) 
where Y = logioX ~N (4, 4) (given) . 
When Y=0-08, Z= — 4 1-96 
and when “Ye 792, Z= == = 1-96 


Required probability = P (0-08 < Y < 7-92) 
| = P (~ 1-96 <Z < 1:9%6) =2P(0<Z < 1-96) 
ete (By symmetry) 
= 2 x 0-4750 = 0-9500 
(b) P| 1-202 < (X/Y) <83180000 ] 
= P [:logio 1-202 < logio (X/Y) < logio 83180000 | 
= (0-08 < U < 7-92) 
where U = logio (X/Y) = logio X — logio Y 
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Since logo X ~ N (7, 3) and logio Y ~N (3, 1), are independent, 


logio X —logio Y ~N (7 — 3, 3 + 1) (c.f. Remark 1, $ 8-2-8) 
=> U = (logio X — logio Y) ~ N (4, 4) 
. Required probability is given by 
p= P (0-08 < U < 7:92), where U ~N (4, 4) 
= 0-95 [See part (a)] 
Example 8-18. Two independent random variates X and Y are both 
normally distributed with means 1 and 2 and standard deviatio;.s 3 and 4 respec- 
tively. If Z = X —Y, write the probability density function of Z. Also state the 
median, s.d. and mean of the distribution of Z. Find Prob.{Z+1s0}. 
Solution. Since X ~ N (1, 9) and Y~N (2, 16) are independent, Z = Y-Y~ 
N(1-2,9 + 16), i.e, Ze" X-—Y~N(—1, 25). Hence p.d.f. of Z is 
2 


pP2-=—— ox | -4( ey |;-w<z<=. 
iz : 
For the distribution of Z , 
Median = Mean=—1 and s.d.=V25 =5 
P(Z+1<0)=P(Z<-1) 
Z+1 


=P(U 0); = ASw (0, »| 


: = 0:5 

Example 8:19. Prove that for the normal distribution, the quartile 
deviation, the mean deviation and standard deviation are approximately 
10:12:15. [Dibrugarh Univ. B.Sc. 1993] 


Solution. Let X¥ be aN (yu, o’). If Qi and Q3 are the first andthird quar- 
tiles respectively, then by definition 


P(X <Q) = 0°25 and P(X>Q;) #025 
The points Q1 and Q3 relocated as shown in the figure given below. 
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When X=(23,Z= _ Ce Z1, (say), 
and when X=(Q), Z= fc = — Z; (This is obvious from the figure) 
Subtracting, we have 
Q3- =21 Q3-O1 4, 
The quartile ae is given by 
Q.D. « a — 


From the figure, obviously, we have 
P(0<Z<z,)=0-25 => z, = 0-67 (approx,) (From Noimal Tables) 
: Q0.D. =07=0670=26 


For norma! distribution mean deviation about mean (c.f. § 8°2°10) is given 


by 
D eee 
Hence Q.D.: M.D.:S.D.: 220: $G:0::2 3234; £1::10:12:15 


Example 8-20 (a). Ina distribution stall normal, 7% of the items are 
under 35 and 89% are under 63. What are the mean and standard deviation of the 
distribution ? [Kerala Univ. B.Sc., May 1991] 

(b) Ofa large group of men, 5% are under 60 inches i in height and 40% are 
between 60 and 65 inches. Assuming a normal distribution, find the mean height 
and standard deviation. [Nagpur Univ. B.Sc., 1992}: 


Solution. If X ~N(, 0”), then we are given 
P(X <63)=089 = P(X >63)=0-11 and P(¥ <35) =0-07 
The points X = 63 and X =35 are located as shown in Fig. (1) below. 
Since the value X = 35 is located to the left of the ordinate at X =p, the 
corresponding value of Z_ is negative. 


When X = 35, Z-2= B 21, (say), 


X=35 X=ph X=63 
Zx-Z, Z:0 Z=Z 
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2 
and when X = 63, Z= 25H = 22, (say), 


Thus we have, as is obvious from figures (7) and (ti) 
P(0<Z <2) =0-39 and P(0<Z <7) =() 43 
Hence from normal tables. we have 
z2= 1-23 and 2; = 1-48 
63 <M 123 ang 22H 
Oo ‘oO -: 
Subtracting, we get 


- = 35 + 1-48 x 10-33 = 35 + 15-3 = 50-3 
(b) Weare given 
P(X <60)=0-05 and P (60 <X <65)=0-40 
i.é., P(X < 65) =0-45 
Since the total area to the left of the ordinate at X = Lt is 0-5, both the,points 


X=60 and X=65 are located to the left of X= and consequently the cor- 
responding values of Z are negative. 


Let X~N(t, 0”). 
When X = 65, z= S22 = — 2) (say), 


and when X = 60, L= eee = — 22 (say). 
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Thus we have 
P(O<Z<22)=0-45 and P(0<Z< 21) =005 
22= 1-645 and 2) =0-13 (approx.) (From Normal’ Tables) 


Hence “st =— 1-645 4%): and a = — ()-13 ...(#*) 


Haas wd 6) - pp _ 1-645 _ 19825 _ 5.4) 
Dividing. we get 65-11 0-13 => p= 303 65-42 
60 — 65-42 
é » (*), we hi ge ae 9 
From (*), we have Oo 1-645 Y 


Remarks. If we substitute the value of pt in (**). we get 6 = 3-23 which 
is only an approximate value since the value of 2) = 0-13, seen from the table, js 
not exact but only approximate. On the other hand, the value of z2 = 1-645. is exacy 
and hence use of (*) for estimating © gives better approximation. 

Example 8-21 Uf the skulls are classified as A, Band C according as the 
length-breadth index is under 75, between 75 and 80, or over 80, find approximate. 
ly (assuming that the distribution is normal) the mean and standard deviation of q 
series in which A are 58%, B are 38% and C are 4%, being given that if 


fio =57] exp (- x’/2) dx, 
0 


then f (0:20) = 0-08 and f (1-75) = 0-46 
{Delhi Univ. B.Sc., 1989; Burdwan Univ. B.Sc., 1999] 
Solution. Let the length-breadth index be denoted by the variable X, then 
we are given 
P(X < 75) =0-58 and P(X > 80) =0-04 (1) 
' Since P (X < 75) represents the. total area to the left of the ordinate at the 
point X=75 and P(X > 80) -represents the total area to the risnt of the ordinate 
at the point X = 80, it is obvious from (1) that the points X = 75 and X= 80 ar 
located at the positions shown in the figure below. 


0-08 
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{ « 
| 2 : 
Now On J exp (—." /2) dx represents the area under standard normal 
~ "0 


curve between the ordinates at Z=0 and Z=1t, Z being aN (0, 1) variate. 
t 
> 
F tf 4 —. =. = » 
fO=p— J exp 2/2) dx=P(0<Z<2) 


Hence f (0-20) = P (0 <-Z < 0-20) = 0-08 (2) 

and f (1-75) = P(0<Z< 1-75) =0-46 

Let up and o be the mean and standard deviation of the distributicn. Then 
X~N(L,0). 


When X = 75, Z= =p = 21 (say), 


and when X = 80, Z= cia = Z2 (say). 
Oo 


Thus from the figure, it is obvious that t 
P(X <75)=0:58 = P(0<Z<2z,)-=0-08 
.. Using (2), we have 


a=2=B=020 | .Q) 
- Also P(X>80)=0-:04 => P(0<Z<z)=0-46 
.. From (2), we get | 
— oo 2175 . (4) 


Solving the equations (3) and.(4), we get 
{1 = 74-4 (approx.) and o = 3-2 (approx.) 


Example 8-22. = /n an-examination it is laid down that a. student passes if 
he secures 30 per cent or more marks. He is placed in the first, second or third 
division according as he secures 60% or more marks, between 45% and 60% marks 
and marks between 30% and 45% respectively. He gets distinction in case he 
secures 80% or more marks. It is noticed from the result that 10% of the students 
failed in the examination, whereas 5% of them obtained distinction. Calculate the 
percentage of students placed in the second division. (Assume normal distribution 
of inarks.) [Aligarh Univ. B.Sc., 1991] 

Solution. Let the variable X. denote the marks (out of 100) in the examina- 
tion and let X ~ N (p, 6’) . Then we are given 

‘ P(X < 30) =0-10 and P(X280)=0-05 


Thus from the figure on next page, we have 
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Zz-Zy 


When xX = 30, Z = a <i (say), 
0 


and when X = 80, Z= aE = 22 (say). 


P(0<Z< 22) =0-5 —0-05 =0-45 
and P(0<Z<21)=P(-21<Z<0) (By symmetry) 
= 0-50 ~ 0-10 = 0-40 
.. From normal tables, we get 
zi = 1-28 and z2= 1-64 


Hence = =- 1-28 
0 
me Ba 2128 and OEE 1.64 
‘ Adding, we get 
5 50 
—_—= =-——=s a 
2:92 oO 7.92 17-1 


= 30+ 1-28 x 17-12 = 30 + 21-9136 = 51-9136 = 52 


The probability ‘p’ that a candidate is placed in the second division is equal 
to the probability. that his score lies between 45 and 60, i.e., 


p=P(45<X<60)=P(-0-41 <Z<0-47) [2-7 | 


17-12 
= P(-0-41<Z<0)+P(<Z<0-47) 
=P(0<Z<0-41)+P(0<Z<0-47) (By symmetry) 
, = 0-1591 + 0-1808 = 0-3399 = 0-34 (approx.) 

Therefore, 34% candiates got second division in the examination. 

Example 8:23. The local authorities in a certain city instal 10,000 electric 
lamps in the streets of the city. lf these lamps have an average life of 1,000 burning 
hours with a standard deviation of 200 hours, assuming normality, what number 
of lamps might be expected to fail (i) in the first 800 burning hours? (ti) between 
800 and 1,200 burning hours ? After what perigd of burning hours would you expect 
that (a) 10% of the lamps would fail? (b) 10% of the lamps would be still burning? 


e 
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[In a normal curve, the area between the ordinates corresponding to 
X-X 2 and *—* 
Oo 0) 


= 1 is 0:34134 and 80% of the area lies between the or- 


dinates corresponding to = + 1-28 j. 

Solution. Ifthe variable X denotes the life of a bulb in burning hours, then 
we are given that ¥~N (u, 0°), where -=-1,000 and o = 200. 

(i) The probability ‘p’ that bulb fails in-the first 800 buming hours is given 
by 


p=P(X<800)=P(Z<-1)=P(Z>1) 22 | 


= 0:5-P(0<Z <1) = 0-5 —0-3413 = 0-1587 
Therefore out of 10,000 bulbs, the number of bulbs which fail in the first 800 
hours is 
4 10,000 x 0:1587 = 1587 
(it) Required probability = P (800 <X < 1200) = P(-1 <Z <1) 
=2P(0<Z <1) =2 x 0-3413 = =0-6826 


Hence the expected number of blubs with life between 800 and 1,200 hours 
of burning life is: 10,000 x 0-6826 = 6826 

(a) Let 10% of the bulbs fail after x; hours of ourning life. Then-we have 
to find x; such that P(X <x) = 0-10 


x; — 1000 


When X = x1, Ze 70.772! (say). 
as P(Z <~+~z;)=0:10 = P(Z52,)#=0-10 
=> | iP (0 <Z <z;) =0-40 .-(1) 


We are given that 
P (— 1:28 <Z <,1-28) =0;830 = 2P(0<Z<1:28)=0-80 
=> P'(0 <.Z < 1-:28)-= 0-40 (2) 
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. From: (1) and (2), we get : 
Z; = 1:28 
Hence ¥1~ 1000 | _ 1-28 => x, = 1000-256 = 744 


a 200 
Thus after 744 hours of buming life, 10% of the blubs will fail. 
(b) Let 10% of the blubs be still buming after, (say),'x2 hours of buming 
life. Then we have ' 


tp - a + 


: Via 


P(X >x2)=0:10 = P(Z>z2)=0-10 2 a0 | 


=> P(Q<Z <7z2) = 0-40 
Z2 = 1:28 ; > +s [From (2)] 


~ 


32= 1000 4.28 => xp-= 1000 + 256 = 1256 - 7 


200 
Hence after 1256 hours of burning life, 10% of the blubs will be still buming. 
Example 8-24. Let X ~ N(p, oo). If e=p, (u>0), express 

P(X <-pu|X <p) interms of cumulative distribution function of N (0,1). 

[Delhi Univ. B.Sc. (Maths. Hons.) 1988; (Stat. Hons.). 1993] 


1.e., 


Solution. 
Ww ALE . a . , 
Pesulvewet Ee. (-.: > 0) 


P(X <a) P(X<p) ’ 
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_P(Z<-2) 7247 _X- 
~ P(Z<0) ae arr 
- PZ@>2). 

(1/2) ’ (By symmetry) 


=2[({1-P(Z<2)J=2[1-@(2)| 

where ® (.) is the distribution function of standard normal variate. 

Example 8-25 Can xX and -X have the same distribution ? 

If so, when ? » (Delhi Univ. B.A., (Spl. Course Statistics), 1989] 

Solution. Yes; X and —X can have the same distribution provided the 
p.d.f. f(x) of X is symmetric about origin i.¢., if f(—x) =f (x) . 

For example, X and ~X have the same distribution if : 

(i) X~N(0, 1) 


(ii) X has standard aad distribution [c.f. § 8-9] 


l ° 
f(x) = =-. Gee) —e <x <oo 


(iii) X has standard Laplace distribution [c.f. § 8-7] 


p(xyate'™'; —0<x<oo, 


and so on. Obviously X and Y=-X are not identical. 

Remark. This example illustrates that if the r.v.’s. X and Y are identical, 
they have the same.distributions. However if X and Y have the sz"ue distribution, 
it does not imply that they are identical. 

Example 8-26. = [f X, Y are independent normal varia.es with means 6, 7 
and variances 9, 16 respectively, determine X such that 

P(2X+Ysh)=P(4X-3Y24 A) 
[Delhi Univ. B.Sc. (Stat. Hons.), 1988; B.Sc., 1987] 

Solution. Since X and Y are independent, by § 8-2-8 [c.f. equation 
(8-15a)], we have 

U=2X+Y-N (2X64+7,4x9+ 16), ie., U~N(19, 52) 


V=4X-3Y-~N (4X6-3X7, 16X9+9-x 16), i.e, V~N (3, 288) 


and Leidioainiaiasad ah. 2 where Z ~ N (0, ;) 
and i oonislasaic ae his. 3 taal 1) 
Now P(2X+YSA)=P{ (4X-3 7) 242 
7-19 42-3 

=: < —S— i= 

za P{ zs v2 |= 23 | 

- K-19 (42-3 

52 -——«|:s«12V2 i 


[Since P(Zsa)=P(Z2b) => a=-b, 
because normal probability curve is syrametric . Z=0 ]. 
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A-19 3-42 
= Vis. 62 
= (6 V2 +4 V13) A=114 V2 +3 V13 
— 14V2 4313 
~ 6V¥2 44013 


Example 8-27. If X and Y are independent normal variates possessing a 
common mean such that 


P(2X+4YS10)+P(3X+YS9)=1 
P(2X-4Y<6)+ P(Y~3X2 1) =], 
determine the values of {1 and the ratio of the variances of X and Y. 
Solution. Let Var (X;) = 0? and Var (Y)= o3 


Since E (X) = E (Y) =, (Given) and X and Y are independent by § 8-2-8 
(c.f. equation (8-15a)], we have 


2X +4Y~N (2+ 4u, 407 + 1603), i.e, N (6, 4af + 1603) 

3X+ Y~N (3+ p, 967 + 03),-i.e., N (4p, 9 07 + 04) 
2X ~4Y~N (2 — 4, 4 ot + 16.03), i.e, N (— 2, 4 oF + 16 03) 

Y~3X~N (p— 3p, 03 + 904), ie., N (— 2 pL, 907 + 03) 
Let us further write : 

4c¢+1603=02 and Iot+03=P" (1) 
If Z denotes the Standard Normal Variate, i.e, if Z~N (0, 1), we get 
P (2X+4YS 10)+P(3X4+YS9)=1 


=> rX 


=> p z< a8 +P[ zs2—S |=) 
= P z< = ot -t-p(2525th | p[ 2228 | 
es 10-6 — 9-4 
oO B : wA2) 
(Since normal distribution is symmetric about Z = 0). 
. Similarly 
P (2X -4Y $6) + P(Y-3X21)=1 
= P z¢ot3h +P z2ttze = 
\ 
= P z<°%2t wivp{ 22th lep[ zs 5th 
= 6+2p 1+2u 
ri 8 (3) 
Solving (2) and (3), we get 


a@_6+2u_10-6y 
Bo l+2p° 4p-9 (4) 
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= (6420) (44n~-9)=(10-64) (1+2y) 
=> 5? -~2u-16=0 (On simplification) 
_24+V44+320 2418 
= ean (Maa; 
=> W=2 or -—1-6 
Substituting pf. = 2 in (4), we get. , 
a 10 _& 
Cs t.é., 4= 
From (1), we get 
40}+1603 44162 G3 
eS Taking X=—> 
9 of +05 9+% ee 
32 8 
= 4(9+A)=44+16A => A= 1273 
Again putting =— ]-6 in (4), we get 
14) _o? _4+16% _ ,_ 1280 _ 64 


| pe 942 ~~ “~ 1740 87 
Example 8:28. /f two normal universes A and B have the same total 
frequency but the standard deviation of universe A is k times that of the universe 
B, show that maximum frequency of universe A is I/k times that of universe B. 


Solution. Let N be the same total frequency for each of the two universes 
A and B. Ifois the standard deviation of universe B, then the standard deviation 
of universe A isko. Let [4) and p22 be the means of the universes A and B 
respectively. 


The frequency function of universe A is given by 


fa) = pp exp [-(e- w)?/2P 0} 


and the frequency function of universe B is given by 
. N 2 
fa) = — 75 — exp | — (x w2)?/2.07| 
Since, for a norma! distribution, the maximum frequency occurs at the point 


x = mean, we have 
( fa (x) Imax = Maximum frequency of universe A 


= | fs (x) Assi 
-| cone! —(- i) 72 Ke #|| oo TT 


Similarly 


[ fe (x) |max = fe (x) x= U2 


8 50 Fundamentals of Mathematical Statistics 


7 — ex eG. )°/20° me 
Slang Eh en <= _ ote 
| fa (x) |max es I 
| fe @) Imax =f 
EXERCISE 8 (b) 


1. ‘“‘If the Poisson and the Normal distributions are fimiting cases of 
Binomial distribution, then there must be a limiting relation between the Poisson 
and the Normal distributions.’’ Investigate the relation. 

2; (a) Derive the mathematical form and properties of normal distribution 
Discuss the importance of normal distribution in Statistics. 

(b) Mention the chief characteristics of Normal distribution and Normal 
probability curve. [Delhi Univ. B.Sc. (Stat Hons.), 1989} 

3. (a) Explain, under what conditions and how the binomial distribution 
can be approximated to the normal distribution. 

(b) For a normal distribution with mean ‘ui’ and standard deviation o, 
show that the mean deviation from thé mean ‘u’ is equal to o V(2/m). What will 
be the mean deviation from median ? 

(c) The distribution of a variable X is given by the law: 


E ade.aan 

f(x) =Constantexp| ~ 5 ‘es | |,-c<x<oo 
Write down the value of : | 
(1) the constant, () standard deviation, 
(i) the mean, (vf) the mean deviation, 
(ui) the median. (vif) the quartile deviation of 

the distribution. 
(i) the mode, (Gujarat Univ. B.Sc. April 1978) 
Ans. (i) SIR? (ii) 100, (cit) 100, (iv) 100, (v) 5 (vi) V(2/n) x 5 = 4, 
(vii) 5 x 5 = 3-33 (approx .) » ‘ 


(ef) Define Normal probability distribution. If the mean of a Normal popula- 
tion is [and its variance G-, what are its (i) mode. (ii) Median, (iii) 3) and B2 ? 

(e) Foranormal distribution NV (. o) 

(t) Show that the mean, the median and the mode coincide. 

(ii) Find the recurrence relation between pi, and Ur, -2. 

(iii) State and prove additive property of normal variates. 

(iv) Obtain the points of inflexion for the normal distribution N (pL, oO"). 

(v) Obtain mean deviation about mean. 

[Delhi Univ. B.Sc. (Stat. Hons.), 1988] 
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(f) Show that any linear combination of n independent normal variates is 
also a normal variate. [Delhi Univ. B.Sc. (Stat. Hons.), 1989] 

(g) Show that for the normal curve : 

(i) The maximum occurs at the mean of the distribution, and 

(ii) the points of inflexion lie at a distance of + 6 from the mean, where o 
. the standard deviation. [Delhi Univ. M.A. (Eco.), 1987] 


(h) Describe the steps in volved in fitting a normal distribution to the given 
data and computing the expected frequencies. 


(i) Explain how the normal probability integral 


i @ (z) dz, 
0 


"4s used in computing normal probabilities. 


4. Write a note on the salient features of a normal distribution. N (p, °°) 
denotes the normal distribution of each of the random variables X1, X2, X3, ..., Xn, 


where [Lt is the mean and o” the variance. Prove the following : 

(i) If Xt, X2,...,Xn are independent, then X) + X2+...+Xn has the dis- 
tribution N (n pi, n 0’) 

(ii) kX, where k is aconstant has the distribution N(k pL, Po). 

(iii) X+a, where a is aconstant has the distribution N (u + a, 0’) 


(iv) In (i) if X -Utet then 


\n a) has the distribution N (0, 1). 
5. (a) Show that for a normal distribution with mean ti and variance oO. 
the central moments satisfy the relation 
on = (2n — 1) Han-20" 5 Hon +1 = 0 
[Delhi Univ. B.Sc. (Stat. Hons.), 1987} 


Hence show that [2,= 


ea io)" and pn+1=0;,n=1,2,... 


(Delhi Univ. B.Sc. (Stat Hons.) 1985] 


(b) State the mathematical equation of a normal curve. Discuss its cmef 
features. 


(c) Find the momént generating function of the norma! distribution 
(m, 0° ). and deduce that 


Hane) = 0, 
Lon = 1-3-5... (2-1) 0”, 
where Lt, denotes the nth central moment. 
[Delhi Univ. B.Sc. (Stat. Hons.) 1990,’ 82) 
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(d) Show that all central moments of a normal distribution can be expresseq 
in terms of the standard deviation and obtain the expression in the general case. 


; [Aligarh Univ. B.Sc. 1992) 
(ec) The normal table gives the values of the integral: 


rg 1 : 
9W=p— | exp (-3" dt 
for different values of x . 
Explain how to use this table to obtain the proportion of observations of g 
normal variate with mean 1 and S.D.o, which lie above a given value ‘a’ , 
(i) wherea>p, (ii) wherea<p.. 
6. (a) If X; and X2 and two independent: normal variates with means 
H41 and 2 and variances of and 03 respectively, show that the variables 
U and V where U= X, + X2 and V=X,-—X2, are independent normal variates. 
Find the means and variances of U and V. 
(b) IfX; and X2 areindependent standard normal variates obtain the p.d/f. 
of (X;—X2)/V2.. 
Ans. U=(X;—X2)/V2 ~N(0, 1) 
(c) Suppose X; ~N (0, 1) and X2~N(0, 1) are independent r.v.’s. 
(i) Find the joint distribution of (X1 + X2)/V2 and (X1- X2)/V2. 
(ii) Argue that 2 X; X2 and X3 — Xf have the same distribution. 
- Ans. (i) U = (Xi + X2)/N2 and V = (Xi — X2)/V2_ are independent 
N (0, 1) variates 
(i’) Hint. ig—xt-a| 5% || BOM (UV) 
'=2 x [ Product of two independent SNV ’s ] 
2X X2=2X [ Product of two independent SNV ’s ] 
Hence the result. 
7. (a) Let X be normally distributed with mean 8 and s.d. 4. Find 
{i) P(SSX<10), (DD P(IOS X< 15), (Gi) P(X 2115), (iv) P(X S5). 
Ans. (i) 0-4649 (ii) 0-2684 (ii) 0-0401 (vi) 0-2266. 
(b) The standard deviation of a certain group of 1,000 high school grades 
was 11% and the mean grade 78%. Assuming the distribution to be normal, find 
(i) How many grades were above 90%? 
(ii) What was the highest grade of the lowest 10? 
(iii) What was the interquartile range ? 
(iv) Within what limits did the middle 90% lie? 
Ans. (i) 138, (ii) 52, (iii) Q) = 70-575; O3 = 85-425, and (iv) 60 %to 96:2% 
(c) If X is normally distributed with mean 2 and variance 1, find 
: P(tX-21<1). Ans. 0-6826 [ or ® (1) -® 1) ] 
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(d) If X~N(p=2, 0° =2), find P(1X-—11<2) in terms of distribution 
function of standard normal variate. 
Ans. Probability = P (— 1 < X <3) = ® (IA2) - ®(- WH) 
(2) IfX~N (30, 5*) and Y~N(15, 10’), show that 
P(26<X<40)=P(7S YS35). 
Hint. Each Probability = P (- 0-8 < Z< 2) where Z ~N (0, 1) 
(f) If X~N (30, 5°), find the probabilities of 


(i) 26<5X< 40, (ii) |X-301>5, (Git) X2 42, (iv) X S28 
(Bihar P.C.S., 1988] 
Ans. (i) 0-7653, (ii) 0-3174, (iii) 0-0082, (iv) 0-3446 
8. (a) In a normal population with mean 15-00 and standard deviation 
3.5, it is known that 647 observations exceed 16-25. What is the total number of 
observations in the population? (Sri Venkateswara Univ. B.Sc. April 1990) 
Hint. Let X ~N (, 0°) where p= 15 ando=3°5. 


If N is the total number of observations in the population, 

then we have to find N such that - 
Nx P (X> 16-25) = 647 

(b) Assume the mean heights of soldiers to be 68-22 inches with a variance 
of 10-8 (in. )*. How many soldiers in a regiment of 1,000 would you expect to be 
over 6 feet tall? (Given that the area under the standard normal curve between 
X=0 and X= 0-35 is 0-1368 and between X=0 and X= 1-15 is 0-3746). 

Ans. 125 [Osmania Univ. M.A. 1992, 

9. (a) If 100truecoins are thrown, how would you obtain an approximation 
for the probability of getting (i) 55 heads, (é) 55 or more heads, using Tables of 
Aréa of normal probability function. 

(b) Prove that Binomial distribution in certain cases becomes nrrmal. 

A six faced dice is thrown 720 times. Explain how an approximate value of 
the probability of the following events can be found out easily. (Finding out the 
numerical values of these probabilities is not necessary) : 

(i) ‘six’ comes for more than 130 times 

(ii) chance of ‘six’ lies between 100 and 140. 

10. (a) Thenumber (X) of items of a certain kind demanded by customers 
follows the Poisson law with parameter 9. What stock of this item should a retailer . 
keep in order to have a probability of 0-99 of meeting all demands made on him? 
Use normal approximation to the Poisson law. 

(6) Show that the probability that the number of heads in 400 throws of a 
fair coin lies between 180 and 220 is= 2F(2)— 1, where F(x) denotes the 
standard normal distribution function. 

11. In an intelligence test administered to 1,000 children, the average score 
is 42 and standard deviation 24. 

(i) Find the number of children exceeding the score 60, and 
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(ii) Find the number of children with score lying between 20 and 40, 
(Assume the normal distribution.) Ans, (i) 227 (it) 289 


12. The mean LQ. (intelligence quotient) of a large number of children of 
age 14 was 100 and the standard deyiation 16. Assuming that the distribution was 
normal, find 

(i) What % of the children had I.Q. under 80? 

(ii) Between what limits the 1.Q.’s of the middle 40% of the children lay? 

(iii) Wl.at % of the children had 1.Q.’s within the range p+ 196.6? 

Ans. (i) 10-56%, (#) 91-6, 108-4, (ii) 0-95 

13. (a) In a-university examination of a particular year, 60% of the students 
failed when mean of the marks was 50% and s.d. 5%. University decided to relax 
the conditions of passing by lowering the pass marks, to show its result 70%. Find 
the minimum marks for a student to pass, supposing the marks to be normally 
distributed and no change in the performance of students takes place. 

Ans. 47-375. . 

(b) The width of a slot on a forging is normally distributed with mean 
0-900 inch and standard deviation 0-004 inch. The specifications are 0-900 + 
0-005 inch. What percentage of forgings will be defective ? 


Hint. Let X denote the width (in inches) of the slot. We want 
100 x P(X lies outside specification limits) | 
= 100[ 1 =P (X lies within specification limits) ] 
= 100 [1—P (0-895 <X<0-905) ]. 

14. (a) The monthly incomes of a group of 10,000 persons were found to 
be normally distributed with mean Rs. 750 and s.d. Rs. 50; Show that of this group, 
about 95% had income exceeding Rs. 668 and-only 5% had income exceeding Rs 
832. What was the lowest income among the richest 100? 

Ans. Rs. 866:3 . 

(6) Given that X is normally distributed with mean 10 and 

P(X > 12) =0-1587, 
what is the probability that X will fall in the interval (9, 11)? 
Take ® (1) =0-8413 and ® (- 3) = 0-3085 


x 
where ® (x) = ao J exp (- u?/ 2) du 


Ans. 0-3830 
(c) A normal distribution has mean 25 and variance 25. Find 
(i) the limits which include the middle 50% of the area under the curve, and 
(it) the values of x corresponding to the points of inflexion of the curve.. 
Ans. (2) Limits which include the middle 50% of the area under the curve are: 
Q) = -0:6745 6 = 21-7275 ; Q3=p + 0-6745 6 = 38-2725 
(ii) (30, 20) 
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15. (a) Ina distribution exactly normal 7% of the items are under 35 and 
39% are under 63. What are the mean and standard deviation of the distribution? 

Ans. 4 = 50:3, o # 10-33. 

(b) Ina normal distribution, 31% of the items are under 45 and 8% are over ~ 
64. Find the mean and variance of the distribution. 

Given that area between mean-ordinates and ordinate at any o distance from 
mean, 


Zarit ; 0-496 1-405 
Ara : 019 0-42 


[Delhi Univ. B.Sc., 1987; Madras Univ. B.Sc., 1990] 

Ans. » #50,0 = 10 

16. (a2) -Aminimum height is to be prescribed for eligibility to government 
services such that 60% of the young men will have a fair chance of coming up to 
that standard. The heights of youngmen are normally distributed with mean 60-6" 
and s.d. 2:55". Determine the minimum specification. 

Ans. 59-9". 

Hint. We want x;s.t. P(X > x1) = 0-6 


Xi = 60-6 * 
When X=x;, Z=— >. 2-55 — 2}, (say) . ...(*) 


[Note the negative sign, which is obvious from 
the diagram] 

Obviously P (0 < Z< 2) = 0:10 => z;, = 0-254 

Substituting in (*), we get 

x1 = 60-6 — 2°55 x 0-254 = 60-6 — 0°65 = 59-95" 

(b) The height measurements of 600 adult males are arranged in ascending 
order and it is observed: that 180th and 450th entries are 64-2" and 67-8" ‘respec- 
tively. Assuming that the sample of heights is drawn from.a normal population, 
estimate the mean and s.d. of the distribution, 

Ans. 67-78", 3° 

17.(a) Marks secured by students in sections I and [I of a class are 
independently normally distributed with means 50 and 60 respectively and varian- 
ces 10 and 6 respectively. What is the probability that a randomly chosen student 
from secticn II scores more marks than a randomly chosen student from section I? 
What percentage of students are expected to secure first division (z.e., 60 marks 
or more) in section I? Write down your results in terms of the standard norma] 
distribution fucntion. 

Hint. X~WN (50, 10), Y ~ N (60, 6) are independent r.v.’s. 
U = Y —- X -N (10,16). We want P(Y >X) =P(U>0): 


(b) In an examination, the mean and standard deviation (s:d.) of marks in 
Mathematics and Chemistry are given below 
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Assuming the marks in the two subjects to be independent normal variates, 
obtain the probability that a student scores total marks lying between 100 and 130, 
[Full marks in each subject are 100]. Given that 


F (0-28) = 0-1103, F (1-94) = 0-4738, 
z 
re 1 ; { 
where Fag Jewcy) a. 


(Bhagalpur Univ. B.Sc., 1990} 

18. (a) One thousand candidates in an examination were grouped into three 

classes I, II, Ill in descending order of merits. The numbers in thé first two classes 

were 50 and 350 respectively, The b ighest and the lowest marks in class IT were 

60 and 50 respectively. Assuming the distribution to be normal, prove that the 

average mark is approximately 48-2 and standard deviation, approximately 7-1. 
The following data may be used: 


The area A is measuréd from the mean zero to any ordinate X . 


x A x A 
Go 0 J 

0-2 0-079 1-5 0-433 
0-3 0-118 1-6 0-445 
0-4 0-155 1-7 0-455 


(b) In an examination marks obtained by the students in Mathématics, 
Physics and Chemistry are distributed normally about the means 50, 52 and 48 with 
S.D. 15, 12, 16 respectively. Find thé probability of securing total marks of 


(i) 180 or above, (i) 90 or below. 

1 ¢ 
——= f exp (-7/2) dz =0-1942, === { exp (-7/2) dz =0-0224 
| v2x J Y v2 x f 


Ans. 0°1942, 00224 [Agra Univ. B.Sc., 1988] 

19. Ina certain examination the percentage of passes and distinctions were 
46 and 9 respectively. Estimate the average marks obtained by the candidates, the 
minimum pass and distinction marks being 40 and 75 respectively. (Assime, the 
distribution of marks to be normal.) (Ans. p = 36°4, 0 = 28-2) 

Also determine what would have been the minimum qualifying marks for 
admission to a re-examination of the failed candidates, had it been desired that the 
best 25% of them should be given another opportunity of being examined. 

Ans. 29. 
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20. The local authorities in a certain city installed 2,000 electric lamps in 
q street of the city. If the lamps have an average life of 1,000 burning hours with a 
sD. of 200 hours, 


(i) What number of the lamps might be expected to fail in the-first 700 
burning hours, 
(ii) After what periods of burning hours would we expect that 
(a) 10% of the lamps would have failed, and 
(b) 10% of the lamps would be still burning? 
Assume that lives of the lamps are normally distributed. 
You are given that F (1-50) = 0-933, F (1-28) = -900, 


t 
i 
1-53? 
where F © J Vax’ 
Ans. (i) 134, (ii) (a) 744, (6) 1256. [Allahabad Univ. B.Sc., 1987] 


21. (a) The quartiles of a normal distribution are 8 and 14 respectively. 
Estimate the mean and standard deviation. 

Ans. H=11, 0=4-4. 

(b) The third decile and the upper quartile of a normal distribution are 56 
and 63 respectively. Find the mean and varaince of the distribution. 

Ans. 1 = 59-1, o=5°8. 

22. (a) 5,000 variates are normally distributed with mean 50 and probable 
error (semi-interquartile range) 13-49. Without using tables, find the values of the 
quartiles, median, mode standard deviation and mean deviation. Find also the value 
of the variate for which cumulative frequency is 1250. 

[Meerut Univ. B.Sc., 1989] 

Ans. Q: = 36-51 Q3 = 63-49, o=20, M.D. = 16,.x; = 36-51. 

(b) The following table gives frequencies of occurrence of a varaible X 
between certain limits : 


Variable X Frequency 
Less than 40 — 30 
40 or more but less than 50 33 
50 and more 37 


The distribution is exactly normal. Find the distribution and also obtain the 
frequency between X = 50 and X=60. [Kurukshetra Univ. M.A. (Eco.), 1990] 
Ans. Hint. 50+ =0-:336; 40-p=-0-526 
ut = 46-12, 6 = 11-76 
N.P (50 < X < 60) = 100 x 0-2517= 25 
23. (a) Suppose that a doorway being constructed is to be used by a class 


of people whose heights are normally distributed with mean 70” and standard 
deviation 3". How long may the doorway be without causing more than 25% of the 
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people to bump their heads? If the height of the doorway is fixed at 76", how many 
-persons out of 5,000 are expected to bump their heads? 


[For a normal distributionthe quartile deviation its 0-6745 times standard 


@ 


- be batt xX 
deviation. For a standard normal distribution Z = ee the area under the curve 


between Z=0 and Z=2 is 0 4762. | 

(b) A‘normal population has a coefficient of variation 2% and 8% of the 
population lies above 120. Find the mean and S.D. 

Ans. = 122,06 = 2-44 

24. Steel rods are’ manufactured to be 3 inches in diameter but they are 
acceptable if they are inside the limits 2-99 inches and 3-01 inches. It is observed 
that 5% are rejected as oversize and 5% are rejected as undersize. Assuming that 
the diameters are normally distributed, find the standard deviation of the distribu- 
tion. Hence calculate, what would be the proportion of rejects if the permissible 
limits were widened to.2-985 inches and 3-015 inches. 

(Hint. Let X denote the diameter of the rods in inches and let X ~N (1, 0”) . 
Then we are given 


P(X>3-01) =0-05 and P(X < 2-99) =0-05 


2.01 — 9.99 — 
3-Ol-p _ 1:65 and 299-165 
oO Oo 
Solving we get p= 3 and o= aa 


The probability that a random value of X lies within the rejection limits is | 
P (2:985 < X < 3-015) = P (-2-475 < Z< 2-475) =2 x P(O0<Z< 2-475) 
= 2 x 0-4933 = 0-9866 
Hence the probability that X lies outside the rejection limits is 
1 — 0-9866 = 0-0134 

Therefore, the proportion of the rejects outside the revised limits is 0-0134, 
i.e., 1-34% ]. 

25. Derive the moment generating function of a random variable which has 
a normal distribution with mean 1 and. variance 0”. Hence or otherwise prove 


that a linear combination of independent normal variates is also normally dis- 
tributed. 


An investor has the choice of two of four investments X), X2, X3, X4. The 
profits from these may be assumed to be independetnly distributed, and 


the profit from X; is N (2, 1), 

the profit from Xz is N (3,3), 

the profit from X3 is N (1,4). 

the profit from X4 is N (22,4). 
(Profits are given in £ 1000 per annum). 
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Which pair should he choose to maximise his probability of making a total 
annual profit of at least £ 2000? (London Univ. B.Sc. 1977) 


26. (a) State the important properties of the normal distribution and obtain 
from the tables the inter-quartile range in terms of its mean pt and standard 
deviation O . 


Find the mean and standard deviation as well as the inter-quartile range of 
the following data. Compare the inter-quartile range with that obtained from mean 
and standard deviation on the assumption of normality. 


X (central values) ... 0 I ¢ 3 4 5 6 
f * (frequency) ... 5 9 15 32 21 10 8 


(b) The following table gives Baseball throws for a distance by 303 first 
year high school girls: 


Distance in feet | Number of girls Distance in feet Number of girls 

15 and under 25 l 85 and under 95 44 

25 and under 35 2 95 and under 105 3] 

35 and under 45 7 105 and under 115 27 

45 and under 55 25 115 and under t25 11 

55 and under 65 33 125 and under 135 4 

65 and under 75 53 135 and under 145 | 

75 and under 85 64 


(i) Fit a normal distribution and find the theoretical frequencies for the 
classes of the above frequency distribution. 


(ii) Find the expected number of girls throwing baseballs at a distance 
exceeding 105 feet on the basis that the data fit a normal distribution. 


27. (a) The table given below shows the distribution of heights among 
freshmen in a college : 


Height in 61 62 63 64 65 66 67 68 
inches 

Frequency 4 20 23 75 114 186 212 252 
Height in 69 70 71 72 73 74 

inches 


Frequency 218 175 149 46 18 8 


By comparing the proportion of cases lying between pt + (2/3) 0,1 + 0, 
1t+2 06 and p +36, for this distribution and for a normal curve, state whether the 
distribution may be considered normal. 


(b) Fit a normal distribution to the following data of heights in cms of 200 
Indian adult males : 
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Height in Frequency 
(cms) 
144 — 150 3 
150 — 156 12 
156 — 162 23 
162 — 168 52 
168 — 174 61 
174 — 180 39 
180 — 186 10 


(c) Two hundred and fifty-five metal rods were cut roughly six inches over 
size. Finally the lengths of the oversize amount were measured exactly and grouped 
with 1-inch intervals, there being in all 12 groups. The frequency distribution for 
the 255 lengths was 


Fit a normal distribution to the data by the method of ordinates and calculate 
the expected frequencies. 


28.(a) LetX~N(u,0°). Let 
D(x)=P[XSx], 
calculate the probabilities of the following events in terms of ®: 
(i) aX+B<t, where , B are finite constants. 
(ii) -X2t 
(iii) \X1>1 [Poona Univ. B.E., 1991] 


(b) Determine C such that the following function becomes a distribution 
function: 


2° 


2 
29. (a) Determine the constant C so that Ce 2* *% wc x<co , isa 
der.sity function. If the random variable X has the resulting density function, then 
find (i) the mean of X, (ii) the variance of X and (iii) P(X 2 1/4). 


Ans. (i) 0-25 (ii) 0-25 (iii) 0-5 


(b) Iff(x) =k-exp. {- (9 x? - 12x + 13)} , isthe p.d-f. of a normal distribu- 
tion (k, being a constant) find the m-an and s.d. of the distribution. 


‘ 2 
FQ=C | ex] SB |e 


-— 0 
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(c) If X is a normal -ariate with p.d.f. f(x) = 0:03989 exp (— 0-005 x + 
05x - 12-5), express f(x) in standard form and hence find the mean and variance 
of X. [M.S. Baroda Univ. B.Sc., 1991] 

(d) Let the probability function of the norma! distribution be 


2 \ 
P(dake “® +2N co EX C00 


cind k, pando. [Dethi Univ. B.Sc. (Stat. Hons.), 1985] 
(e) Xi, X2, X3, X4 isarandom sample from a normal distribution with mean 


100 and variance 25 and X =2(X) +X2.+X3+ Xa). 
State the distribution, expected value and variance of each of the following: 


(i) 4X ,, (i) X1-2X24Xs—-3 Xa, 


| 2 
(iti) 95 p> Xi- 100 | [Bangalore Univ. B.Sc., 1989] 
Ans. (b) Mean =2/3, o « —. 
) = se 
30. If X is anormal variate with mean 50 and s.d. 10, find P (Y < 3137) , 
where Y =X? + 1, 


0-6 
2 
ee fot? a= 02258 
Ro [Delhi Univ. B.Sc. (Hons.), 1990] 


Hint. Required Probability = P (X’ + 1 $3137) =P (- 56 < X< 56) J. 
Ans. 0-7258 
31. LetX be normally distributed with mean p. and variance o” . Suppose 


6’ is some function of 1, say 0”? =h (p). Pick h (.) so that P (X $0) does not 
depend ont for p>O. 


Ans. P(X $0)=P(Zs—-p/Vh (1) = P(ZS$- 1) ; independent of pif we 
take h(u) =p. 

32. (a) If£X is a standard normal variate, find E 1X  [Ans. ¥2/x = = 4/4] 

(b) X isarandom variable normally distributed with mean zero and vafiance 
o’ . Find EI X\ [Delhi Univ. B.Sc. (Stat. Hons.) 1990] 

Hint. E | X | = Mean Deviation about origin 


= M.D. about mean (-.: Mean = 0) 
Ans. V(2/n). o= 2 Oo 


32. (a) X is anormal variate with mean | and variance 4, Y is another 
hormal variate independent of X with mean 2 and variance 3. What is the 
distribution of X + 2Y ? [Punjab Univ. B.Sc. (Hons.) 1993] 

Ans. X +2Y ~N (5, 16) 
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(b) If X is a normal variate with mean | and S.D.06, obtain P[ X>0}), 
P[|X-1120-6 J and P[(-- 1-8<X<2-0]. What is the distribution of 4X + 5 9 

34. (a) Let X and Y be, two independent random variables each with a 
distribution which is N (0, 1). Find the probability density function of U = a, x 
+ az Y, where a; and a2 are constants. 


(b) Show that if X1, X2 are mutually independent normal variates having 
means ‘41,2 and standard deviations 61,62 respectively, then U=a, Xx, 
+ a2 X2 is also normally distributed. 


34, (c) If Xi, (i= 1, 2, ..., n) are independent N (ui, G7) variates, obtain the 
n 


distribution of Dy a; X; 
i=] 
where q;, i= 1, 2,..., m are constants. Hence deduce the distributions of : 
(i) X,+X2 
(ii) X,—X2 


n 
(iit) X a ¥, XisifX’s are iid. NU, 0%). 
i=1 
How dothe results in(é) and (ii) compare with those in Poisson distribution 
and result in (iii) compare with Cauchy distribution ? 


[Delhi Univ. B.Sc. (Stat. Hons.), 1991) 
Hint. For Cauchy distribution, see Remark 4, § 8-9-1 . 

35. (a) If X is normal with mean 2 and standard deviation 3, describe the 
distribution of Y=+X-1. Explain how you would find P (Y 2 3) from the tables, 
Hint. (a) We are given that X ~N (, o’) where bp =2,0=3. The dis- 

tribution of the new variable Y = aX + D is also normal with 

E(Y) =E (aX + b) =aE (X)+ b=au+b 

and Var (Y) = Var (aX + 6) =a’ Var (X) =a’ 0” ..(*) 
Hence Y = X-1~N (ty, 64) , where Lt; and of are given by (*) with 


1 =1.2-1=0;01=().9=2. 
Thus Y ~N (11, 01) , where [11 = 0, 61 =3 
P(Y¥232)=P(Z21)=05-P(0<Z< 1)=0-5 — 0-3413 =0-1587. 


(b) If X and Y are independent standard normal variables and if Z = aX + 
bY +c where a, b andc are contants, what will be the distribution of Z ? What is 
the mean, median and standard deviation of the distribution of Z ? 


Find P(Z$ 0:1) ifa=1,b=—landc=0. (LLT. B. Tech. 1992) 
Hint. Z ~N (c, a +b’) 


Ifa=1,b=-1,c=QthenZ<=xX-Y~N (0, 2) 
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Z-—0 
- P(ZS0-1l)=P Use |; U=—- ~N (0, 1) 
36. LetX be arandom variable following normal distribution with mean 
pand variance o” and let r be a non-negative integer. 
If w/ = E (X’) and if fo, =[ E(X—)” J, prove that 


de 
() Wre2=2MW rit (O-W) +o = 


(it) Hors2 =O? bar + PF ata [Madras Univ. B.Sc. (Main), Oct. 1989] 
Hint. (i) 
ae. Se exp | ~(x -—p)°/2 0 | dx 
+ | SESH ool ~(x—1)*/2 0} ax 
Mr | Wre2 JU rs Toate 
2 © 
(it) dhe jf Sot ee rn {~~ py?/2 07 } dx 
2r+2 
+) Sa exo - (~w)/20 jar =— Ee 4 PG? 


37. Prove that if the independent random variables X and Y have the 
probability densities, 


a he se = Py y 


Li nie 
Vn Vt 


then the random variable U =.X + Y has the probability density, 


,— 00 < (x, y) <0 


+ ef ~ 00 < Ii <0 
tT : 
where eee. 
Por KR 
a 2 n Rn 
38. If] © cipi| =9 E cto}, find P| OS YS2 Eom | 
i=] i=l i=1 


where Y= & c; Xj, X; being a normal variate with mean [Hj and varidnce oc. 
i=l 


(Allahabad Univ. B.Sc., 1988) 
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Hint. 0) 1) ss rT 
We know Y= 2 ci: Xi ~N (y, 0’), where fp = 2 cjp; and o= LF c? GF? 
i=} i= | 


Since & sad aad | p> al have u2=9 0" or f=3 


If we write z=7=b , then Z~N(O, 1). 


P(OOSY<2 2 cu)=POSY<2p) =P (-3 $Z<3)=0-9973 
i=] 


39. (a), Find the mean deviation about mean for the normal distribution 


N (u,. 0°). 3 
(b) IfX~N(u, 0’) , find.the mean and variance of 


1 2 
¥==[(-w)/o] [Punjabi Univ. M.A. (Eco.), 1991] 
Ans. E(Y) = 1/2, Var (Y)=1/2 
Remark. Also see Example 8:30, on Gamma distribution. 
(c) Derive normal distribution as a limiting case of binomial distribution, 
clearly stating the conditions involved. - [Delhi Univ. B.A. (Stat. Hons.), 1981) 


40. Iff(x) is the density function for the normal distribution with mean 
zero and standard deviation o , then show that 


J rey Pde=s4 


Hence show that if the normal distribution is grouped in intervals with total, 
frequency Ni, and N2 is the sum of the squares of the frequencies, an estimate of' 


_ NM 
OS IN VR (Gujarat Univ. B.Sc., 1992): 
eo oo 2 
Hint. J (fix) dx = | {peer (-x°72 ° dx 


__1 fer _ 1 ok 


i= 
2no *. 2no (1/0) 
° 2 2 

‘s% oJ et? de=vR/a 


a 2 
m= (mse) dees 


41. Obtain the normal distribution as a limiting case of Poisson distribution 
when the parameter A, > ©. 


42. (a) If X is N(O, 1), prove that the p.d.f. of |X| is 
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h (x) =| V2/n exp (-x2/2), x20 
0, otherwise 
(b) Let X~N(0, i) and Y~N (0,1) be independent random variables. 
Show that X + Y is independent of X-Y. 
43. IfX~N(u, 9°) and Y~N(q, 12%) are independent, and if 
P(X+2YS3)=P(2X-Y24), determiner. 
[Calcutta Univ. B.Sc. (Maths Hons.), 1989) 
44. IfX~N(0,1) and Y~N(O0, 1), prove that 
({) Var (sin X ) > Var (cos X)’ 
(ii) E\X-Y1s 8/0 
[Delhi Univ. B.A. Hons. (Sp]. Course-Statistics), 1988) 


Hint. (i) X~N(0,1) => Ox ()=E[cos tX +éisin tX)= ae tl? 


= E(costX)= =e!” and E(sin 1X) =0. 
Taking .t=1 and 2, we get: 
E(cosX)=e'”; E (cos 2X)=e"?; E (sin X) = E (sin 2X) =0. 


Var (cos X) = E (cos” X) — (E cos rghae| L+S82K | (Ecos X}’ 


=5(1 -e')= 0.99 


Similarly Var (sin X) = el Tas SE 
(ii) UselX-YISIXI+I1YI and EIXI=E1YIl=V27n- 
orX-Y¥~N(0,0 =2);  EIX-Yl=V2/n o=V4/n <V(8/n) 

45. Let X and Y be independent N(0, 1) variates. Let X=Rcos 8, 

Y=R sin 6. Show thatR and 0 are independent variates. 

[Delhi Univ. B.A. Hons. (Spt. Course Statistics), 1985] 
46. If X~/N(0O,1), find p.d.f. of |X|. Hence or otherwise evaluate 
EItxl. [Delhi Univ. B.Sc. (Maths. Hons.), 1980] 
Hint. Distribution function Gy (y) of Y=1X| is given by: 
Gy(y)=P(YSy)=P(IXISy)=P(-ySXSy) 
= P(Xsy)-P(Xs-y) 
Gy ()) = Fx (vy) - Fx (- y). 
where F(.) is the distribution function of X. Differentiating, the p.d.f. of Y= 
|X lis given by 
bY =f O) +x C yy =2f/K 0) 
=> pgy(y)=V2/n .€” ie ;y20 (By symmetry, since X ~N(0,°1) ] 


8-2-:15. The log-normal Distribution. The positive r.v. X is said to have. 
a log-normal distribution if loge X is normally distributed. 


|-tesnxP=3 a -—e7*)~ 0-43 
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Let Y= logeX ~N (1, 0”). 
Forx>0O, 
Fx (x) = P (X $x) =P (log X < log x) = P(Y < log x) 


(since log X is monotonic increasing function) 
log x 
I | 2 2 
mr Ces exp|-(y—H) /26 | dy 
vas -© [since Y~N (np, 3’) } 


__l | 2 py qd | due es 
= oven | ol (log u—p)"/2 6 7 (y = log u) 
Forx <0, 
Fy (x) = P(X Sx) =0 
Let us define 
l 2 2 
feu) =| ware PE log u—Wh'/2.0"} a >0 18 


0,u<0O 
x 


Then Fx (x) = | fx(u) du for every x and hence fy (x) defined in (8-18) 


isa p.d.f. of X. 
Remark. If X~N(u, o’), then Y= is called a log-normal random 


variable, since its logarithm log Y= X, is a normal r.v. 
Moments. The rth moment about origin is given by 


u,’ = E (X) = E (e"") [-: Y=logX = X=e"] 
= My (r) (m.g.f. of Y, r being the parameter) 
=exp{pr+iro’} [-. Y~N (pH, 0°) ) 

.. (8-19) 


Remarks. 1. In particular if we take 1 = log a, a> 0 i.e., 
log X ~N (log a, 0’) , then 


Hy = E(X)=exp|{r.logat+iro?|=ol.exp{?o/2} (8-19 a) 
2 2 
“. mean = ty’ = a e° 2 and U2’ =a e*° 


2 2 
po = plo’ ~ ps’? = a7 e® (e* - 1) 

2. Itarises in problems of economics, biology, geology, and relibility theory. 
In particular it arises in the-study of dimensions of particles under pulverisation. 

3. If X1, X2, ..., X_ iS a set of independently identically distributed random 
variables such that mean of each log X; is {4 and its variance is o’ , then the product 
X, Xz... X_ is asymptotically distributed according to logarithmic normal distribu- 
tion and with mean Lt and variance n o 
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EXERCISE 8(c) 
1. (a) Let X be anon-negative random variable such that Jog X = Y, (say), 
is normally distributed with mean Ut and variance O°. 


(i) Write down the probability density function of X . Find £ (X) and 
Var (X) . 


(iii) Find the median and the mode of the distribution of X . 
(b) If X is a normally distributed with zero mean and variance o find 
the density function of U= e* Locate the mode of the distribution. 
2. Arandom variable X has the probability density function: 
f(x) re ox0| ~ 35 (oe x0} x>0. 
Find E (X) and Var (X). [Punjab Univ. M.A. (Eco.), 1991]. 
3. Arandom variate X has the p.d.f., 


1 - (og 2°72 
f@=;xvan®*79 
0, elsewhere 


Calculate the mean, mode, standard deviation and coefficient of skewness. 
Ans. Ve, 1/e, Ve (e—1), and (1-e°*)/Ve-1 


4. The random variable X has mean m and standard deviation s. If 
Y=log X is normally distributed with mean M and standard deviation S, prove 
that ; 


()  maexp| M+45° |, (ii) +5 rs 


5. Given that X; are independent ieearanaaie normal variates with 
parameters [ti and 0;; #=2,...,m, find the sth raw moment of the variable 


Y=! (a;X).; i=152,. 


6. Show that the log-normal distribution is pose skewed i.é., mean > 
median > mode. 


Ans. Let Y=logX~N (i, 0°) 


E(Xyeeto? ; Median = e" ; Made = e#-° 


7. If X and Y are two independént log-normal variates, then XY and 
X/Y are also log-normal variates. 


Hint. Let log X ~N (11, 07) ; log Y ~N (2, 63) ; U= XY and V=(X/Y). 


tog U= log X+ log ¥ “N (hi +a, of +02) ©. x and ¥ are independent 
‘log V= log X —log Y ~N (Li ~ Ha, Of + 02) 


8. IfX~N(0,02), obtain thedistribution of e* Find out the mean of the 
distribution. [Delhi Univ. B.Sc. (Stat. Hons.), 1985] 
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9 If X~N(p, o”), find the p.d.f. of Y=e%, using the result that 


E (e'*) _ ef tera? 
Find the coefficient of variation of Y . [Delhi Univ. M.A. (Eco.), 199} 
8-3. Gamma Distribution. The continous random variable X which js 
distributed according to the probability law : 


a A-3 


f(x) =) TA) 
0, otherwise ...(8-20) 
is known as a Gamma variate with parameter and referred to as a y(A) 
variate and its distribution is called the Gamma-distribution. 


| Remarks. 1. The function f(x) defined above represents a probability 
function, since 


-A>0,0<x< 00 


_ -x A-1 ie 
J re dei | er dx = Fay | ra) =1 


2. A continuous random eer X having the following p.d.f. is said to 
have a gamma distribution with:two parameters a and 1. 


r 
fen eeen ;a>0,A>0; d<x<e| 


2 


..(8-20 a) 

=0, otherwise 

We write X ~ y(a, A) 

Takinga=1 in(8-20a) weget(8-20). Hence we may write 
X~y(A)= (1,4). 


3. The cumulative distribution function, called incomplete gamma fun- 
ction 1s 


oll te hie 
Fy (x) = J a rile uo du,x >0 
7 otherwise .. (8-20 5) 
8-3-1. M.G.F. of Gamma Distribution. pn: about origin is given by 
1X tx iw -x A-l 
Mx (t) = E (e*) = j. f(x) de= Fo! a ent ho! dy 
l t -~(l+nr A-1 l (A) 
==, |e xk axe ,Itl<1 
ray! ra) a-s 
My (t)=(1—1)7* I tl< (8-21) 


8-3-2, Cumulant Generating Fucntion of Gamma Distribution. | The 
cumulant generating fucntion Ky (¢) Is given by 


Kx () = log Mx (t)-=-log (1 —1)"* =-2 log (1-"t);|t| <1 
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=A [tt £ + f +— f +. 
2 3 4 
Mean = x; = Coefficient oft in Ky(t) =A 


2 
2 = K2 = Coefficient om inKy (=A 
«3 = Coefficient of Si in Ky (t)=2A 


K4 = Coefficient of is in Ky()=6A 


Rene one 
#5 4n 4 6 
=o sm d =e pute 
Hence Bi aa 5, an B2 = 3 3+ x 


Remarks 1. ae Poisson distribution, the mean and variance of the 
Gamma distribution are also equal. However, Poisson distribution is discrete while 
Gamma distribution is continuous. 


2. Limiting form of Gamma distribution as 4 — a We know that if 


X~yQ), then E(X)=A= p, (Say), and Var (X) =) = o” , (say). Thenstandard 
gamma variate is given by 


X-u_X-a 
Oo vA 
Mz (t) =e"? Mr (t/a) =e"? (1-t/oy* [From (8-21)]. 


-01WK a 
os 


=> Ko(0)=-Vi «1 Mos 1 


Z= 


N 
se 
“Tr 
? ? 
VR al eg at ant] 


2 
=-VRt4VR 645 ¢0(NY) 


whereO(A””?) are terms containing + and higher powers of A in the.denominator. 
rm lim g 
Kz (t -> = Mz (t) =e? 


i = 00 


lim 
A — 0 
Which is the m.g.f. of a Standard: Normal: Variate: Hence by uniqueness theorem 
oim.g.f., Standard Gamma variate tends to Standard Normal Variate as A — ©. In 
other words, Gamma distribution tends to Normal distribution for large values of 
pirameter A . 


3. For the two parameter gamma distribution (8-20 a), we have 


8-70 Fundamentals of Mathematical Statist, 


t 
Mxy(t)=|1-—| ;t<a. 
x ( ) a < a «(8°21 a) 


Proof is left as an exercise to the reader. 
Kx (t)=-Alog(1-t/a); t< 


2 
1/t 
mae , 2 (= 
Mean = ki = A/a 
Variance = ky = \/a” = Mean/a --(8:21 8) 
Hence Variance > Mean ifda <1, 
Variance = Mean ifa=1, 

and Variance < Meanifa>1. 

8-3-3. Additive Property of Gamma Distribution. The sum of inde. 
pendent Gamma variates is also a Gamma variate. More precisely, if X\, Xo, .., 
Xx are independent Gamma variates with parameters At, A2,e00) Ad: respectively 
then X, + X2 + ... + X_ is also a Gamma variate with parameter \1 + 2+ ... +, 
Ae. 

Proof. Since X; is a y (Aj) variate, 

Mx, (t) = (1-1) 
The mg. of the sum X; + X2 + ... + X% is given by | 
My 1 +Xx,+...+X, (0) = Mx, (9) My, (0) ... Mx, (0 
(since Xi, X2, ... , Xz are independent) 

=(1-)™(1-)™...(1-.™ 

so (A 1 +Azt... As) 
which is the m.g.f. of a Gamma variate with parameter A; + Az + ... + Ae. Hence 
the result follows by the uniqueness theorem of m.g.f.’s. 

Remark. If naar if X; ~Y (a, Aj), t= 1, 2, ...,.” are independent r.v.’s 


then . Xi~Y| 4, z Ni 


iol ial 
8-4. Beta Distribution of First Kind. Zhe continuous random variable 
which ts distributed according to the probability law 


_1 ~1l¢q. ywe-l. 53 
peel BQuwy AHA Oy) >0,0<x<1 


0, otherwise .-(8°22 
(where B (u,v) is the Beta function), is known as a Beta variate of the first kind 
with parameters 4 and v andis referred to as B: (u, v) variate and its distribution 
is called Beta distribution of the first kind. 


Remarks. 1. The cumulative distribution function, often called the Jncom- 
plete Beta Function, is 
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Theo 
0,x<0— 
fl 
F (x)= a’! (=u) | dus 0<x< 1, (u,v) >0 
(0) Ju jaw) (Hv) 
l,x>I (8-22 a 
2. In particular, — take t= 1 and v=! in (8-22) we get: 
=], I 
" f@)= oa 1 O<x< (822.4) 


which is the p.d.f. of uniform distribution on [ 0, 1 ). 
3. IfX~Bi (u,v), then it can be easily proved that | — X ~ Bi (v, HW). 


$-4-1. ae of Beta pene of First Kind. 
r=|xf(xyj)d= 
m ! Sw ae=ao a 


] rusnre Tr (u+v) 
=Bavy 2 Yt) = Pasrevy TUWr) 


_Patnl (pty) 
~ P(wtrtv) PF (yy) ..(8°22 b)» 


[git (h—x)’~! dx 


In particular 

7 T+ Pw+vy)_ pr r@etrvy) _ 
Mean=HtTuevel Tu) (tv) r@+tvy lr) pty ..(8-22c) 
[Tw =k-DPKk-D) 
, P+? Pmty)  Mtnerwrury 
PP Tusve2 ry tye Duty rutyr@ 

pe (1 + jt) 
~ (U+v) (tv). 

2 
he (i +p) |, 
Hence ba = Ba <p” ~(tt+v)(htvel rey 


= ee er I) | (u+y) (a+ )-p(tv+ 1) | 


=" __ 
urwutven (8-22 d) 
Similarly, we have 
2uv (v -L) 
= fs’ — 3 po’ qu’ + 2 wi? = 
W3= 3-3 pe pe + 2 (H+) utvel @tv42) 


and pL = 114’ — 4 ps” py’ + 6 pte” pa” ~3 4% 
3HVI HV e+V— ~6) +2 (+) } 


~ (ey! (U+v+1)(U+V+2) (L+v +3) 
so that 
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goth AIH ave 
: Uv(utvt 2) 
uy 2Htvel) UV(Utv—6)4+2 (b+ vy! 


and Br=tt=s—--—-- -- ~ 


: UV(U+tvV+2)(UtvV43) 
The harmonte mean A ts given by 


od t 
Wc ih gps pee cs ate avn ly. 
i keh = Buy) \ (l-—x) dx 


| Bu) gota DE) r (uy + v) 
B (u. v) T(ut+vel) Pury) 
_P(-D@tv-)E(pev—-b _ btv-l 

P(ut+v~1)(u- 1 P(t) pH — | 


eee tad 
Weve S22 ¢ 


8-5. Beta Distribution of Second’ Kind. The continuous random var, 
able X which is distributed according to the probability law : 


ols, 

FQ) =) BUY) (b+ tt 

0, otherwise ~ (8-23) 

is known as a Beta variate of second Kind with parameters \t and V and jg 


denoted as B> (ul, V) variate and its distribution is called Beta distributed of second 
kind. 


Remark. Beta distribution of second kiid is transformed’ to Beta distribu. 
tion of first kind by the transformation 


| 
l+x .{*) 
Thus if X ~ Bz (u,v), then Y defined in (*) is.a By (U, Vv). Tie proofis lef 
as an exercise to the reader. | 
8-5-1. Constants of Beta Distribution of Second Kind. 


> (u,v) >0,0<x <0 


eee => y= 
y 


{ yitr-l 
Biv) (anh 
| ied yhrn-! 


=, a 
B (up, v) 5 a Pe aa 


pr’ = fx f(x) de= 
0 


1 
= RGN). Biutrv-=r) m( 


_Pwtnlv=-n T (w+v) _P@rn@v-n 
Paty, Tw’) . rare) 
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In particular 
, Twtbrw-l) prq@Pw-l — p 
~~ TWF) FWww-HbPwW-b v-! 
vit 2) PE (v-2) © (U+ Iu (u) PT (v—-2) ee Wu (u + 1) 
B= TWO Fwwv-N)W-2)Pw-2) V-) ~-2) 


2 
[ 
fl = Lr’ — i” = H. (u + ) E= 


(v—I)(v—2) | v-I 
(v= 1) (w+ I)-p(v~ 2) |_ Busve 
i (v - 1) (v2) ~(v=1P? (V=-2) 
The harmonic mean H is given by 
eg t jaf 0 
mba FM ae= BU, a ear 


ff ae 
~ BUM, v)s (14 xh evel & = Fun 
P(u-iPiv+1) Catv) PF m-1)vr(v) oar 


Tr (ut+v) “TwPrw) w@-)FrQM-DY)rw) ‘p- 
Hence Hab 


iB 1, v+1),p>1. 


Example 8-29. The daily consumption of milk in a city, in excess of 20,000 
gallons, is approximately distributed as a Gamma variate with the parameters 
v=2andA= 10.000 ° The city has a daily stock of 30,000 gallons. What is the 
probability that the stock is insufficient on a particular day? 

{Madras Univ.-B.Sc. (Stat. Main), 1990] 

Solution. If the r.v. X denotes the daily consumption of milk (in gallons) 
in a city, then the r.v.. Y= X—20,000 has a,gammia distribution with p.d_f. 


- y/10,000 
26)= I 2-1 ,- y/10,000 _ Y @ 


— =———~ ;0<y<o 
(10,000)° T (2) (10,000) 
[ See 8-20 (a) ] 
Since the daily stock of the city is 30,000 gallons, the required probability 
‘p’ that the stock is insufficient on a particular.day is given by 


p= Ane ee P (Y> 10,000) a) 
Seder 


=f 20)dy= : 
fe 4 js ‘000? ” 


. + 
= J ze "dz ; 
" 


Integrating by parts, we get 


5° 


{Taking z= 10,000 } 


374 Fundamentals of Mathematical Statistic, 


p=|-ze™ | +Jefaase'-|e'| 
a1 
| 


=e!+e!=2/e 
Remark. Sincev=2, the integration is easily done. However, for genera] 
values of A and v , the integral is evaluated by using tables of Incomplete Gamma 


Integral, [see Tables of Incomplete Gamma Fucntions, K. Pearson, Cambridge 
University Press] of the form 


ax, 


which have been tabulated for different values of @ and n. 
Example 8-30. /[fX~N(u,0°), obtain the p.df. of: 


2 
U=! cal 
2 Oo 
Solution. Since X ~N (p, 0°), 
dP (x)= =o ete a wer cee 

2 

Let on3(434] => xP au 

0) 
dx = To du 


’ Hence probability differential of U is 


= I -u 6 -u -1/2 
qd) = "bao? ep du=syre u“*~ du 


a fee 
=e. & 2 du 0 <u <0 


the factor : —_* from the fact that total probability is unity. 


. dGW=aa ros 8 ul"! du, 0<u<eo [--T@=ve] 
1(X-p) . a 

Hence U =F a IS a YG) variate. 

Example 8-31. Show that the mean value of positive square root of a 


¥(W) variate is T(p+ sv T (41) . Hence prove that the mean deviation of a normal 
variate from its mean is V2/% 6, whereo is the standard deviation of the 


distribution. [Delhi Univ. B.Sc. (Stat. Hons.), 1986] 
Solution. Let X bea y@) variate. Then 
e X = 
f= x 7U>0,0<x<0 


My) 
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, r Tr (u +3) 
EWX)=( 2 #0) dre [tll ae 2 
oe Js vores rus? r (u) 
If X~N (p, 0°), then 
1(x-p) 
U lee is a y(4) variate. (cf, Example 8-30) 


IX-pl =V2 oU'”, where U is a ¥@) variate. 
Hence mean deviation about mean is given by 
E|X-pl|=E(v20U'”)avV2Z0EU™”) 
G+) Wo 
=V26 r@ =e = V2/0 fe} 
Example 8:32. If X and Y are independent Gamma variates with 
parameters 1 and V_ respectively, show that the variables 
_Xx_ 
X+Y 
are independent and that U is a y (+ V) variate and Z is a B; (1, Vv) variate. 


[Delhi Univ. B.Sc. (Stat. Hons.), 1991] 
Solution. Since X is a y(j) variate and Y is-a,y(v) variate, we have 


fixyadr= ext! dx 0<x<0,>0 


U=X+Y,Z= 


I 
YT (u) 
Avda =Ty 


Since X and Y are independently distributed, their joint probability dif- 
ferential is given by the compound probability theorem as 


1 alee 2 = 
dF (x,y) = fi (2) fay) de dy = hl yw’) de dy 


eV y’~ | dy ;0<y <0, v>0 


x 
Now u=x+y,Z=——, so that x=uz, y=u-—x=u(1-2z) 


x+y 
Jacobian of transformation J is given by 
ox oy z l-z 
_9(%y)_ | du Ou ee 
~9(mz2) [ax dy | _ 
OZ Oz u —u 


As X and Y range from 0 to ©, u ranges from 0 to ~ and z from 0 to I. 
Hence the joint distribution of U and Z is 


dG (u,2)= 8 (u,2) dude= oe" (ua [u(l—z)]’ lil du dz 


l = 


se wu ptv-l op-lyy jy -l 
ray rw) °° u z" (1l-z)~”  dudz 
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_ pol geves | ao vei : 
shares | saw" (1 -z) a | 


=[ 2) (u) du] [ 22(z)dz|, (say), 


- +v-1 
«yt v 


where g (u)= e », 0<u<c 


I 
Pr (w+ v) 
l p-! v-1 
z 1-z -0<z< 1 
Buy. “oO? 
From (*) and (**) we conclude that U and Z are independetnly distrib: 
U as a y(w+v) variate and Z as afi (u,v) variate. 


Example 8:33. If X and Y are independent Gamma variates 
parameters 1 and v respectively, show that 


and g2(z)= 


X 
U=X+¥,2=4 


are independent and that U is ay(u+v) variate and Z is a B2 (pL, V) variate 
[Rajasthan Univ. B.Sc. (Hons.), 1 
Solution. As in example 8-32, we have 


- as fe e 
dF) = FAD Fwy)? +>) Ht Vy ' dx dy, 0< (x, y) <2 
Since u=x+y and z=, 
1424 UL _ Wz _ {yo 
eee => y= 1+2Z and ara ot 
_O(yy) _ —Uu ‘ 


~O(42 (1 +z) 
Asx and y range from 0 too, both uw and z range from 0 to oo: H 
the joint probability differential of random variables U and Z becomes 


-1 v-1 
as l Sag | HZ _u, | 
EG = FT Oy Feral bers 1 dudz 


-u wt+v-) N -1 
= Sty a ay ; 
Pr (u+v) B(u,v) (1-+z)# 
0<u<0,0<1 


Hence U and Z are independently distributed, U as ay(fp+v) va 
and Z as a B> (uu, Vv) variate. 


Remark. The above two examples lead to the following important res 
If X is a y(M) variate and ¥ is an independent y (v) variate, then 


(i) X+Yisay(uw+V) variaie, i.e., the sum of two independent Gal 
variates is also a Gamma variate. aie 
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(ii) y isa Bo (u. v): variate, i.e., the ratio of two independent Gamma 


variates is also a gamma variate. 
ii) >> ts a Bi (Ue Vv) variate. 
Example 8-34. Let X and-Y have joint p.df. 


-(v+) ve 3 


ge)" Tar 
= Q, — ‘here. 

Find (1) p.d.f. of U= TF , GQ) ECU) and (iit) EL U- -E(U)y na 
(Allahabad Univ. B.Sc. 1992) 


x>0,>0 


Solution. Let u= * and v=x+y 
xtFy 


=> xX=uv,y=v—-x=v—-—uv=v(l—2u) 
Jacobian of transformation is 
ax ax 
Oy) _ [du av}_} © * J _ 
O(mv) Oy Oy] -v I-u - 
du dav 


Hence joint p.d.f. of UV and V becomes: ‘° 
p (u,v) = g (x,y) 14! 
es l -Vv 3 4 
“T4ars © -(uvy [v (1 u) | Xv 
ple = ty 
“T4Ts © 


vw (lu): 0<susisv>0 


aa 
; < | and since x>0,y>0 
y 


.we have 0<u<1 and v=x+y20] 


me -v 8) _T9 3 4], 
-| i “la (1 ot | 
O0<u<1,%>0 


= p (u, v) on (v) . p2 (u) : | (*) 


where precy & Ve -u>0 


T9 se. 
are aah (l-u) ;O0<u<l. A) 


From (*), we conclude that U and V are independently distributed and 
from (**), we conclude that 


XxX 
U= X+ y Bi (4, 5),. 
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i.e, U is a Beta variate of first kind with parameters (4, 5). 
Aliter. We have 


Bee 1 ~(x+y) 3 4 
8% N= TPs ° x Jy 


-|te-r 3 ||/toyy 
-| Fae ? || | 


= gi (x) g2(y);x>0,y>0 
=> X and Y are independently distributed and X ~y(4) and Y~ y(5) 
Hence 


U= (4, 5) 


x 
X+Y ~Bi 
| \ ] 
Now EW)=] u.prWdu=zay Jui - uy di 
0 0 


B(4,5)° 0°" [Using Beta integral] 


SS eae — = = 


r4rs* Tid 14. 919 9 


E(U) = — fo? (1 —u)* du 
0 


B(4, 5) 
rm  T6rs 
= B45) ~ PO) =Tars* Ti 
_ 3x4 2 
~10x9 9 
; ‘ 16 2 
E[U-E(U)P=E(U)-[E(WP=$-P =e 


Example 8-35. A random sample of size n is taken from a population with 
distribution: 


(x to" te 
e v4 cad a, = 
dP (8) = Taye 7 - “O<x< ,a>0,A>0 
Find the distribution of the mean X. 
Solution. [Delhi Univ. M.Sc. (OR), 1990] 
Mx () = E (e%) =| e* f(x).de 
0 
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2a hi e(2-}s 2-1 dx 
0 


I (A) a 
1 - TQ . —ary* + 
Taye E etre) =A) 
a 


M2 (t) = Ma, +x,...+X/n (0) = Mx, +X, 4...45, (t/n) 


[°° Mex (t) = Mx (ct) ] 

= My, (t/n) Mx, (t/n)... Mx, (t/n), 
(since Xi, X2, ...,.X_ are independent). 

Hence on using (*), we get 

: -A a 
at ft 
us| Le os i= 

which is the m.g.f. of a Gamma distribution (c.f. Remark 3, $ 8-3-2). Hence by 


uniqueness theorem of mg.f.,X ~y (n/a, nA) with p.d.f. 


-nd 


(7) eax“! Qex<m 


Example 8-36. A saniple of n values is drawn from a population whose 
probability density is ae™™ , (<x =0,a>0). If X is mean of the sample, show 
that na X is a y(n) variate and prove that 

= 1 = 1 
E(X ar and SE. of X ee 


(Marathwada Univ. M.A., 1991) 
Solution. 


f(x) =ae*30sx<0,a>0 


Mx()=f &f()deoa fe? ™ d& 
0 0 


paki eee (a >i) 
-(@-2)}, a-—t’ 


=@ 


n . 
' Manz (0) = Ma, 4+ X2+...+X,) (t) * MX, +X,+...+X, (at) 
= My, (at) . My, (at) ... My, (at), . 
(since the sample values are independent). 
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+. Man¥(t) = qT Mx, (at) = Mx; (at) iG 
(since X1, X2, ...,X_ are identically distributed). 


* Magz()) = + =(1~—1)", which is the mg.f. of a y(n) variate. 
Hence by uniqueness ee of m.g.f., anX isay(n) variate. 


Since the mean and variance of ay (”) variate are equal, each being equal to 
n, we have 


E(anX )=n => anE(X )=n, ie, E(X )=+ 


and V(anX )=n => a’ n°? V(X )en Le., Vet 
na? 


Hence standard error (S.E.) of ¥ =VV(X — 


Example 8:37. Let X ~ 61 (u,v) andY~y (A, +) be independent ran- 
dom variables, (u,v, 4 >0). Find a p.df. for XY and identify its distribution. 
(Delhi Univ. B.Sc. (Stat. Hons.), 1987] 


Solution. Since X and Y are independently distributed, their joint p.d.f. 
is given by 
arty -i +V— 
f(x,y) = Tue’ iS aes 
O<x<1,0<y<© 
Let us transform to the new variables U and Z by the transformation 
xyy=u,x=z te, X=zandy=u/z 
Jacobian of transformationJ is given by 


p-l Laan va] 
7 Buy (1 x x 


2691) 4 | a2 
aaa z) = a z 
Thus the joint p.d.f. of U and Z becomes 
are w- v-1 -Aw/z[{ otis 
642)" Fa yrery ? -(1-z) € . z lJ |; 


O<u<w,0<z<1 
Integrating w.r.t.z in the range 0 <z<1, the marginal p.df.ofU is given 
by 
gery wevol I (1-z)"! eo hw 
Pwr) Ze 


1 ve 
Laterateyl aca "ples 
Pwr) ~ 2742 


dz 
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Thus 
query ywevad © -1 -Au(l48 1 
oT aprey SM) [pte 
qty bevel Au a She eel 
rere Jee 


quev pwevel Au T (v) 


“— T@re) au 


eu! Qecuce 


._* 
Pr (u) 
* Hence U = XY is distributed .as a gamma variate with parameters A and 
p, Le. XY ~y (Ap). 
Example 8-38. Letp ~ 1 (a, b) where aand bare positive integers. After 


one observes p, one secures a coin for which the probability of heads is p. This coin 
is flipped n times. Let X denote the number of heads which result. Find P (X = k) 
fork =0, 1, 2,...,.n. Express the answer in terms of. binomial cu-efficients. 


Solution. Since p ~ a (a, 5), its p.d.f. is given by 


1_ nn 
PO)= say P Ap, O<p<! 
P(X=k| the probability of success in a single trial is p) 
- (zea aete 


P(X =k)= f P(e) PK =k|p) dp 
0 


1 
“Sa yd “(pha yap 
0 


7 ns 
~ 3 


JP ie 6 | —p)"*?- ~k- ‘dp 
0 


ai Bla+k,n+b-—k) 


E 


Bab | (1) 
We have 
1 Fimtn) (m+n—-1)! mn ee 


Bim, n)— T (m) T (7) “(m- 1) !(n—- 1!” men{ om .«-(2) 
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ee. 
~ 


n\ ab (a+b 

Piel a 
Pat) a+k) n+b—-h n+a+b 
(n + a +b) ( atk 


a (* ‘s 
= ‘ ab (n +a +b) 


n+atb\ ‘(a+b)(a+k)(n+b-hk 
atk. 
Example 8-39. Given the Incomplete Beta Function, 


Bz (l,m) = f x27 * G-x)"" de 
0 


and f, (1, m)= B, (i, m)/B (I, m) , show that ® 
I, (i, m) =1-11-x(m, f). 
Solution. We have 


Iz (1, m) B (1, m) = By (I, m) = f 27! (1—x)"~' de .a(*) 
0. 


1 1 
= f'n de foal ae 
0 x 
1 


=B(l,m)- f 2-1 (1-x)""! de 


z 


In the integral, put 1 —x = y, — 
I, (1, m) . B (I, m) = B (I,m) - Foye dy) 


ee 


= B(I,m)- jrva ~ yl dy 
=B (i, m) p< YW) = B (i, m) -I-x (m, NB (m, 1) 
[From (*)] 
Since B (i, m) = B (m, |), we get on dividing throughout by B (i, m) 
I, (i,m) =1-—Nh_x (m,) 


EXERCISE 8() 
1. (a) Suppose the frequency function of a random variable is given by 
¥ e* 
f= RY? for x>OQ » 


0, otherwise 
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ere k is non-negative integer. 
(i) Find the moment generating function of this distribution. 


(ii) Determine the mean and variance of this distribution. using moment 
cnerating function. 


(b) If X is a Gamma variate with parameter A, obtain its m.g.f. Hence 


deduce that the m.g.f. of standard gamma variate tends to e’ 7/2 as 1 > oo. Also 
interpret the result. [Delhi Univ. B.A. (Stat. Hons.), 1988, ’82] 


(c) X and Y are two independent gamma variates, with parameters / and m. 
prove that (X + Y) is a gamma variate with parameter (/ + m). 


(d) If X,, Xz, ..... X_, are independent and identically distributed gamma 
random variables, what is the distribution of X, + X2+...+X,,? 
[Delhi Univ. B. Sc. (Maths. Hons.), 1988] 
(e) Cosider a random variable.X with the following p.d.f. 


wh 


f(x ‘awe ,0<x<0; 0,8 >0 


Find the moment generating function of X. 


Let the random variable X with above p.d.f. be defined as X ~ Ga (a, B). 
Then prove the following theorems : 

Theorem 1 : If X and Y are independent and X ~ Ga (0,, B) and 
Y ~ Ga (Q2, B), then X + Y ~ Ga (a + ap, B) 

Theorem 2 : If X and Y are independent and X ~ Ga (a, B) and X + Y ~ 
Gu (O, + O%2, B), then Y ~ Ga (Qu, B) (Mysore Univ. B.Sc. 1993) 

2. (a) Define the Beta variate of first kind. Obtain its mean and variance. 
Also define. the Beta variate of second kind and state its relation. with Gamma 
variates. (Nagpur Univ. B.Sc. 1993) 

(b) Write down the Beta probability functions of the first kind and the 
second kind with parameters ut and v. Show that a Beta variate of the first kind 
can be obtained by a transformation of a Beta variate of the second kind. 

(c) If X has a Beta distribution, can E (1/X) be unity ? 

mtn-J 


Ans. X ~ B (m,n); E (1/X) = ~ yp 2 1 No. 


3. Let X ~y (A, a) and Y-y (A, b), be independent random variables. 
Show that : 


ee 
X+Y] E(X+/Y) 
Hint. Since X ~ y (A, a) and Y ~ y (A, b) are independent, U = X/ (X + Y) 
and V= (X + Y) are independent. Hence 


E(UV)=E(U)E(V) = Ew) =£|-~]. 2+ Y), 
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4. (a) If X ~ y(A, p) and y - y(A, v) are independent random variables, 
show that 


X 
“7 ~ B.(n, Vv) 

(b) X and Y are independent Gamma variates find the distribution of 
XIN + Y). 

(¢c) If VY is random variable having as its rth moment 


(k +r)! 
By kt 
k being a positive integer, show that its probability density function is 
cr -x 
rad 2 Xx >0 
Kx» = " x <0. 
0 


(dq) Ifthe r.v. XY is such that 
- FAX =(n t+) YRVK In = 1, 233, ... 
k being a positive integer, find the p.d.f. of X. 


F ] 
Ans. + {teh + 7 [Delhi Univ. M.A: (Econ.), 1987] 
5. If X and Y are independent Gamma variates with parameters p and 
v respectively, show that the variables y-y 


U=NX+YandV= 
arc independent variables. 
6. If. and Y are independent Gamma variates with parameter A and 1 
respectively, show that the variables: 


X+Y 


” 


e 


U= NV+): [= 
(a) V+ ¥ and YoY 


are independently distributed and identifv their distributions. 
[Delhi Univ. B.Sc. (Stat Hons.) 1991| 
(6) V=X + Yand V=N/Y are independently distributed, U ~ y(A + p) 
and / ~ B, (A, p). 
7. <A simple sample of n values x), x5, ..., x, iS drawn ‘from the ' 
population: 


dP(x) = a e* x"! de, Osx <0 
If x is the mean of the sample, find the distribution of 7 x . Hence find the 
mean and variance of the distribution. 
8. (a) show that for a y(A) distribution, 
Mean - Mode | _ iI Bs 
Oo 7 Vr 2g" 
Show that the excess of kurtosis of the distribution is 6/X. 


qheorctical Continuous Distributions eee 
(b) Show that the mean value of the positive square root of ay (A, 1) variate 


wrt) / INR EOL. 


Hence prove that the mean deviation of a N (UL, 0”) variate from its mean is 
5V2/t 

{Gauhati Univ. M.A. (Eco)., 1991; Delhi Univ. B.Sc. (Stat Hons.), 1989] 

Hint. Proceed as in Example, 8-31. 


9, Show that the mean value of the positive square root of B(, Vv) variate 


ys 


r{ +3 rary 


ru) rye | 
10. (a) For the distribution : 


ae peg ee v>2 
Bu, v) (b+x)ht¥" 


how that variance is H+. “)) 
(v- 1)" (v-2) 


Find also the mode and Ht,’ (about origin). Also show that harmonic mean is 
y-I/V. 


(b) Find the arithmetic mean, harmonic mean and variance of a Beta 
distribution of first kind with parameter i) and v. Verify that A.M. > H.M. 
Also prove that if is the geometric pee then: 


log G = Bu ITE Y CLG 2 [ tog v2 - ~log E(u +v) | 


11. Given the Beta distribution in the following form : 


l 
P= Fae Lat JX &(1 xj ,a>-1A>-1,0<xS1 


ind its variance. 


dP (x)= 


Also find the distribution of (i) . » (i) 1% ; 

12. If X is anormal variate with mean p1 and standard deviation o , ‘find 
he mean and variance of Y oennee ‘by 
Sel © (Meerut Univ. B. Sc., 1993) 
86. The Exponential Distribution. A continuous random variable X as- 


uming non-negative values is said to have an exponential distribution with 
Rrameter 8 > 0, if its p. : f. ’ gervel by 


x “x20 
I= ; oiiawie .. (8-24) 
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The cumulive distribution function F (x) is given by 
F (x) = | f(u) du=8 | exp (-9 4) du 
0 0 
, . | l-exp(-8x), x 2 0 ’ 
PQ) = ne otherwise --o[ 8-24 (a) 
Moment Generating Function of Exponential Distribution 


86-1. 
Mx ()=E(e%)=0 | ee dx 
: 0 
-of exp (0-1) x} dx = —2— O>1 
(8-1) ° 


0 


(4-36) 


r 


u,/ = E (X’) = Coefficient of - in Mx (1) 


r! 
Sp he 2 ce, 


Mean = |)" = 7 


Variance = U2 = U2 — ieee ee 
H2=—2 - 1 oe 
Theorem. If Xi, X3, ..., X, dteiridependent random variables, X; having an 


exponential distribution with parameter 6;;f=1,2,....n; then Z=min 


(X1, X2, ..., Xn), has exponential distribution with parameter 2 @;. 
i=l 
[Delhi Univ. B.Sc. (Stat. Hons.), 1986] 


Proof. Gz (z)= P(Z <z)=1-P(Z>z) 
=1-P[min(X, X2,...,Xn)>Z ] 
=1-P[X;>z, i=1,2,..,2] 

[1-P(Xi <2) ] 

1 


n n 

=I- TI P(X;>z)=I1- dl 
i=l [ 
n 

=1- I [1 - Fx, (z)] 


+ fz] 
whete F ‘is the distribution function of X; . 
n 
=1-m{1 -(\1 8) 
(c.f. 8-24 (a)] 


i=1 
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1 i 


Oe be oy: 2>0 
= i=l 


(). otherwise 


(xz 8,) 2 | ( >z af. z2>0 
gz(=y i= 


0, otherwise 
n 

=> Z=tnin (X), Xo, ....X,) is an exponential variate with pafameter2 0;. 

iol 

Cor. If Xi;i=1,2,..., ave identically distributed, following exponential 
distribution with parameter 8, then Z = min (X}, X2, ..., Xn) is also exponentially 
distributed with parameter n 0 . 

Example 8-40. Show that the exponential distribution ‘‘lacks memory’’, 
ie. if X has an exponential distribution, then for every constant a20, one has 
p(¥sxlX2a)=P (X Sx) forallx, where Y=X-a. 

(Delhi Univ. B.Sc. (Stat. Hons.), 1989; Calicut Univ. B.Sc. (Main Stat.), 1991] 

Solution. The p.d.f. of the exponential distribution with parameter 6 is 


f(x) =0 679% 50>0,0<x<00 


We have 
P(YSxNxX2a)=P (X-aexnX2a) (.. Y=X-—a) 
=P(XSatxNX2a)=P(asXSartx) 
atx 
=0) 8 drae (1-6 
a 
and P(X2a)=0| e 8 dy =e 79 
P(¥sxNXe2q@) | -Ox 
RSs X28) 9 paeay A) 
z 
Also P(Xsx)=0 f &*dr=1-e"* (#4) 
“ a 


From (*) and (**), we get 
P(Y¥sxIX2a)=P(XEx) 
ie, exponential distribution lacks memory. 
‘Example 8-41. X and Y are independent with a common p.d,f. (exponen- 


tial): 
e*, x20 
x)= 
we ee 


Find a p.d.f. for X — Y, [Delhi Univ. B:Sc. (Stat. Hons.), 1988, ‘85) 
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Solution. Since. and ¥ are independent and identically distributeg 
(i.i.d.), their joint p.d.f is given by 
ey 0 y>0 


Sxy (x, y) = ‘ 

otherwise 

u=x-y Xx=utv 
Let = > ‘; ah; --(1) 
d(x,y) jl 1 
G(u,v) |O 1 
Thus the joint p.d.f. of U and V becomes 
—(u+2v) 


=] 


y= 


glu. =e 

(1) => u=zX-vVoOVveEx-—uU 

Thus v>-uif-wo<y<(0 
and v>QOifu>0 
For —-ox<y<Q, 


-v>0,-a<u <0 


co 6) 


g(u) = J g(u, v) dy = | e7 (ut2v) dv =e" = 
-y 


-u 
and for u > 0, 
[7 6) 


e” 


—2 


—u 


g(u) = J glu, v) dv= e" = : eu 


Hence the p.d.f of U = ¥ - Y is given by 


ee Sz 0 
2 


gu) = | 1 


—e " u>0 
2 


These results can be combined to give 


I - 
lu) = Fe ~~ RDKUKD 


which is the p.d.f. of standard Laplace distribution (c.f § 8-7). 
Aliter. 


My(t)= J e™ f(x) de = | eID oy = 


Ss 


g (lox 


-(-f) 


ae 
P l-t 


0, ; 0, 
.. Characteristic function of Y is 


1 
a Ba a (f), 
(since .Y and Y are identically distributed.) 
. Py_y() = Oy +(-Y) (t) = py (rt) P_y (ft) (.° VY, Y are independent) 
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l I 


(l-it)}(l+it)) 140? 
which is the characteristic function of the Laplace distribution, (c.f. § 8°7) 


= Px(t).Qy (-1) = 


1 oie 
g(u) iad Wl co <u <a A*) 
Hence by the uniqueness theorem of characteristic functions, U =. — Y has 
the p.d.f. given in (*). 
8-7. Laplace (Double Exponential) Distribution. A continuous random 
variable Y is said to follow standard Laplace distribution if its p.d.f. is given by 


Lo -txt : 
S(X) = i aes (8°25) 
Characteristic function is given by 
oo _ 1 oa) | ; 
ity ae a tte —|x| 
ot) = [¢ LOS [e .e "dx 


| (7s) tee] 
5 | cos tx.e H! de +i J sin tre. e7?! de 


—ws —-W 


(os) 
| “is 
== 2) cos tx .e7F! de, 


0 
Since the integrands in the first and second integrals are even and odd 
function of x respectively. 


J e* costx ax 


QD = 
0 
=I-f | e“ cos tx dé (on integration by parts) 
=]- P Py(t) 
= Pf) = —; ...(8°25 a) 


The mean of this distribution is zero, standard deviation is ./2 and mean 


deviation about mean is 1. 


Remark. Generalised Laplace Distribution. A continuous rv. X is said to 
have Laplace distribution with two parameters 4 and p if its p.d.f. is given by 


I 
f(x) = op [-|x-plA],-w<x<0;A>0 (8°26) 


Nop 
Taking amr. in (8-26) we obtain the p.d.f. of standard Laplace variate 


given in (&-25). 
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Moments. The rth moment about origin is given by 
om ry _ l f Fivce. -|x-p| 
W’, = E(X ) = ory J xX exp Gowen 
lf ; 7 
== J A +H)" exp(-le) E a | 
_i { z : (za)* tok exp(-|z|) dz 
2 k= Kk 
Se > (7) yr* fo exp(-|z]) dz 
2 k=0]\k 


-w 


wm 1 
+| e iz" dz 
() 


> () MTR SD CDE n| 


k=0 


Vier \ak rk, pel 
5\(i)* po kN+(-2)"} | .. (8°26a) 
Mean = Mr = = hy = and at = 2 + 23? 
Similarly we Can ebisin higher se moments from (8-26 a) and hence the 
values of B, and B, can be obtained. 
The characteristic function-of (8-26) can-be obtained exactly, similarly as we 
obtained the characteristic function of standard Cauchy distribution, c.f.§ 8.9. 


8-8. Weibul Distribution. A random variable XY has a Weibul distribution 
with three parameters c (> 0), a(> 0) and p if the xv. 


X a 4 
Y= (==*) (i) 
a x 


has the exponential distribution with p.d_f. 
py) =e%, y>0 il) 
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The p.d.f. of X is given by 


c-) : 
px (y= co! a] exp E a |.onero (tit) 


The standard Weibul distriQution is obtained on taking @= 1 and p=0, so 
that the pd.f. of standard Weibul distribution which depends only on a single 
parameter c is giver by 

px (x) =ex' 7! exp (-3x°)s4>0,c>0 ...(iv) 

8-8-1. Moments of Standard Weibul Distribution (iv) 

For standard Weibul distribution, (a= 1,1=0), from (i), we get Y= X° 
which has the exponential distribution (ii). We have 


wp=E(X')=E(Y'“)'=E(¥”) 


= |e.y” ay 


0 [-. Y has p.df. (ii) ] 


=> V=P| +1 
erie ,..(V) 
Mean = E (X)=T at 
and Var (X) = E (X’)=[E(X)] 


= "(+ 1 }Hr(e+ l 
c c 
Similarly, we can obtain expressions for higher order moments and hence 
for Bi and B2. For large c, the mean is approximated by 


E(x) = Teh F* 


= 1-0-57722 c°' +0-98905 ¢? 

where ¥=0:57722 is Euler’s constant. 

The distribution is named after Waloddi Weibul, a Swedish physicist, who 
used it in 1939 to represent the distribution of the breaking strength of materials. 
Kao, J.H.K. (1958-59) advocated the.use of this distribution in reliability studies 

‘and quality control work, It is also used as a tolerance distribution in the analysis 
‘of quantum response data. 

8-8-2. ‘Characterisation of Weibul Distribution. Dubey, S.D. (1968) 
has obtained the following result : 

‘‘Let X;(i=1,2,...,n) be i.i.d. random variables. Then min 
(X;, X2, .... Xn) has a Weibul distribution if and only if the common distribution of 
X;’s is a Weibul distribution’. 

Proof. Let X;, ((= 1, 2,...,2) be iid. rv. each with Weibul distribution 
(iif) and let Y= min (X, X2, .... Xn). Then 

P,(Y > y) = P[ min (X1, X2, .... Xn) > y ] 
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P a x» | 
i=l 


rp (X,>y) =( P(X;>y) ] 


= 1 


since X;’s are iid. nv.’s. 


eo c-l c 
Now pouay= J ea'| oH | oo] (58) Ja 


) 


il 
—, 
*e 
2. 
rs | 
i" 
a 
a 
mae 
eT 
a | 


= exp bs ey 
a , 
Substituting in (*), we get 


| 
von 
| 


ak 


This implies that Y has the same Weibul distribution as X;’s with the 
difference that the parameter « is replaced by an“. 

8-8-3. Logistic Distribution. A continuous r.v. X is said to have a Logistic 
distribution with parameters @ and f, if its distribution function is of the form: 


Fx (x)=[1+exp{-(x-a)/B]], B>0 (8-26 b) 


l ! 
7 ] + tanh pe-ave | ; B>O (826 0) 
(See Remark | on page 8-94). 


The p.d.f. of Logistic distribution with parameters a and f (> 0) is given by 
f= £F@) 


= (1 +exp|-(r—a)/B} 1? exp [-@-0/B}3 


_1 2Jle_ 
2p sech {36 a)/B 
The p.d.f. of standard 


...(8-26 d) 


...(8-26 e) 


Logistic variate Y=(X—a)/B, is given by: 
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L 
er Q)=fQ). 4 
dy 
9 
=e '( l+e' ,- cy <oo ...(8-26 f) 
ag ee gS nee (826 8) 
The distribution function of Y 1s: 
GyQva(lte'y!; -e<y<c0 ..( 8-26 h) 


Logistic distribution is extensively used as growth function in population and 
demographic studies and in time series analysis. Theoretically, Logistic distribu- 
tion can be obtained as : 

(i) The limiting distribution (as 1 — o-) of the standardised mid range, 
(average of the smallest and the largest sample observations), in random samples 
of sizen 

(ii) A mixture of extreme value distributions. 

Moment Generating Function. Thé m.g.f. of standard Logistic variate 
Y is given by: 

1Y fy 
pr @=E(e”)=] e.g 0)dy 


—- 09O 


=[& er (l+e?)y? dy 


oo ~ -2 
= J eerie) dy 


=f ee (1tey dy 


. -1 Se ee 
Put z=(l+e) “—— : 


. My()= “| (454 £) ay =| fr ‘(1-2)’ dz 


= B(1-¢, +d, ee, 
=Td-ayrd+o)/7T 2 
rd-s M+9 


tf cosec Kt 3t<l ... (8-26 8) 
22 
~,ett,_/) ata 
Suge © gage 4) 
(See Remark 2 below.) 


.. E(Y) =Coefficient of ¢ in (*) = 0 
= Mean = 0 
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P 2 


lu» = E(Y’) = Coefficient of srin(y=y 


U3=E (Y°) =0 
4 
Ly = E(Y') = Coefficient of a in (*) = as x 
Hence for standard Logistic distribution : 


Mean = 0, Variance = [2 = n/3, 


2 
[13 4 7x9 
ce a sep Late 
ep eae gg 
Remarks 1. We have 
tanh x= Sinhx _ ere . j-¢* 
~coshx ek¢e*% Lee 
= ttanhx=—2 > Sf +tanhx ]=(14e4y! 
I+e 2 
Pe 
2. x cosecx=)+ 5 +5 0x +... 


x 
Proof. x cosecx =——— = 
sin x 


il 
ae 
pad 
{ 
re. 


il 
+ 
LN 
al* 


ul 
+ 
| 


3. We have: 


gQj=e™ Nit ls (142) 809) 


=> The probability curve of Y is symmetric about the line y=0. 


Since p.d.f. g (y) is symmetric about origin (y = 0), all odd order moments 
about origin are zero Le., 


Worst = E(Y y7*1=0, r=0,1,2,. 
In particular 
Mean = 1)’ =G 


,’=rth moment about. origin 
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= 7th moment about mean 
= [- 
= Mae) =Woret = 0 
i.e., all odd order moments about mean of the standard logistic distribution are 
zero. 
In particular u3=0 = B, =0 
4. The mean and variance of the logistic Variable (X) with parameters & 
and B are given by-: 


E(X)=E(a+BY) - on wl 
= a+ B £E(Y) 
= oO 
Var X = Var (a + B Y) = B’ Var (Y) = B* 27/3. 
5. We have: 
yc 
Goyer! = ee eR 
€ l+e 
=> 1-Gi)=I- a = 
lt+e’ |l+e 
e 
OOS eae ...(826 j) 


(cf. Remark 3) 
GQ) _» siege 
Also 266)" eo = ) oe 1-G(y) ...(8-26 k) 


6. Mean deviation for the standard Logistic distribution is 


oo i-1 
2 pee ae =2 2 atl ae = 2 log, 2 
. 2 3 4 nee i. 


Proof is left as an exercise to the reader. 


EXERCISE 8(e) 
1. (a) Show that for the exponential distribution 
p(x)=yo.€ 7, OSx<~; o>0, 
mean and variance are equal. Also obtain the interquartile range of the distribution. 
(Delhi Univ. B.Sc. (Stat. Hons.), 1985, 1982] 


(b) Suppose that during rainy season on a tropical island the length of the 
shower has an exponential distribution, with parameter A = 2, time being measured 
in minutes. What is the probability that a shower will Jast more than three minutes? 
If a shower has already lasted for 2 minutes, what is the probability that it will last 
for at least one more minute ? [Madras Univ. B.Sc. (Main Stat) 1988] 


§ 96 Fundamentals of Mathematical Statistics 


2. (a) [f Xi. X2.....X_ are independentrandom variables having exponen- 
7 
tial distribution with parameter A. obtain the distribution of Y=Z X,. 
pal 
(b) Obtain the moment generating function and the cumulant generating 
function of the distribution with p.d-t. 


fa) e*9;0<x<%,0>0 


(Madras Univ. B.Sc. (Main Stat.) Oct. 1992] 
Hence or otherwise obtain the values of the contants By, B2, yi and y2. 
(c) A continuous random variable X has the probability density function 
f(x) given by 
=Ae. “5 x>0O 
f(x) = 0, otherwise 
Find the value of A and show that for any two positive numbers § and ¢, 
P(LX>st+tlX>sJ=P[X>t]. 
3. If X; and X2 are independent and identically distributed each with 
frequency function e ‘,x>0, find the frequency function of X; + X2. 
(b) lf X,, X2,..., Xn, are independent r.v.’s X, having an exponential dis- 
tribution with parameter 0;, (i= 1, 2,...,”), then prove that Z=min (Xi, X2, 
n 


...» Xn) has an exponential distribution with parameter Z 9; 
i=} 
[Delhi Univ. B.Sc. (Stat. Hons.), 1990, ’88, ’86] 


4. Let X and Y have common p.d.f. ae %* ,0<x<00,@>0. Find the 
p.d.f. of 


(i) X°, (@)3+2X, (iii) X-Y, and (iv) 1X-YI 


Ans. (i) ae exp(— ax!) , (ii) a x>3 


(iii) . e%'*! allx,and (iv) ae **,x>0. 


5. (a) X and Y are independent random variables each exponentially 


distributed with the same parameter 8. find p.d.f. for and identify its 


X 
X+Y 
distribution. [Delhi Univ. B.Sc. (Stat. Hons.), 1989] 

(b) The density functions of the independent random variables X and Y 
are: 
Kkiwsrke*  , x>0,A>0] frQyare®? , y>0,A>0 
=() , «s0 =0 , Otherwise 
Find the density function of the random variable Z= X/Y . 
6. (a) For-the, distribution given by the density function 
fQ)HLEN wc xcos, 
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obtain the moment generating function. 
(6) Find the characteristic function of standard Laplace distribution and 
hence find its mean and standard deviation. 
[Delhi Univ. B.Sc. (Stat. Hons.), 1990] 
7. (a) If X has exponential distribution with mean 2, find PCY < 1) |X < 2) 
Ans. P(X < 1) [P(X < 2) = (1 - e®\/(1 — e?°) where 0 = 1/2. 
[Delhi Uniy. B.A. (Spl. Hons. Course-Statistics), 1989] 
(b) If X~Expo (A) with P(X < 1) = P(X> 1), 


(*) 
find Var_X. [Delhi Uniy. B.Sc. (Maths Hons,), 1985] 
Hint. PLY < 1) + PX > 1) = 1S PY <1)=1/2 [Using (*)] 


Ans, Var (X) = 1/2 = I/ (log e*)* 
8. (a) Show that Y = — (1/A) log F (x) is Expo (A): 
{Delhi Uniy. B.A. Hons. (Spl. Course-Statistics), 1985] 


if 
Hint. y(t) = Ete!) = E exp 2 log F 09 | 


=E[F(Xy “| = E[Z“] where Z = F (x) ~ U (0, 1] 
(b) If X,, X5, 3 and X, are iid. N (0, 1) variates, show that Y =, A, - 
X, X,, has p.d.f. 


1 
SW = 5 oxP [-|yl],-o<y< 


{Indian Civil Services, 1984] 
Hint. Show that pf) = 1/(1 + t?) = Y has Standard Laplace distribution. 


oOo @ 
~(ax? + 2 hyy + by*) = us 
Use : J J . dx dy = 


sae Vab—h? 


9, 200 electric light bulbs were tested and the average life time of the bulbs 
was found to be 25 hours. Using the summary given below, test the hypothesis 
that the lifetime is exponentially distributed. 

Lifetime in hours : 0-20 2040 40-60 60-80 80-100 

Number of bulbs : 104 6) 24 12 4 

[You are given that an exponential distribution with parameters a > 0 has 
the probability density function: 

p(e) = a e%, (x > 0) 
= 0, (x < 0) 
[Institute of Actuaries (London), April 1978] 

10. Find the first four cumulants of the Laplace distribution defined by 
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Fo) =| exp{-ix-w} |: -wexee,2>0 


and hence find the values of 1; 6, yj and: 2 . Calculate also the semi-interquartile 
range (S.LR.) | 
Ans. KL =H, 2 = 22, K3 = 0, Ky = 12 Msm=ap.o=V2 A450, ¥2=3 and 
S.LR. =A log, 2 
11. The p.d.f. of ar.v. X follows the following probability law 
p (x)= a5 exp | -2—?" 
20 ) 
Find m.g.f. of X.. Hence or otherwise, find E (X) and Var (X). 
[Delhi Univ. B.Sc. (Stat. Hons.), 1986] 
12. X;,i=1,2,...,n are iid. r.v.’s having Weibul distribution with three 
parameters. Show that the variable Y= min (X}, X2,...,Xn), also has Weibul 
distribution and identify its parameters. 
[Delhi Univ. B.Sc. (Stat. Hons.), 1984] 
13. Obtain the moment generating function of Logistic distribution and 
hence find its mean and variance. ‘(Delhi Univ. B.Sc. (Stat. Hons.), 1993] 


14. (a) Obtain the p.d-f. g (y) and the distribution function G (y) of the 
standard logsitic variate and prove'that : 


(i) .g (y) issymmetric about origin. 
(i) 80) = GO)[1-GO)] 
GQ) 

1-G() 


(b) Obtain the m.g.f. of standard logistic variate and hence prove that: 
; 


> Mm@COKXcoo, 


(iii) y=loge 


Mean=0, ‘Variance = 1’/3, 
Bi=0, B=; Maye) =0 
and mean deviation about mean = 2 log; 2. 

8-9. Cauchy’s Distribution. Let us consider a roulette wheel in which the 
probability of the pointer stopping at any 
part of the circumference is constant. In 
other words, the probability for any value 
of 6 lies in the interval [— 1/2, %/2 ] is 
constant and consequently @ is‘a rectan- 
gular variate in the range [—-2/2, 2/2 ] 
with probability differential given by 

dP (8) =(1/n) d8,-1/2<O8Sn/2 
= 0, otherwise 
(8-27) 

Let us now transform to the vari- 

able-X by the substitution: 
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x=rtan® = dv=rsec’ @d0 
Since —- 2/2 <9 <n/2, the range for X is from — 2 tooo. Thus the prob- 
ability differential of X becomes: 
{ dx | dx r ax 
MPO) ge 
Rorseci@ TM [ry lt+e /ry ty MR rex 
In particular if we take r= : we get 
1 
(x) =— 
f a 
This is the p.d.f. of a standard Cauchy variate and we write X~ C (1,9) 


,—~ eK X <0 


>? pe ee ee 


Definition. A random variable X is said to have a standard Cauchy 
distribution if its p.d.f. is given by 
1 
fx (®) = —— > 1-8 SX < 00 
m(1 +x’) ...(8-28) 
and X is termed as a standard Cauchy variate. 


More generally, Cauchy distribution with parameters A and pt has the 
following p.d.f., 


»- omc yon: A>0 


) = S”:O””:—CSE=" b ] 
Os [A°+(y-p)7] (8-29) 


and we write X ~ C (A, 1) 


But putting X=(Y—p)/A in (8-29), we get (8-28). 


8-9-1. Characteristic Function of Cauchy Distribution. If X is a standard 
Cauchy variate then \ 


l 
Ox () =o 
7 ‘a ) 
To evaluate (*) consider Lapalce distribution 
fi (z ete! —00 <7 <00 
1 
Then O(N=E (7) =— 
1+tf [From (8-25 a) 


Since @ (f) is absolutely integrable in (—, e), we have by Inversign 
theorem 


—e'ts =5> f itz l r en 
xe '=fi@)=5- Je m Wdt=7 dre 
- itz ‘dl iz 
= eMed |S f ane NT 
a cent (Changing (tto- 1 


On interchanging ¢ and z, we get 
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CP) 
From (*) and (**), we get 

ox(nee'" ...(8-30) 
Remarks. 1. If Y is a Cauchy variate with parameters A and uf, then 


x=7ck => Y=p+ax 
oy (N=E(e act Ee) =e! ox (tr) 
= gitto All a 5g ...(8-30 a) 


2. Additive Property of Cauchy distribution. Jf X; and X2 are inde- 
pendent Cauchy variates with parameters (Ai, p41) and (A2, 2) respectively, then 
X\ + X is a Cauchy variate with parameters (Az + X2, Ly + 2). 

Proof. ox, () =exp{ipt—Ajlelt, G=1,2) 

x, +x, (t) = Ox, (1) Ox, (4) (Since Xi, X2 are independent) 


= exp| itu + Uo) = (Ay + Ad) | a 
and the result follows by uniqueness theorem of characteristic functions. 


3. Since ’x (rt) in (8-30) [where (’) denotes differentiation w.r.t. t ] does 
not exist at t=0, the mean of the Cauchy distribution does not exist: 


4. Let Xj, X2,....Xn be a sample of n independent observations from a 
/ 


= 1 4 
standard Cauchy: distribution and define X = x X;. Then 


i=l 
GX (t) = Or x, (t/n) = IT [ Qy, (t/n) } 
i=] 


= [ @x, (t/n) ]’ (since X;’s are iid.) 
oe [ gen ee = x (t) 


Hence by uniqueness theorem of characteristic functions, we have: 


The arithmetic mean X_ of a sample X\, X2, ..., Xn of independent obser- 
vations from a Standard Cauchy distribution is also a ‘standard Cauchy variate. 
lrrother words, the arithmetic mean of a random sample of any size yields exactly 
us much information as a single determination of X.”’ 


This implies that the sample mean X, of arandom sample of size 7, as an 
estimate of population mean does not improve with increasing n, whichcontradicts 
the Weak Law of Large Numbers (WLLN). 


$92. Moments of Cauchy Distribution. 


0 x. 
EM=fwoa-5S a Go 
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Although the integral fats 7229 4Z, IS not completey convergent, i.e., 
a 


lim * f 
bo fee cen e lim 
v — oo Js = 5 dz does not exist, its principal value, viz., 2 < 2 dz 
ae Nn >Or MN +z 
n’ > o~n a 


exists and is equal to zero. Thus, in the general sense the mean of Cauchy 
distribution does not exist. But, if we conventionally agree to assume that the mean 
of Cauchy distribution exists (by taking the principal value), then it is located at 
x=. Also, obviously, the probability curve is symmetrical about the point 
x=... Hence for this‘distribution, the mean, median and mode coincide at the point 
x= pU. 


2 
n= £(Y-wr= f o-wesoray=* J ow wy, 
ns 2 + (yp)? 
which does not exist since the integral is not convergent. Thus, in general, for the 
Cauchy’s distribution 1,, (r 2 2) do not exist. 


Remark. The role of Cauchy distribution in statistical theory often lies in 
providing counter examples, e.g. it, is often quoted as a distribution for which 
moments do not exist. It also provides an example to show that 


@x + ¥ (t) = Ox (1) Pr (2) 
does not imply that X and Y are.independent. [See Remark to Theorem 6-23] 
Let Xi, X2, .... Xn be a random sample of size n irom a standard Cauchy 
n 
distribution. LetX = = X, “a Since E (X,) does not exist (-.- mean of a Cauchy 
i=1 
distribution does not exist), E (X° ) does not exist either and the definition of an 


unbatsed estiamte does not apply ta Xs 
Cauchy’distribution also contradicts the WLLN [See Remark 4, § 8-9-1]. 
Example 8:42. Ler X have a (standard) Cauchy distribution. Find a p.af. 
for X’ and identify its distribution. [Delhi Univ. B.Sc. (Stat. Hons.), 1989;.’87] 
Solution. Since X has a standard Cauchy distribution, its p.d_f. is 
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| 
f(y =r ! 7 TOOK Soo 


oe oo 
Th: distribution function of Y= X7 is 
Gy O)= PCY Sy) = PU Sy) = PCW £XS Vy) 


Ww 


1 d 
- [40 de =2— J ree 


== ian! (Vy), O<y<e 
The p-d.f. gy(y) of Y is given by 
er a aoe age 
gy) = 4G, [HW |" x Gsy) 2 
=i yi? 1 yii/2) -1 
“eR l+y” BS. Ly (1+ yuaein > 0 
This is the p.d.f. of Beta Sccdhatiea of second kind with parameters 
(34 ), L.e., X’ ~ Bo (4 ; 
Remark. Here y =g (x)= x’, gives g’ (x) =2x which is sometimes >UV atiu 


sometimes <0. Hence Theorem 5-9 can not be used in this case. 
_ Example 8-43. Let’ X~N(0,1) and Y~N(0,1) be independent ran- 


dom variables. Find the distribution of X/Y and identify it. 
[Delhi Univ. B.Sc. (Stat. Hons.), 1990; Nagpur Univ. B.Sc., 1991] 
Solution. Since X and Y are independent N (0, 1), their joint p.d-f. is 


given by 
— (x “ey 2 


fy (x, 9) = Sx (x) - fr O)=5a-€ 


Let us make the following sali of variables 
u=x/y,v=y so that x=uv,y=v 

Jacobian of transformation J = v. 

Hence the joint p.d.f. of U and V becomes 


Suv (u, 5 = exp {-w? + yv2}ivi 
==—exp{-(1+w’) V/2}Ivl,-o<(u, Vv) <0 


The marginal p.d.f. of U is , 


guu) =5— exp{—(1+u’) 7/2} I vldv 
lye ~+__ at a 
os (1 + 2) [Ga+u 2=9] 
l = 1 


~ (1 + u2) Ne eee 


~e7! Qa 
9 m1+u?)’ 
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which is the p.d.f. of a standard Cauchy distribution. 


Thus the ratio of two independent standard normal variates is a standard 
Cauchy variate. 


Example 8-44. Let X and Y be i.i.d. standard Cauchy variates. Prove that 
ee 

the pdf, of XV is: % | toslet), 

mn | x-1 [Delhi Univ. M.Sc. (Stat), 1991) 


Solution. Since X and Y are independent standard Cauchy variates, their 
joint p.d.f. is given 2 


x; 

ai ae z ian 

Let u=xy and v=y. Then Jacobian of transformation is given by 
1 u 


wo <(x,y)<@. 


Fee VS. al et [yes ef] 
a (u, v) 0 1 v 
Thus the joint p.d.f. of U and V is given by 
a oe: 
) a A u 2 
[1+ ]a en 


oe i ;-O<(u,v)<0 
x (u? + v*) (1 +v’)’ 


Integrating w.r.to v over the range —© to ©, the marginal p.d.f. of U is 
given by 


: _if-___|y 
~ few ae al@ ca TCP ald 


Wie babs 
0. (u? aay 


(Since the integrand is an even function of v.) 


gu) 3a (u? G@ahaed” 


iy 1 1 
3d GZ malta? ur +? 5a» 
“zap | et - ~log (u? +v)| 


z=— 1 lo dav 7 
x? (u? —1) lite do 
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i 
.———— 1 log ve —log 1 
x (u’ -1) u* ue 
7a +1 
v 


yo @ 


: 2 log hu 
log 1+ 2 log|u eee] _ cy ee 
[ log aan  (u? 1) 


ee 
x (u’ -1) 
EXERCISE 8(f) 


1. (a) Show that a function 


k ° 
x3 p, d) = —>——5 ; -ocex<m 
f(x3n, A) eau 


represents a frequency function of a distribution for a suitable value of k. 
Determine k and obtain median and quartiles of the distribution. Hence interpret 
the parameters A and p of the distribution. 


(0) IfX is a Cauchy variate with parameters A and p, find the charac- 

teristic function py (#) . Discuss briefly the role of Cauchy distribution in Statistics. 

[Bombay Univ. B.Sc. (Stat.), 1993] 

(c) ‘‘The role of Cauchy distribution often lies in providing counter ex- 

amples.’’ Justify. [Delhi Univ. B.Sc. (Stat. Hons.), 1991, ’88] 
(d) Discuss briefly the role of Cauchy distribution:in statistics: 

If Xi, X2, ..., X, are independent standard Cauchy variates, show that the 


meanX =(X1+X24¢...+Xn)/n, is also a Cauchy variate. 
(Delhi Univ. B.Sc. (Stat. Hons.), 1986] 
Z. (a) '€X and Y are independent random variables following Cauchy 
distribution with parameters (Aj, wi) and (A2, W2) respectively, show that 
X + Y follows Cauchy distribution with parameters A; + A> and p+ p2.- 


(b) Obtain the characteristic function of Cauchy distribution 
dx 
aF (x) = ———,, 
@) (1 +2’) 


If Xi, X2,....Xn are independent Cauchy vaniates, show that the mean 


—-w<x<@ 


X -+ 2X is also a Cauchy variate. 


3. LetX and Y be standard normal variates. Find the distribution of 
U=X/|Y}. 


Ans. io — 


51 -O<KUK<@ 
KX L+u 


4. Aneedle spins about the point (0, 5) of the x-y plane with b>O and 
comes to a stop thereby making anangle p with Y—axis. The direction of the needle 
then intersects the x-axis at a point (X,0). Assuming @ is a r.v. with uniform 
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probability distribution on (—x/2,n/2), what is the distribution function and_ 
hence p.d.f. of X ? 

Aus: Fe a+ in Ge * eee eres 
: IU y a el Xn x +! a? 


5. If X\, X2,X3,X4 are independent standard normal variates, find the 
distribution of At + a3 
is %*X," 
6. X, is the mean of n independent random variables distributed like X, 


and X ‘bas a symmetric distribution. If X, has exactly.the same distribution as XY 
for all n, then prove that the characteristic function of X is 
by (t) = “l" 
for some real constant c > 0. Tdentify this distribution. 
7. IfX~N (uy, 03) and Y ~N (to, 03) are independent random varaibles, 


: X=" 

obtain the p.d.f. of U = ros (LLT., B. Tech. 1993) 

8-10. Central Limit Theorem. The central limit theorem in the mathe- 
matical theory of probability may be expressed as follows : 

“If Xi, (i= 1,2, ...,n) be independent random variables such that 
E (X)) = wi and V(X;) = 07, then it can be proved that under certain very general 
conditions, the random variable Sy =X, + X2 + ...+ Xn, ts asymptotically normal 
‘with mean w and standard deviation where 

n n 
u= Jw and o= 5 Ot 
iol iel 

This theorem was first stated by Laplace in 1812 and a regorous proof under 
fairly general conditions was given by Liapounoff in 1901. Below weshall consider 
some particular cases of this general central limit theorem. 


De—Moivre’s—Laplace theorem. (1733). A particular case of central limit 
theorem is De-Moivre’s theorem which states as follows: 


cere 2 "| 1, with probability p 
‘  }0, with probability q 

then the distribution of the random variable S, =X, + Xz + ...+Xn, where X;’s 
are independent, is asymptotically normal as n —> @.”’ 

Proof. M.G.F. of X; is given by 

My, (t) = E (e*) =e! p+ e-° ge (q+ pe’) 
M.GF. of the sum S, = X; = X2 + ... + Xp is given by 
Ms, (t) = My, +X2 +... +X, (¢) sae My, (¢) . My, (t) eee My, (t) 


= (Mx, (since X;’s are identically distributed) 


= (q+ pe)", 
which is the M.G.F. of a binomial variate with parameters n and p. 
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E (Sn) =np = (say), and V(Sn) = npq = 0’, (say). 
Sn -E (Sn) ae Sn—v 


Let Za 
VV (Sn) o 
Mz (1) =e"? Ms (t/o) [c.f. Chapter 6] 


ze" Pa | g + pea | 
R 


2 
tr. -3/2 
| ee a | [c.f. Example 7-19] 


where O (n™*””) represents terms involving n°”? and higher powers of n in the 
denominator. 


Proceeding to the limits as n —> ©, we get 


lim lim CA -3n ; lim ¢ ‘2 
eae Mz(t)= |. 1+57+O(n ) ree l+7 =e 
which is the M.G.F. of a standard normal variate. Hence by the uniqueness theorem 
of M.G.F.’s 


jel 
re 
Hence Sy = Xi + X2+...+Xn is asymptotically N (p, 0’) as N—> ©, 
Remarks 1. From this theorem it follows that standard binomial variate 
tends to standard normal variate as n — ©, In other words, binomial distribution 
tends to norma] distribution as 2 —> ©. 
z. Convergence in Distribution or Law. Let |X, } be a sequence of r.v.’s 
and { Fn | be the corresponding sequence of distribution functions. We say that 
Xn converges in distribution (or law) to.X if there exists a r.v.X with distribution 


function F s:‘ch that as n> ©, F,, (x) > F (x) at every point x at which F is 
continuous. 


is asymptotically N (0, 1). 


We write X,, Ly or X, @ Xx. 


3. It may be remarked that convergence in probability discussed in § 6°14 
implies convergence in distribution‘(or law) 1.e., 
X2x>2xXAX -(*) 
The converse is not true i.e., Xp Ly in general, does not imply X, PY 
However, we have the following result. 
Let k be a constant. Then 
Xk > Xk a(**) 


Combining (*) and (**), we get the following result. 
Let k be a constant. Then 


Xik oo %Bk a(***) 
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$-10-1 Lindeberg-Levy Theorem. The following case of central limit 


theorem for equal components, t.e., for identically distributed variables, was first 
proved by Lindeberg and Levy. 


“If X1,X2, ..,Xn are independently and identically distributed random 
variables with 


-E(X) =p). 
vee) ot f 1, 2, ...,.7 
then the sum S, = X,; + Xz +... + X, is asymptotically normal with mean 
j= np, and variance O = noj." 
Here we make the following assumptions : 
(t:) The variables are independent and identically distributed 
(ii) E (X?) exists for alli = 1,2, ... 


Proof. LetM, (t) denote the M.G.F. of each of the deviation (X; —1) and 
M (t) denote the M.G.F. of the standard variate 


Z=(S,-p)/o 
Since 4,’ and 2’, (about origin) of the deviation (X;— 11) are given by 


wi’ = E (Xi~p1) =0, po! = E (X;—- 1)" = 03 
We have 


Se are 
Mi=(1+ t+ 71 + Bs ait 
¢ x, | 
=| 1+—ot+O(r) 
rot | 


where O (t°) contains terms with? and higher powers of t. 
We have 


Zaha (X1, +X2 + wee tXq) — 1 py . ) ew 
oO oO 


and since X;’s are independent, we get 


Mz (1) aM F -nvo ()=M = G%-w) (0) 


“11 { Mw, -w) (t/o) | = [M; (t/o) f =[Mi(t/Vn onl 
=| 1 f.0Ww™ 
2n [From (*)} 


For every fixed ‘7’,” the terms O (n7 >”) —>0Q as n-e ©. Therefore, as 
n—» ~, we get 


no rs) 


o +4 2 i 
lim Mz(t)= "im [1+ +00r")| =exp [7/2], 
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which is the M.G.F. of standard normal variate. Hence by uniqueness theorem of 
M.G.F.’s Z = (S,—4)/o is asymptotically N (0,1), or S, =X1 +X2+ ... + Xn is 
asymptotically N (uy, 0’), where = my, and o* = n 4. 

Note. C.L.T. can be stated in another form as follows: 

(i) If ¥1,X,...,Xn are iid. with mean w1 and variance of (finite) and 
Sn =X1+X2+...+Xp, then 


lim Sn- Sn Wt — 
Plas a <b|-9()-9(4 


1 7 2 
=f axe ¥72 dy (8°31) 


for—-~ <a<b<%;H(—0)=0,D(mo) a1 
or 


(i) i" Pp 08 War Gy to [720-2 


VVar (Sn) ...(8-31°1) 
or, still another form : 
~~ lim Xn—E (Xn) 
(ii) “Plas sb|=%(b)-9® (a) 
nes War (%) (Xn) | | 
i.e, iM p ast + <b |=@(b)- (a) 
7 ne @ oi/Vn ,.(8°31:2) 


Remarks 1. We wrote the CL.T. using non-strict inequalities 


Pi(as (.) sb] 
It makes no difference whether one or both are changed to a strict inequality. 
The reason is that the limit distribution function (d/f.) ® (.) is a continuous df. 
Z. Inthe binomial case, C.L.T. gives good approximationifp is nearly 1/2. 
For p near about 0 or 1, the C.L.T. approximation still holds but in that case 7 has 
to be sufficiently greater than in the case p = 1/2 approximately. 


8-10-2. Applications of Central Limit Theorem. (a) If X1,X2,... are 
itd. B(r,p) and 


Sp =X, + X2 + ...+ Xn, then 
E (Sn) = = fags = ApyeMp 


and V (Sn) = “V( EX) - V(X) = a (Pq) = "1p q 


i es 
Tence (8°:31°1) 


lim Sn—nrp ae = 
ee P| as patty =| ® (b)—P (a),0<p<1 
(b) IfY, is binomial variate with parameters n and p, then 
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lim Yn — np = _@t 
es Plas +771 OE ad ® (6) — BD (a), 0<p<1 
Proof. Let .Xi,X2,... be iid. Bernoulli variates, i.e., B (1, p), then 
Sn2Xi+X2+...+X,=B (n, p). But Y, =B (n, p) 
Hence using Y, instead of S, in (831-1), we get 
lim Yn—E (Yn) 
plas War Y, <6 ® (b) — ® (a) 


nw 
Le. lim Pias Ya~ ll asl a = D (b)-—® (a),q=1 —p 
) n— © Vvnpq ? 


(c) If Y, is distributed as P(), then 


lim } Ya-—n = 
Wit Plas ie" <6 ]-0 (6-900 


Thus, for instance 


n—e 2 


lim P(Yns)= “si i.e., >» ot Lasn+e 


Proof. Let .X1,X2,... be iad. P(1). Then 
Sp, 2X1 +Xo+...+Xi~P(n) => Y,=S, 


Y,-n < Sax 
Plas Tn Wi ba Van * 36] 
(b) -® (a) asn—> 
In particular, let us take a = —© and b =0, then 


P ast sb|=P “ag #0 \=P asm) (*) 
Also  (b) — ® (a) = & (0) — ® (-~) = 1/2 .-.(**) 


From (*) and (**), we get 
P(Ynsn)— 1/2 asn>o 

Remark. This result could be generalised. In fact, on taking @ = — © and 

be=0 in (8-31:1), we get 
Sn=E (Sn) <9 1/2 = P[S,sE(S,)]71/2 a n> 
VVar (Sn) : 

8-10:3. Liapounoff’s Central Limit Theorem. Below we shall give 
(without proof) the central limit theorem for the generalised case when the variables 
are not identically distributed and where, in addition to the existence of the second 
moment for the variables X;, we impose some further conditions. 

Let X}, X2, ..., Xp, be independent random variables such that 

E (Xi) = yi 
VX) =; 


Let us suppose that third absolute moment of X; about its mean viz., 


;t=1,2,. 
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pi=E{ Xi-wif} 3;im1,2,...,m 
is finite. Further let 


p= = pi 


ial 


ie -f = (0, the sum X =X, = X2-+ ... + Xq is asymptotically 


a fn 
N (p, 0’), where p= = yj and o%= E of. . 
tol jel 
Remarks. 1. (About Liapounoff’s theorem). If the variables X;;i=1, 
2, ..., 2 ate identical, then 


R n 
p> = = oP =A pi and o*= = of not 
i=t ied 
1/3 
n“pi pi 1 
eae 6 ae > QO asn—o, 

Thus for identical variables, the condition of Liapounoff’s theorem is satis- 
fied. 

It may be pointed out here that Lindeberg - Levy theorem proved in 
$ 8-10°1, should not be inferred:as a particular case of Liapounoff’s theorem, since 
the former does not assume the existence of the third moment. 

2. Central limit theorem can be expected in the following cases: 


(t) Ifa certain random variable X arises as cumulative effect of several 
independent causes each of which can be considered as a continuous random 
variable, then X obeys central limit theorem under certain regularity conditions. 

(it) If (Xi, Xo, ..., Xn), iS a function of X;’s having first and second con- 
tinuous derivatives about the point (111, (22, ..., fn), then under certain regularity 
conditions, p (X1, X2, ..., X,) is asymptotically normal! with mean 
@ (U1, 2, «.-5 Hn). 

(ui) Under certain conditions, the central limit theorem’holds for variables 
which are not independent. 

3. Relation between CLT and WLLN. (a). Both the central limit theo- 
rem (CLT) and the weak law of large numbers hold for a sequence |X, | of iid. 
random variables with finite mean p and variance o”. 


However, in this case the CLT is a stronger result than the WLLN in the sense 


that the former provides an estimate of the P| Sra-nyp | / ne } as given 


below: 
P| Cat ze |=P| Xn—w| ze) 
Xn-p € . 
zal o/¥n ed 


=P[|Z]|2evn/o); Z-N(0,1) 
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=1~P[|Z|<evn/o] 


-[9(S"}-2(-%")| 
o 
where ® (.) is the distribution function of standard normal variate. 
However, WLLN does not require the existence of variance (c.f. Khinchin’es 
theorem]. 


(b) For the sequence | Xn of independent and uniformly bounded r.v.’s, 
WLLN holds [c.f. theorem 6:32] and CLT holds in this case provided 


B, ne Var (Xi +Xat...+Xp n) SOL 403+... + On > as n —> ©, 


(c) For the sequence { X,,| of independent r.v.’s, CLT may hold but the 
WLLN may not hold. 


810-4. Cramer’s Theorem. We state below (without proof), a useful 
result on the convergense of sequences of r.v.’s. 


Cramer’s Theorem. Let | X,} and| Y,,} be sequences of r.v.’s such that: 


Xn =X and Y, PF, (constant), 


then" 4 2 x if c «0 


For illustrations, see Example 8-46 and Qns, 15 to 17 in Exercise 8 (g). 


Exam ple 8-45. LetX,,X2, ... beai.id. Poisson variates with parameter 
d.. Use CLT to estimate P (120 = Sn s 160), where 


Sn 2X1 +X2+...+¢Xq_3 Aw2 and n=75, 
Solution. Since X; is itd. P(A), 
E (Xj) =) and Var (Xi) 2A; t= 1,2,...,0 


E(Ss)='S E(X) enh 
inl . 


‘ ; 
Var (Sp) = Var (Xi + X2+...+Xn)= © VarXj;=nh 
ia#l 


Hence by Lindeberg - Levy CLT, (for large 7) 
Sp ~N (nd, nd) =N (p= 150, o = 150); (n= 75; =2) 


120 ~ 150 160 ~ 150 
P (120 sS,s 160) =P| a> s Zs ae 


—2-45 s:Z s 0°82); Z:~N (0, 1) 
= P(-2-45 <Z<0)+P(0sZ 50-82) 
=P (0525245) + P(0<Zs 0-82) 
Example 8-46. Let Xi; X2,...,Xn Ve iid. standardised variates with 
E (Xj) < ». Find the limiting distribution of: 


Zn=Vn [Xi X2+X3Xa+ ..  +X2n21X2n | + [ Xi + X3 +. + X3] 
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Solution. Since X;s are iid. standardised variates we have: 


E (X) =0; Var (X;) =E (X?)=1, i=1,2,..,n sae(*) 
Let Y; = X2;-1Xz3; 1=1,2,..,7 
=> E (Yj) = E (X2-1) E (Xx) = 0 (-.-.X;’s are independent) 


Var (¥) = EYE = E (X:-1.X3)) = £ (%4:-1) E (X2) =1 
Hence Y;, i = 1, 2, ..., are also i.i.d. standardised variatés. Hence by CLT 


n 
for tid.r.v.’s, |Sa= 2 Y; |, we get 


ial 
yy _Sn—E (Sn) 2 X1 X27 +.X3X4+ ... + Xn- 1Xm L 2-N(0, 1) 
"War (S,) | vn 
as n~> oo 


Also E (X?) =1 (finite), i= 1,2,...,n 
Hence by Khinchine’s theorem, WLLN applies to the sequence 
| x? |, i=w1,2,...20; so that 


Xi+X3 +... 4+XBn 
Qn 


Hence by Cramer’s theorem 


lim Un ALBEE ELLER .+Xon—1X2n] L 4 Zyo,1) 


Va= LA E (X}) =1,asn— , 


n—2 Vn Xi+X3+.. +X, 
—, lim vn [Xi X2 + Xs.Xe -+Xmn-1Xm] L LZ No, 1/4) 
das XP +XB +... + XB, 2 
[-° Z~N(0,1) > CZ ~N(0,C’)] 
EXERCISE 8(g) 


a 

1. State and prove the central limit theorem for the sum ofn independently 
and identically distributed random variables with positive finite variance under 
conditions to be stated. 

Z. State Lindeberg’s sufficient-conditions for the central limit theorem to 
hold for a sequence (Xe of independent random variables. Show that every 
uniformly bounded sequence { Xe} of mutually independent random variables 
obeys the central limit theorem. 

Comment onthe case when the random variables do not possess expectations. 

3. A distribution with unknown mean p. has variance equal to 1-5. Use 
central limit theorem to find’ how large a sample should be taken from the 
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gistribution in order that the probability will be at least 0-95 that the sample mean 
will be within 0-5 of the population mean. f 


4. The life time of a certain brand of an electric bulb may be considered a 
random variable with mean 1,200 hours and standard deviation 250 hours. Find the 
probability, using central limit theorem, that the average life-time of 60 bulbs 
exceeds 1,400: hours. 


5. State the Liapounoff form of central limit theorem. 


Decide whether the central limit theorem holds for the sequence of inde- 
pendent random variables X, with distribution defined as follows: 


P(X, =1) =p, and P (X,=0) =1-p, 
‘6. Show that the central limit theorem applies if 
(i) P(X=2k)=1, (ii) POe=*V2k-1)=1, and 
(iii) P (Xe =-0) =1-k'-7, P Xen) = 17%, where a <} 


7. IfX1,X2,X3... is a sequence of independent random variables having 
the uniform densities 


ay. (142-17), 0 <x; <2-i7} 
fii) 0 elsewhere , 
show that the central limit theorem holds. 


8. Let X, be the sample mean of a random sample of size n from 
Rectangular distribution on [0, 1]. Let 


U, = vn (Xn —+). 
Show that F(u) = ue P(U;n <u) 


exists and determine it. ! 


v2 

9. Let Xi, X2,... be a sequence of independent, identically distributed 
non-negative random variables such that E (log Xi)" is finite. Z, = (1X2... 
DOs . Show_that the positive constant-c can be so chosen that the random 
variable (cZ,)"" has a non-degenerate limit distribution function F (.) and deter- 
mine F (.). 

Ans. c=e ", F(x) = © (log x/o), p= E (log X1) and o” = V (log X1). 

10. {Xn is a sequence of t.t.d. random variables. If z is a perfect square, 


y 2 
Ans. @®(vV12 u), where ® (u) = : f et dy 


then X, is a Cauchy variate with density - : »-O<X<O, 


pe a 

+x? 

Otherwise X, bas a distribution function F (x) with mean zero and finite 
Variance o” . Discuss the asymptotic distribution of (X; +.X2 + ... +Xn)/ vn. 


11. Let | Xi , k21 bea sequence of lid, variates with 
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f @) ase ll a <x<, 
Find the:-constants a, and b, such that 
[| Xi]+ [Xo] +... 4] Xe|—an\/bn S(O, 1) 


[Indian Civil Services, 1982) 
12. Using C.L.T, show that 


n k t -1 
lim | -7 > “| 1. lim f 2_. 
hol 2 ey | 2 es | fen 


(Indian Civil Services, 1984) 

13. Let{X,,n=1, 2, ... be a sequence of independent Bernoulli variates 
such that: P(X, = 1) = p,=1—-P(X,=0), n=1, 2, .., (Qan=1—pn). 

Show that if 2pnzagqn=%, (n=1,2,...,0), then the CLT holds for the 
sequence {X,}. What happens if 2 pngn<®. (Indian Civil Services, 1988) 

14. Let Xi, X2, ..., X, be independent and identically distributed r.v.’s with 
E(X) =p; Var (X;)) = 07; (0 <0*<®);i= 1, 2, ., nand E (X;—-p)* = 04 + 1. 

(a) State weak law of large numbers. 


(b) url (Xie X44... 4X2) -e]—0 as n—> ©, find c. 


Hint. By Khinchines theoremc = EX} = 0” + (finite). 
(c) State the Lnidberg—Levy Central Limit theorem. 
-.q fim 2 1 QMi-p)?+...+M%r-wy? 2 1 
(d) Find oP re) te : <0 +o 
[Delhi Univ. B.A. (Stat. Hons.), Spl. Course 1986] 
L | EQi-wy 


Hint. pom P| spe = SAW ot 


=P[-Vn sE(Xi-ny-n 0" svn] 


-P| 12 5 [ Xi-w)’-o° | / vn s 
iol 


=P[-1sSi/vn s 1] (4) 
where Sr= = [ (Xi-nyY-0° | = x U; 
iol i=l 
where U; =(Xj- p)? —o’, i=1,2,..,73 are td. r.v.’s. 
=> E (Uj) =E (X-p)’--0° = 0-0 =0 


Var U; = Var (X;- py | = Var (X;—-p)" 


‘ 
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Be (X;-—)* = o* + 1 (Given) | 
E(Sn)= 2 E(Ui)=0; ma 2 = Var (Uj) =n 
i=} 


je 


iol 
(-.- Uj?s are tid.) 
Sn-E(Sn) Sn L 


Hen ty CLL ey va —-N(0,1)asn—7>o At) 


From (*) and (**), we-get 
MM re reg [Ps S/n x1] = P(-1sZ-si), where Z~N (6,1) 
=2xP(0sZs1)=2x0-3413 = 0-6826 


15. Let.Xi, X2, ...,X_, be iid. N.(0, 1) variates. Show that the limiting 
distribution of 


Vn (X1 +X2 +... +Xa)/(X1 + X35 +... +X2) -N (0, 1) asin > &. 
Hint. Use Cramer’s Theorem. 
16. Let.X1, X32, ..., Xx be i.i.d.N (0, 1) variates. Find the limiting distribu- 
tion of Z, = Un/Vn where 


X  X3 X2n-.! _y2, y2 ip 
Unter ont X, > Vn a XT 4X4 ...¢ Xn. 


Hint. X; are iid. N(0,1); ¢=1,2,..,2n;E (X?) = Var X; = 1 
=> (X2;~\/X2) are tid. standard Cauchy variates; i = 1, 2, .., 


— lim On = Standard Cauchy Variate = C (0, 1), 


nwo n 
(Being the mean of itd, standard Cauchy variates) 
XT +X3 Xi 
lim Vn _ ALtA2t «1 tn EEX? =1, 
n—-o pn n 


(By Khinchine’s WLLN) 
Un (Un Va) L 
(2) (a) Sew 
(By Cramer's Theorem) 
17. Let | Xn be a sequence of i.i.d° r.v.’s with mean a and variance o” 


and let { Yn| be another sequence of i.i.d. r.v.’s with mean B (= 0) and variance 
of. Find the limiting distribution of : 


one (x, = a)/Yn where. X,, a . Xi, Yn “1; = Y; 
iel 
: —E Xn —E (Xn) L . 
Hint. U,, = of ae Z ~N (0, 1) ia CLT) 
; V, = y, 2 —> E (Yn) = B (By WLLN) 
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By Cramer’s Theorem: 


lim Un _ Vn Xn-a) L a3. where Z~N (0, 1) 


n~> © Vie oY, 
lim Va Xn-@) Lo o 
= n-—-@ Y,, ear eal n {0.55 B? 


18. Letn numbers Xi, X2, ...,.X, in decimal form, be each approximated 
by the closest integer. If.X; is the ith true number and Y; is the nearest integer, then 
Uj=X;-Y;, is the error made by the rounding process. Suppose that 
Ui, Ua, ... , U, are independent and each is uniform on (—0°5, 0-5) . 

(i) What is the probability that the true sum is within ‘a’ “units of the 
approximated sum? 


(a) Ifn=300 terms are added, find ‘a’ so that we are 95% sure that the 
approximation is within ‘a’ units of the true sum. 


HInt. Reqd. Prob. ‘p’ = P x (xX:-Y) se)-P[-a2 3 2 vise] 
ie 


Now use Lindeberg Levy C.L.T. for 1.i.d. r.v.’s 

U; ~U [ -0°5, 0-5 ] with E (U;) =0, Var (Uj) = 1/12 

Ans. (i) p=2 ®(a@ V12/n ) -1; 

(ii) p=095 => Pa 12/300 Ne =0:975 = 57196 => a= 98. 


8-11. Compound Distributions. Consider a random variable XY whose 
distribution depends ona single parameter ® which instead of being regarded as a 
fixed constant, is also a random variable following a particular distribution. In this 
case, we Say that the random variable X has.a compound or composed distribution. 

8-11-1. Compound Binomial Distribution. Let us suppose that X1, X2, 
X3, ... are identically and independently distributed Bernoulli variates with 

P(X;=1) = p and P(X;=0)=q=1-p 
For a fixéd n, the random variable X = X; + X2 + ... +X, iS a Binomial variate with 
parameters n and p and probability function: 


P(X=r)= 9 ll r=0,1,2,...,7 
which gives the probability of r successes inn independent trials with constant 


probability ‘p’ of success foreach tnal. 


Now syppose that n, instead of being regarded as a fixed constant, is.also a 
random variable following Poisson law with parameterA. Then 


P(n=k)=—; k =0, 1,2,.. 


: Insucha case X is said to have compound binomial distribution. Thé joint 
probability function of X and n_ is given by 


[|  PWarQn=he=P(n=k)P(X=r|n=k) 


. . a 
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since P(X =r|n=k) is the. probability of r Successes in k tials. Obviously, 
rsk => ker. 
The marginal distribution a X is given by 


P(X=r)= 3 Pure rOn=k) 


ker 
toh "ea Ap)’ & (Ag) 
: vere 3 (Mi Ee 


GET OM Gok-ry 


oApy ag Pp 
os rt “ ri: 
which is the probability function of a P_ isson variate with parameter A p. 
Hence E(X)=Ap and Var(X) = 
We give below some of the practical situations Where we would come across 
compound Binomial distribution,:, 


1. Stppose that ithe probability of an insect laying n eggs is given by the 
Poisson distribution e~* X"7n! and the probability of an egg developing is p. 
Assuming natural independence of eggs, the probability of a total of k survivors 
is given by the Poisson distribution with parameter'A p . 


2. The probability that a radioactive substance. gives off n Beta particles 
in a unit of time is P (A),.(n = 0, 1, 2, ...). The probability that a given particle will 
strike a counter and be registered is p. Then the probability of registering‘ Beta 
particles,in a unit of time,isalsoP (Ap)... oy 

3. Ifthe probability of number of hits by lightning during any time interval 
tis P(X?) and if the probability of its hitting and damaging an individual is p © 
then (assuming stochastic independence) .the, total damage during time ‘?’ is 
P(Atp). 

8: 11-2. Compound Poisson Distribution. LetX, be a P (A) So that 

~-Aoar 
PP (X= r) a= 


;r=0,1,2,. 4 


‘ 
where A. itself is a continuous ere vaiable with’ genénlised gamma density 
er a” i* . 


eos v-1, 
g (x)= Tow" AA? 0,a>0,v>0 


0,A<s0 : 


Let us consider the two dimensional random yector (X, A) in which one, 
Variable.is discrete-and the other is continuous. Fora constant >0 and Ay > 0, 
the joint density of X and is given by 
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P(XerNdisaskith)=P(Ai srAsAthPX=rlAisrsai +h) 
Dividing both sides by fh and proceeding to the limits ash > 0, we get 
lim PW=rNAisaAsAi+h) 


os [iM P&=n[Msrsath) ; 
. lim PO sA si +h) 
h-0 h 
But 
Ru Pu = xs Ai+ h) om ae G (A1 + ~ G (A1) =G’' (A1).= g (A1) 
where G(.) is the distnbution function and g(.) is the p.d.f. of A. 
lim P&=rNMsdsmith) en a a” aye ads 
“hoo” ho rt! “Ti Pe 


Integrating w.r.to A, Over 0 to’ and sing gamma integral, the marginal 
probability function of X is given by 


P(X=r)= 


a” 
Ir (v)r! 
a a” Trev) 
POT! (ea) 
(a5) v(v + 1)(¥ #2). (vt+r—-') 


l+a (1+a)"F! 


7 fe 
7 pet CY -, dior 
a 7} (-4) ;r=0,1, 2, a0 ' 
where p=a/(1+4),g=1-—p=1/(1-+.a) . 
Thus the marginal distribution of X is 4 negative binoruial with parameters 
(v, p). 


f eo tai yrev-l gy 
Is | 


r 


& l 


7 EXERCISE 8(h) 


. I. (a) What do you mean by a compourid distribution? Obtain the prob- 
ability function of compound Poisson distribution and identify it. 


(b) What is the Compound Binomial distribution? Obtain its probability 
function and identify the distribution. 


2. IfX isarandom ake with p.d.f, 
erXy > 
f(Qx)= Tar nah? x", x20 


wlicre 7 is a positive integer, and the discrete random variable Y has a Poisson 
distribution with parameter'A, show that P (X2 A) =P (YSn). 


= 
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rx 


Hint. P(X'2A)=1- J CPE dx 


r + T (n+1): 
Integrating by parts successively, we a the result. 
3. ILX has a Poisson distribution: 


e* - 


P(X=r)= ;r=0,1,2,. 
where the parameter A ‘is a random vaiiable of the continuous type with the density 


function: f)= eo 7"). 50 a>0,v> 0, 


r ro 
derive the distribution of X. 
Show that the characteristic function of.X is given by 
by(t) = E ("= q’ (1 - pe”, where p = 1/(1 + a),q=1-p 
be [South Gujarat Univ. M.Sc. 1991} 


4. The conditional distribution of:a continuous random variable-X for a 
discrete random variable Y, assuming'a value 7 is 


dF =n (i- x)- ' dx, Osxs1. 
The distribution of Y is: P(Y =n) = =(5 Y :n=1,2,3,... 
Find the marginal distribution of X . 


a - 
5. Given (x |y)=£—> and hy) =e”, where X is discrete, l.e., x = 0, 1, 


2,.. and Y is apna ie yz: show that the marginal distribution of X 1s 
geometric, ie., 2 (x) = (1 yet, 


6. The conditional probability that the random vaniable X should lie within 
the range dx fora ves o is given by 
2 ye 
a exp ie 7 @-W) /O }4 —O<x¥<0@ 
while the probability of o itself lying within the range do is 


+ exp |-30° /05| 0d0,0<0< 
Oo" os ad ; 

where Op is a constant. Sbow that the unconditional (i.e., marginal) distribution 
of.X ‘has the following 'probability function : 


Tor exp {- Plata) a p| |,-2<x<0 | 
7, LetX~U[0, advices ~ B (n, x) ‘ix., 


P(¥=y|X=%) = ; ) (12x), y=0; i 2,..,7 


Find the distribution of Y:. Also'find E (Y) . 
Ans.’ P(Y'=ys<'1/(n+'1), y=0,1,...2 = Y~U[0/i,2, ..., a]; E Y)=n/2 
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8:12. Pearson’s Distributions. Givena set of observations from a popula. 
tion, the first question that arises’ in our mind is about the nature of the parent 
population. A vague idea is provided by the frequency polygon (or frequency 
curve) but the information is totally. inadequate and. unreliable, .because the sample 
observations may not cover the entire range of the Parent distribution: Moreover, 
an unusually high frequency in cne class, arising out of sheer chance, May 
completely distort the shape of the pROAUECY curve. 


Consequently, to determine the frequency curve, we resort to the technique 
of curve-fitting to the givenYata. Fhe failure of the normal distribution.to fit mariy 
distributions which are observed in practice for continuous variables necessitated 
the development of generalised system of frequency curves. Since a trial and error 
approach is clearly undesirable, an elastic system of frequency, curves must be 
evolved, which should incorporate, if not all, at least the most common of the 
distributions. Pearsonian system of frequency curves is one'of the most important 
approaches inthis direction, in which we decide-about the shape of'the curve on 
‘the basis of a ‘criterion x’ calculated from the sample observations. 

Karl Pearson’s first memoir dealing. with’ generalised ‘frequency: curves 
appeared in 1895. In this paper and the subsequent two papers published in 1908 
and 1916, Karl Pearson developed a set of frequency curves which could be 


obtained by assigning values to the parameters in a certain first order differential 
equation. 


Genesis of Pearson’s Frequency Curyes. Experience tells us that most of 


the frequency distributions possess the following obvious and common charac- 
teristic: ae oe .! 


‘*They rise from.a jou frequency to a maximum. frequency and then again 
fall to the Jow frequency as the variable Y increases. This suggests, a unimodal 
ae curve y=f(x) with high contact at the extremities of the range, i.e., 


2 =0 when y=0. Accordingly, Karl Pearson proposed the following-differen- 


rf equation for the frequency curve y = f (x), 


dy _ y(x-a) a | 
“de F(x) ~ «..(8°32) 


where F (x) is an arbitrary function of x not vanishing atx =a, the mode of the 
distribution. Expanding F (x) by Maclaurin’s, theorem, we get F (x).= bo-+ bi x 


+ box’ +... and retaining only the first three-terms we'get the differential equation 
of the Pearsonian system cf frequency curves as 


dy | ° y(x-a) = df (x) =f! (x)= &- AF) 
dx bo + b, x +.b,x dx po as bo'+'b;, x.+ b, x (8°32 a) 
where a, bo, b; and b2 are.the constants.to be calculated from the given data. 


Remark. Equation (8:32 a) can also be' obtained as a limiting case of 
Hyper-geomejric distribution (c,f. Advanced statistics Vol. Tl by Kendall). - 
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$-:12-1. Determination of the Constants of the Equation in Terms of 


Moments. Multiplying both sides of (8-32 a) by x" for integral n > 0 and integrat- 
ing over the entire range of the variable X say (a, B), we get 


B B 
f 2 Bot bixt box) f' (x) de = fx" (xa) fx) de 
a : a . 
=> |=" (bo + byx + by x’) f (x) | -f [ mbox" ' + (n+ 1) bi x" + (n+ 2):box"*' |f@ Ae 


6 6 
=f x"*' f(x) de—af x" f(x) dx 
Qa a 
Assuming high order contact at the extremities so that 


B 
| xF(x) | = Oie., x" f(x) > 0 asx — a or B, we get (*) 

a eee 

os Me-1 (1 #1) by Wn + (n + 2) b2 Hin +1 | = Has Wn 
(assuming that_X is measured from mean and this we can do without any loss of 
generality). Thus the recurrence relation betweén the moments becomes 


nbo Un-1[ (nti) bi-a | unt [ (4+ 2) b241) Mnvi=0, — W.(**) 


n= 1,2,3,... 
Integrating (8-32 a) w:r.tox within the limits (a, B) and using (*), we get 
(b; — a) + (2b2 + 1) = 0 et *) 


Putting m = 1,2, and 3 in (**) and solving these equations and (***) with 


the help of determinants and using wo = 1, p41 = 0, we get ; 


bow bz 4vozns—3u8) ot 4 B2—3-B1) 
2(5 woys—-9nd-6 3) 2(5 82-6 81-9) 
pe bye bet 3) _ VB (B2 +3) 
2(5 p2Hs—9 3-63) 26 82-6 Bi-9) (8°33) 


a (2-2 ps ~3. 03-6 13) = _(2B2-3 Bi-6) 
2 (5 wo ws —9 3 —6 p3) ~ 2(5 82-681 -9) 
where 2 =0", Bi = 13 /p3 and B2 = 4/3 i 


Thus the Pearsons’s system (8-32 a) is completely specified by the first four 
moments. 


8:12:2._ Pearson Measure of Skewness. 
_Mean — Mode 7 0-a 
Standard Deviation Vy2 
7 VBi (B2 + 3) 
2 (5 B2-6 Bi -9) 


Skewness = 


(8°34) 
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8-12:3. ‘‘Criterion x’’. Equation (8-32 a) can be re-written as: 


df__(e-a)de 
f bot bixtbox’ Lee) 
Integrating we get 
log (f/c) = —“V dx = | (say) 
bo + bi x+ bo x’ 


where c is the constant of integration. 
f(x) =cexp [J] 
Thus f depends on /, which further depends on the roots of the equation 
bo + bix+bx°=0 -.(**) 


Now. 


borbi xe bar aba] Pap re | 
—b, + Vb? -4 bo b> I =a “VO 4 bob aa 


nba] x- 2 b2 


_ poly OL Vbi=4 bob aay Vj 4 bo be 
71" 2 be aa V4 
ii Ze 
ms b*-d | 2b * (x D 


where K = bi/(4 bo by), (8°35) 
determines the critenon for obtaining the form of the frequency curve. 
A brief description of various Pearsonian curves for different values of x is 
given below: 
K=-@ K=0 K =1 K=a 
TYPE IV 


TYPE ILI TYPE tl 


YPE il ; \ 
oe: NORMAL TYPE VII 


CURVE 
$-12-4, Pearson’s Main Type 1. This curve is obtained when the roots of 
the quadratic equation are real and of opposite sign, Le., when <0. 
Shifting the origin to the mode x = a, the equation becomes 
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di. fF 
dx Bo +Byx + Box B2 (x + 3) (x - B) 
where Bo = + B, =b; and Bo=b). 
a 
e zz (oe) B2 a ech | 
hs both Sides w.r.to.x, we get 
log f = log (x + a) B2(a+ B) log (x ~ p) PB: Sond ee log C 
f= Cc (x + or) Bz (a+ B) (x az p)P/B2 (a +B) 


& a/Bz (a + B) . /B: (a + B) 
=> faye) (1-§] ,-asxsB 
‘ sgt") 
a ee See 
Let Q@ = a), B = a2, m, = ———— _and m= so that 
Bea, m =F aap)  ™ BR asp) 
my, m2 1 : 
ae BG) then (*) may be written as 
x\" x\" 
= 1+— 1-— > sx 
Fx)= yo) 1+ 7 4 ila ...(8-36) 


which is a standard form of Type 1. 
Determination of yo: 


Put x =(a1+4a2)Z—a, sothat dv =(a,+a2)a 
1 Amy, 2 

= Lay f EAD? mm tay” 2 (a, + a2) dz 
A av as. 


mM, + fa, + 1 


a hess -— 
at ay’ 


dz a3 
“a + anytime 1B(m,+ 1, m2 + 1) 
Remark. [t may be noted that Beta distribution is a particular case of Type 
[ distribution. 
Determination of moments : 


mene : 


a2 my m2 
, re x “ ee 
Un me [+3 } (1 = ad, 


ieit a)™ +m2z,+n+1 


at" az’ 


B(n+m,+1,m2 + 1), 


where x = (a; + a2) Z — a1 
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(a; + a2)" 


—_— 1 
a TRS p m+ 1) >B(n +m, +1, m2 +1) 


{On simplification] 
$-12°5. Pearson’s Type IV. This curve is obtained when the roots are 
imaginary or when 
Bie 4 Bo B2, te, <x <1 
1 df | x 
fae Bo+ Bix Boxe [Origin at mode] 


x 
=> — (I a ees Ee oe 
£ (log) . , BL Bo Bi 
21\** oR | *| Bo 4B 


ayy : 2 (x+y). 2¥ 
” Bo { (x49) +8° J 2 Bo[ (x+y) +8" ]  2Ba[ (x+y) +8° ] 


1 -1X+ty 


log f= sp lel ty +e LS tan 5 —— +logC 
1 
i C[ (x+y) +8" oP | B65 an 5 
can a Saeed a eres 
Put 5 ; and BS v 


2 - 
Hence f(x) = yo [1 5) e>™ (/a) . _ o <x <0, (m,v)>0 
a 
..(8-37) 
which is a standard form of Type IV with origin at th ; o| . The curve is 
2 


skew and has unltimited range in both the directions. 
Determination of yo: 


x 
toy f (165) eV 8) dy [ Putx=atan0] 
n/2 
= ayo { cos” *9e"8do= = ayo F (2m — 2, v) 
~x/2 
ee eee 
a a F(2m —72, v) 


1 x? tanec! 
Hence f(x) = aFORTTT| 1 +5 | en Vian (x/a) 
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8-12°6. Pearson’s Type VI. The curve is obtained when the roots are 
real and are of the same sign. This is obtained when Bo, Bz aré of the same sign 
or, in other words, when x > 1. 


Let —a and — «az be the roots of the quadratic equation. Then 
Cn 
dx Bo+B,x+Box? Bo (x +01) (x + a2) 

C1, 1 a2 . | 


bd ae a 


” BS (01-02) (x+01) B2(a1—a2)* (x + a2) 


log f= =———*— log (x +0 —__2_dog (x + 02) + log C 

85" Br (ai—ay 8% *™ ~ Bley —an) 8% +92) * 108 
=> fC (x + cay) 2 O22) | ip + py) 9/82 Cr w) 
Hence the probability density function is: 

my > M2 ‘ 
x x 
X)2 1+—— 1+— Z 
Emre ( | | _ 4..(8°38) 


2 my, Jn2 
where 01 = 41, &2.-= a2 and Peon *a1,a2>0 
1 2 


This equation can also be written (on shifting the origin to — a2 or — @2) as 


f(x) =yo(x-a)®x°%, asx<o ...(8-38 a) 
where f+, @e es ee 
B2 (a1 — a2) Bp (a1-— a2) 


Remark. The curve is bell shaped if g2>0 and J-shaped if q.<0. 
Determination of yo: 


= -_— q | | = 4 
1 rm fe a)” x er Puts ed 
1 a — a 92 a 
—_— —_- d 
vf Fe (7-7 "| Ua 
xy ge? at! ees ee 
yoa se (1 —z)e-ur? 
r; se l “ 
B(@2 +1, q1-Gq2. ~1) 
giao} 
Hence f (x) = x M(x-a)®,asx<o 


v B(gatl, qu-q- 1) * 

The above discussion covers ‘almost the whole range of « but in limiting 

cases We gét Simple cases. The following are:more important ofthe transition curves 
when one of the main type-changes into another. 


$-12:7. Type II. This is a transition type curve and ts obtained when 
Bo=0,B,«00rx > 20, 
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+ (og f= 


BorBa 77 me , (origin is at mode) . 


_Bix+Bo-Bo_ 1 Bo 
B,(Bo+Bix) By, By, (Bo+ Bix) 


losh= 5 5 log (Bo + Bi x) + const. 


‘= const. e™ 7 (Bo + Bix) ~By/Bi 


...(8-39) 
where — =a and —+=p 


This gives the Type III curve with origin at mode. The curve is usually bell 
shaped but becomes J-shaped when fi: > 4. 


Remark. The distribution can be transformed into the gamma form by using 


the transformation y= PB (x +a), when the curve reduces to 
a 


fO-Fo ape’ MOsy<% 


8-12-8. Type V. This transition type is obtained when the roots are 
equal, 1.e., when Bt =4BoB2 or x=1. 


7 , Br Bt 
d x +3 Bs ~ By 
— (log f) s —————__ * 
B, B, ° 
B [++35,| | 2 B2 rm 
| 1) BT 
l= 75, 8 « ms oR ws + const 
2 B2 
1 
= f = const pe i‘ e a8 eae - 
2 B2 oF 2B3( = =—2 Ba 
f(x) =yoX Po ™%, 05X<m (8°40) 
B, By 1 v 
where X [++ 35: > Bh ~q and By ae 


8-12-9. Type Il. This. curve is obtained when B, =0 and Bo, B2 are of 
opposite sign, Le, «x =0. The equation to the curves 


21" ‘ | 
feo=o 5 ,-asxsa; 1 ..(8-41) 
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1 2. Bo. 
where m 2B >0,a Bp 

with origin at mean (mode). 
8:12:10. Type VII. This curve is obtained when B,=0 and Bo, B2 aré 


of the same sign, t.e., K = 0 and Bo B2 > 0. The equation to the curve is 


27" 
je =yo| 1435 << 0 
a 


2_ Bo 1 
where a’ = B and m= — IB 
with origin being at the mean (mode). This curve is usually bell shaped, symmetri- 
cal and of unlimited range in both the directions. 
8-12-11. Zero Type (Normal'curye). WhenB, = B2 = 0, (18-33):implies 
that B} = 0, and B2=3 and we have 


(8°42) 


2 
< (log:f) “> => log f= + log C, 
where C is the constant of integration. 
f=C exp (x7/2 Bo) = C exp (-x*/2 0’), -w<x<@ (8-43) 
where Bo = — 07 and the Origin is at mean. This is the normal distribution with 


mean zero and variance 3°. 
8-12-12. Type VIII. When Bo =0, B; > 0, 


f= 1+7 | -@sx80 (8-44) 
Type IX. When Bo =0, B; <O and k <0 
l+m(. x\" 
= ={- 's0 
INES B= | eG lent ars (8°45) 
Type X. When Bo = 0 and B2 = 0, 
fayet eo" 0sx<0,0>0 (8-46) 


This is the p.d.f. of simple exponential distribution with parameter o > 0. 
Type XI. When Bo.=,.B,; =0, and k>1 

f@= pee! (m - 1)x"~" bsx<o 
Type XII. When 5 B2-6 81-9 =0,« <0 


(1+ 
m ay 


a 1 
re() er" § ae 
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Example 8-47. Show that for a Pearson distribution : 
df __(a+x)d 
f bot bi xt+byx?’ 
the saan tg function 9 wie the.relation : “ 
hore SF + (1 +2b2 +b 0) 58 @ + (a + bi + bo 8) p= 0, where 6 = 


POF sn the recurrence oa for moments. 
Show also that the cumulant generating function obeys the relation : 


2 
b 0 sa*(Se| |+(0 + 2ba+ bi GE-+ abi +b00)=0, 


Hence show that the cumulants obey. the recurrence relation : 


1+ (r+ 2) bz] rai + rby cr rb4| te eak14("31 | Kr—2, 


r-1 r—-1 
Pces + j Jie rast w+ (722 } mime} x0 


Solution. (bo + b1 x + b2 cae =(a+x)f 


7 


2 bot bixt ox) Lad (a+xpf _ 


Integrating w.r.to. x, for the total range of x, assurhing, a integrals vanish 
at oe limit, we get 


f e°* (bo +b x + bax) dh des -f & asnfde 


=> [é “bot bixthx)/] f [8et* do vueebax 


os (b; + 2 bz x) fan f e°* (a+ x)f de 


=> 0-0| meri gg thoes | bu + 260 75 |=(a0+ F6 | 


‘QO ~E(e" )= “fe mpdenf fx, 

-© [it~ 6] 
ag Ox cx 1208 : 
i fx fdx and Ta e § fdx _ 


assuming differentiation is valid under integral sign. 
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2 
= bor tel + Qby + by 0) 52 + (a+ bi + bo 0) @ = 0 (On simplification) 
/ 


Differentiating n times w.r-t.6, using Leibnitz Theorem, we get 


n+2 n+1 n 
n| oat rst |; | grr (eb OE. iti 


d’! 
+ | saret hss 0+ | TP ty || =o ...(2) 


mY 
Putting 8 = 0 and using the relation | 2 | = Un, We get 


nb2 W'n+1+ (2b2 + 1) Wao 1 +.201 Un’ + (01 +a) Un’ + nbo Wn -1 = 0 
Shifting the origin to the mean, we get 
[ (a +2) bo +1) pnsit[ (ntl) dita) pntn be pn-1=0 --(3) 
2 

= =e “ Ce) =ov ay. ay 

Now ev, ae and pre e 76 (SS 
"(py = log g) 

Substituting these values in (1) and on simplification, we get 


2 
pia, (2). dy 0) = 
b2 0 aor d0 +(1+2b2+ bi @)—7 + (a + bi + bo 8) 0 


Differentiating (4) r times w.r.to. 8 using Leibnitz Theorem, we get 


2 
a’**y, op | 48. d’7) ay 
de*2 ral Zs | dv*! ag! 


ome be m0 


1 
d’**y 
= 7 de? 13 Q’- =| 298 d9- ry A 


; d’*h. d’-? dy dy 
vita Saat Ger {248-6 | | 


+1 r 
+ (1+ 2bn+b1 0) SF 411 0 


...(4) 


Putting 9 = 0 and using the relation 
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ae. =k,, We get 


[++ 2)ba}mrore() bre erba| (7G! Rn 


[731 Jost (02 3} a%i}=0 (On simplification) 


EXERCISE 8(i) 


2. Derive the differential equation 


1 d_ x+a 
y k bo +b, X +O, 


as the limiting form of the hypergeometric distribution. 
Show that, for the Pearsonian family of distributions : 


Mean-Mode  -VBi. Bi (B2 + 3) 
SD. (582-6 Bi -9) 


2. (a) State. the.differential equation ’for the Pearsonian system of curves 
and obtain the expressions for the constants in terms of moments. Obtain Type 1 
distribution as a particular case of Pearsons’s sytem of frequency curves and 
describe method of fitting it by moments. 


(b) Describe the.procedure for classifying the Pearson family of distribu- 
tions into various types. Show that all Pearsonian distributions are determined by 
the first four moments. 


Show that ‘Normal’, ‘Beta’ and ‘Gamma’ distributions belong to the Pearson 
family. 


(c) Assign the following distribution to one of the Pearson’s types. Give 
the reasons for your answers 


2 ' 
(i) dF=Ke*? (2) -1 a2 0<x7 <0 


dt, -© <i<o, 


2 
| 


(it) ar ok (145 


3. What are the reasons for the adoption of the following general form to 
describe the Pearsonian system of frequency curves 


£f@) = (x -a){ (x) 9 


bo + b,x + box" 
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Show that the Pearsonian curves can be characterised by a single criterion 
K . Outline the various types of curves for different values of K . 

4. Obtain Pearson Type III curve in its ‘usual form‘with niode ds origin, 
from the basic differential equation of the Pearsonian system of curves and establish 
a method 6f fitting this.curve to the given-data by the method of monients. 

Hence of otherwise, show that for this-distnbution 2 2 = 3 (fi1 £ 2)’. 

5. Derive the Beta distribution as a special case of the Pearsonian system 
of frequency functions expressed by 

d (log f) _ x+a 
dx bo + bi x + bx 

6. (a) Derive Type 1 Pearsonian fréquency curve and exaniine if the 

distribution given by 


dP = yo (1 ~ yyins 9? gy, —1 <ys 1 
— 
reduces to that distribution. 


(b) xpress the constants yo, @ and m of the distribution ! ; 


2 
feopeye| 15 yr ASX <a 


in terms of its p2 and B2. J 


(c) Show that normal, gamima and beta distributions belong to the Pear- 
sonian system. 
7. Show thatthe following are. memebers of the Pearson’s sytem, of curves 
and sketch them for some typical values of the constants. 
2 
Xx 


(i) (0) * 55 ex {- >: Sa fevmexse Sen 


m 


2 
(tit) £@) =— 1 — [1-5 ,-asxsa 


a B(:,m+1) a 


i 


m 


8 2 
(iv) $() = WE me ait 55] -esxsa 


8. Show that the Pearsonian Type VI curve may be written : 
~~ , ; 


12, 
= coal =1 2 
y= Yo ted exp { --v tanh i | 
and discuss its relationship with Type IV curve. 


9. Show that for a distribution : 


VV 
lo -~-—__—___—_+ 
5 | B)= -Bo + Byx'+ Boi : 
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~ 


f. 
3t lis 


the range is unlimited in both the directions if Bo + Bi x +.B2x° has no real roots, 
limited in one direction-if roots are real and ofthe same sign, and limited in both 
directions:if the roots are real and of opposite sign. 


10. Investigate the properties'and shapes which may be. assumed: by the 
frequency curve y = f(x) which has the differential equation 


“*- : : 1 dy. 2 mx 
y dk a — x? ros 


and obtain the probability integral. 


Hint. @ (log y) ep => logy =m log (a’ —x’)+ log C 
ne de ex | 
ie : ae : 
yak 1-5] ;-asxsa 
a 
which is type I distribution. me ya 
‘11. A erenuly of. distributions is defined by 
1 df_ x 


fds. bot by tax 


and the frequency function f = f (x) vanishes at the terminals of its range. Show that 
the moments about the mean are given by 


bo (2s + 1) pas bz (25 + 3) Mase2 + Da (25 + 5) Masa 4 = — Maes? 


. 8-13. Variate Transformations. Let T by any statistic which is ,asymp- 
totically normally distributed with mean @ dnd vaiiaiice wp (8), where w (8) is 
some function of the pardmeter 8 i.e., T~N (0, w (6)). Let us ‘transform T by a 
function g. as g (7). where g isa function which possesses first order derivative 
which is continuous and g' (6) «0, where (’) denotes differentiation w.r.to. the 
parameter 8. Then g(T7) is normally distributed about mean g (6) and variance 


[e’ (0) }. y (0), ie,, ; 
g(T)~N[ 2g (8),{ 8’ (8)} y)], ‘| 8-47 
asymptotically, provided g’ (6) «0 is continuous in the neighbourhood of 9. 


In. general Var (T) = wp (6) will be dependent on the parameter 6. We are 
interested in obtaining a function g such that the asymptotic variance of the 
transformed statistic g (7) is independent of 9, i.e., it should be constant. In other 
words, we want g such that 


Var[g(D ]=[¢' (©) Fw @)= constant = <2 (say) 


[ 8’ ) I= 6y 


Integrating both sides w.r.to.8, we get "| 


L.e., ‘ss 


Cc L.: a | 
0) = | —=~ do 
2O-\Te b (8°48) 
$-13-1. Uses of Variate Transformations. As discussed above, when we 
transform a statistic T toa function g (7), the distribution of g (T) is approximately 
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normal and its asymptotic variance is independent of the population parameter 8. 
Hence the use of statistic g (JT) gives better results and confidence intervals than 
the original statistic 7. The commonly used transformations are : 

(1) Square Root Transformation. 

(2) Sine Inverse or sin’ * Transformation. 

(3) Logarithmic Transformation. 

(4) Fisher’s Z-Transformation, 

In thé following sections we shall discuss these transformations briefly. 

813-2. Square Root Transformation. Square root transformation is a 
transformation for the Poisson variate. If a variable X follows Poisson distribution 
with parameter dX. (assumed to be large), then we know that asymptotic distribution: 
of X is normal as (A 00) with E (X) =) and Var (X) = A=w (A), in the above 
notations. Then (8°48) gives the acon 


8 )=f Fe =dh=2c Vi. 


Now we select c in such a way that 2c = 1 
ie., c=1/2 so that g(A)=vA. 


Hence the transformed variable is g (X) = VX . Using (8-47), the transformed 
variable VY has the mean 


g (x)= VR 
and Var (VX) =[9g’ (A) -y (A) 
es 
se) 
«1/4, 
or alternatively 
Var (VX) = constant = c? = (1/2)? = 1/4. 
Hence VX ~N (Vi, 1/4), asyniptotically. | (8°49) 
Ancombi has.suggested the transformation VX + 5 , where b is a constant 
Suitably chosen. 


$-13-3. Sine Inverse or sin! Transformation. Sine-inverse is the trans- 
formation for stablizing : the variance of a binomial variate. If‘p is the observed 
proportion of successes ina series of n independent trials with constant. probability : 
P of success for each trial then we know that the asymptotic distribution of p 1s 
asymptotically normal (as i> ©) With E (p) =P and 


Var(p)=PQ/n, Q=1-P 
Le., p~Nn tars }as n—> ©, 


In the usual notations 


y(P)=22 2 E-*) a | 


Using (8°48), the transforming mine 
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OS eRe cvn f = 
= 2c Vn sin’! (YP) 


east 
’ apsin (VP) 


we P) 


1 
2vP.v1-P 
Choosing the constant c so that 


1 ce? 
2cVn=1 => C= ae: 


we get 
g (P)= sin” ' (VP) 


Hence the transformed statistic is g (p) = sin”! vp. Using (8-47), g (p) has 
mean sin | VP and 


Var (sin! vp) =[ 8’ " ae 


oe P) 
WEA —-P 
: ag 
“an 


or Var (sin! Vp) =c* = sz\- = — = constant. 


Hence sin’ ‘Vp ~N oP), asymptotiically. (8-50) 
/ 
Ifr is the observed number of pane inn trials so that p = r/n, then 


Ancombi has suggestéd that instead of sin” ‘Vp. ='sin"' Vr/n , the transformation 


should be sin7! ¥V a : 


+ 3/8 
8-13-4. Logarithmic Transformation. Log transformation is the trans- 
‘formation for stabilizing the variance of the distribution of sample veriance. If s 
is ore sample variance in a simple of size n from normal population with variance 
, then the sampling distribution of s is asymptotically normal (as 2 —> %) with 


, 2 20° . 
E(s‘)=o and var(s’)= : (for large 7), 


(c.f. Remark to Theorem 13-5], In the usual notations we have. 
w (0”) =20°'/n, Using (8-48) , the transforming function is 
S29 Cc vn 2 +¢ vn a 
O°) = do‘ = log o* . 
We select c in such a way that ; 


cvn 


sothat g(o’)=logeo”. 
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Hence the transformation for the statistic s’ is g (s”) = log s’ and using 
(8-47) the transformed statistic is normally distributed with mean 


g (0”) = log o” and 5 
| | 2 
Var [ g (s*) ]={ 8" (0°) | -p (0°) 
1) 20% 
Qe 
2 
“hn 
2 
or Var [ g(s*) ] = Cc (Te ) i 
Hence loge s’ ~N| loge a", 4h a ae ay 


$-13-5. Fisher’s z-Transformation. This transformation is suggested for 
stablizing the varaince of sampling distribution. of correlation coefficient (cf. 
chapter 11). Ifr is the sample correlation coefficient in sampling from a correlated 
bivariate normal population with correlation coefficient p then the asymptotic 
distribution of r, as n —» © is.normal with E (r) = p- and 


* 242 
Var (r) = (apy = w (p) for large n. Using (8°47), we get 


2 
We select c in such a way that 


vnce=1 => c=1/Vn, 
so that g (p) = 5 loge a ; 


g(o)=f edo =F tose 728 | 


‘a 
Hence using:(8-47) , the transformed statistic 


g(r)= ; log. ber , which is denoted by Z, is normally distributed with mean 


1+ | 
g(p) == $loee( 522 arid 


Var [g (7) ]=c?== 
or Var [ g(r) ]=[28' (p) I’ y (p) 
[47 Gxe¥ 
Fede: 


= * for large n 
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Hence z= 5oee| = | | tt . oo , for Jagge ae. 


Prof. R.A. Fisher proved that the transforined statistic Z = g (7) ts normally 


distributed even ifm is sniall and that for exact,saniples (small n), 


1 1+ 1 
Z-N] Sloe ef | 


p'n-3 


For the various applications of this transformation the reader is referred to 
§ 14-7-2. 

1 1+r es | 

Remark. We have: Z =>; loge ae be tanh (r) 

Hence Z-transformation is also called the tan-hyperbolic-inverse transfor- 
mation. 

8:14 Order Statistics Let X,, X2, ...,X, ben independent and identical- 
ly distributed variates, each with cumulative-distribution,function F (x) . If these 
variables are arranged in ascending order of magnitude and then wnitten as X(1) s 
X(2) Ss... SX(n),_ we call Xi as the rth order statistic, r = 1, 2; «., 7. The Xqy's 
because of the inequality relations among them arc necessarily ‘dependent. 

Remark. If we write these ordered values‘as 

Yi s Yos...<5 Yn, then: 

Y, = Xi) = rth smallest of Xi, X2, ..., Xn 

Y, = X(1) = The smallest.of Xi, X2, ..., Xn 

Yn =X(n) = The largest of X1, X2, ..., Xn 

8:14-1. Cumulative Distribution Function of a Single Order Statistic. 
Let F, (x), r= 1, 2,...,. denote the c.d.f. of the rth order statistic Xi). Then the 
c.d.f. of the largest order statistic X(n) is given by : 

: Fy (x) = P (Xn) 5 x) = P (Xi sx 50 = 1, 2, ..., 2) 
=P(X,sxNX25xN... NX, sx) 
=P(Xis x) . P(X2 sx)... P (Xn sx) (-...X; ’s are independent) 
'=[F(x)], (8°53) 
since X), X2, ..., Xp, are identically distributed. 
The c.d.f. of the smallest order statistic X(1) is given by : 
F (x) = P (Xa) s x) 
=1—P(Xi1)>x) 
=1-P[X;>x;t=1,2,...,2] 
n n 
=1-M P(Xj>x)=1- N[1-P(Wsx)] 
iel i=l 


=1-[1-F(@)N, .-(8°54) 
since X\, X2, .... Xn are i.i.d. rv’s. 
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In general, the c.d.f. of the rth order statistic X/,) is given by: 
F (x) = PO) Sx 
= P [At least r of the X; ’S are < x] 


= > P [Exactly / . the n, A; ’s are Sx}, 
j=r 


; 2 ({n\_, os 
- >. ( i Fi (x) [L- F(x)", (8°55) 
jar J | 
by using Binomial probability model. 
Remarks, 1, (8°55) can also be written as [See Remark 2 to Exainple 7-23): 
FQx) = Ip gy (rn r+ 1), (8°56) 
P 


- I “1 b-1 
where I, (a, b) = B (a,b) | 1° (1-1) dt (8°56 a) 
0 


is the ‘incomplete Beta Function’ and has been tabulated in Biometrika ‘tables 
by Pearson and Hartley. 7 

(8:56) and (8-56 a) show that the probability points of an voraet Statistic can 
be obtained with the help of incomplete beta function. 

2. Taking r = | and r = n in (8-55), we get respectively: r 


F\(x) = 2 G F? (x) [1-F (@)]"" 
= 


Ind(" F! (x) [1 F (xyy"/ I. 


1-[(l-FQ@]" (8°56 5] 
and F(x) = F" @), ...(8°56 c) 
the results which have already been obtained in (8-54) and (8°53) respectively, 

8-14:2. - Probability Dénsit¥ Function (p.d.f.) of a Single Order Statistic. 
The results in (8°53) to (8-55) are valid for both discrete and Cotitinuous r.v.’s. 
We shall now, assume that X;’s are i.i.d. continuous rv.’s with p.d.f. f(x) = F 
(x) . If f- (x) denotes the p.d.f. of Xr) then from (8°55) or (8°56) we get: 


] e 


d ad . 
SG) = SAF @)= Ura) Con +) 


F(x)’ 
d l Ss 7 
ee) l-f n ’ dt a 
~ de} Br, n—-7 +) | Cot) | ...(8°57) 


Let us write 


gin= [U0 d= (De -0" es 
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F (x) 


(x) 
=> fr (1-9 'd=leol Te FG) -8 0) 
fe 
=> — ra t-" (1 —1)"~" dt = g! (F(x). f(x) (-.: g (0) is constant) 


=(F@J'[1-F@] ~" f(x) [ Using (+) } 
Substituting in (8-57) we get: 


-; 1 r-l ; 2s n=-r . 
r (x) PUnsrel ‘Gr F(x)) .f(@) :..(858) 
Aliter. By definition of a p.d.f. we get: 
e lim P[x<Xpysx+5x] 
f= in | (8°59) 


The event E: x < Xi) s.x+5x can materialise as follows: 


a ead 1 aa n-r 


Xx  XX+8x - 
: X;sx for (r—1) of the X;’s 
and x<X;sx+8 for one X; 
and X,2zx + x for the remaining (n —r) of the X;’s 
Hence by the multinomial probability law we have: 


ni} 1 
Gait iwaylPt eS 
where pizP (Xisx) =F (x) 
pr= P(e <Xjsx+5x) =F (x +5x)-F (x) 
and p3=P(X;2x+5x)=21—P(Xjsx+5x) =1-F (x +8x) 
Substituting in (8-60), we get: 
lim P@&<Xy<x+5x) 


P(x <Xpysx+5x)= .».(8°60) 


b= 5,49 dbx 
1 . prn-by: lim F (x +5 x) —F (x) 
~ B(r,n—r+ Tian ‘ O) x 59 a 


oe of 1—-F(x-+x)]"” 


1 
. B (r,n—r +1) 
as in (8-58). 
8-14-3. Joint p.d.f. of two Order Statistics. Let us denote the joint p.d.f. 
of Xr) and Xs), where 1s r<ssn by frs (x, y). Then, 


.F’"'() .f(@).[1-F@)]"" 


lim ?P' pa Dex ROMS ROSY TO) 


rs (X, y) =5x +0 
frs (x, y) “oe: Sxby 


...(8°6) 
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we 


The event E ={x< Xi) sx+dx Ny <Xys y = dy} can materialise as fol- 


a amma [ti }~——-s-r-1 mi k#—— 0- — 


MX X 58x Y+S8v 


Jows: 


X:<x for r—1 ofthe X; Ss, 
x<X;sx+5x forone X;, 
x+5x<Xj,sy for (s—r—1) of X;’s, 
y<Xisy+5y for one Xi. 
and X;>y+y for (n—s) of the X/’s 
rience using multinomial probability law, we get 
EM ah | pakgeey ss nN permanent 
n} <liie 
"GoDliteerstyiit@csy it Pes” a 
: ...(8°62) 
where 
pie P(X; <x) =F (x) 
P2= P(x <X;<sx+5x) =F (x +5x)-F (x) 
p3=P (x+8x<Xisy)=F(y)—F(x+58x) 
pPa=P(y<Xisytdy) =F (y+5 y)—F(y) 
Ps=P(X;>y+Sy)=1-P (Xisy+dy)=1-F(y+dy) 
Substituting in (8-62). and using (8-61) we get: 


n! ae ii [ F (x +5x)—F (x) ] 
~ (r=1) $($—r-1) 1 (n- sine OC) * oro dx 


» tim aa ae |xtp0 bins 


xe LFO)-F@+dx)]" 


ni F’"' @). fd. [FO)-F OT £0)-[1-F oI 


 (r-1)'(s-r-1)!(n-s)! 
(863) 


8:14-4 Joint p.d.f. of k — Order Statistics. The joint p.d.f. of k — order 
Statistics X(r,), X(r;), Xr) Where 1 sr) <r2<..,<msn and 1sksn is for 
X1SX2<...<x_ given by [on using the following configuration and the multi- 
nomial probability law as in § . 14:3}: 


a aoa | acme cae Rue mes es 


xy K,#SX, x9 X%2#6X5 ay X3+8X3 


1 a. ee 


Xk Xk+BxK 
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ni 
(n=-1!(2- rj—1)!.. eae r-—1)!(m—r,)! 
x FM! (ny) x f (ry) x x | Fe) F(x) ]" — x f (x9) 
x [F (xs) ~“F () if wf (03) Kx fx) {1- F(a] (864) 


8:14-5. Joint p.d.f. of all n - Order Statistics. In particular the joint p.d_f, 
of all the n order statistics is objained on taking k = in (8-64) . This implies that 
r=i fort=1], 2,...,. Hence joint.p.d.f. of Xi), X(2); ++, X(n)_IS given by: 

fir yn (x1, x: X2y vey Xn) =! f (x1) f (x2) «. f (Xn) ...(8-65) 

Aliter, We can easily obtain (8-65) by using the following configuration: 


Po 1 be ch ar ’ fe 0 ’ [-+-—— 0 ——+| 1 = 
Xy X44 8x, X27 Xot8% Xz Xgt Bx, Xp Xp rt bXy 
and the multinomial probability law as in § 814-3. 
$-14-6. Distribution of Range and other Systematic Statistics. Let us 
obtain the p.d.f. the statistic Wrs = X(s5) —X(p) 5 7 < S. We start With the joint p.d.f. of - 
Xr and Xs) given in (8-63) and transform [ X(y, X(s) | to the new variables Wrs 
and X(r) S.t. 


Sr. 12, 2 2y of (x), *2; se » Xr) = 


=y-x; X=xX Sty y=xX+w,s and X=x 
_ 9% y) =1 => |J|=1 
dQ, Wrs) 0 1 


The joint p.d.f. f,;(x,y) in (8-63) transforms: to the joint p.d-f. of 
Xr) and W,s as given below: 


8 (X, Wrs) = Crs. PF (x). f(x) Lee Wrs) — ~F (x) 7 
x Poe Wnt I Peay” .. (8-66) 
_ on! 

(7-1)! (s~r+1)! (ns)! aay 


Integrating (8-66) w.r.to. x from —%° to %, we obtain the p.d.f. of W,s as: 


-1 


where C,; = 


2 (Wrs) = Crs f { Fx) fo) | Fe + Wrs) - F (x) ine Sf (x + Wrs) 
[UFO + Wn) Tt? dx ...(8-68) 


‘Remark. Distribution of Range W'= X(q)—- Xa). Taking r=1 ands=n 
in (8-68), we obtain the p.d.f. of the range W = X(n)—X(1) as: 


g(w) =n(n—-1) f f@) [F(c+w)-F(x) "flit w) des w20- 
- 0 (8°69) 
The c.d..f of W is rather sir:ple as given below: 
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G iw) = P(Wsw)= fg (u) du 
Os 


=f n(n el) f FOL F @ tu) Fa) "Of (+ u) de bed 
C -@ ’ 


q ~ 


=. nf f(x) f@r- 1) f(x +u) (Fi +0) =F 0X)" 2 du b de 
~ 0° ' 


en ffl Fa+w-F i] de 


Example 848 Let X,. X32, ....X, be a random sample from a population 
with continuous density. Show that Y; = min (X1, X2, .... Xn). is exponential with 
parameter nb if and only if each X;_ ts exponential with parameter d. 

Solution. Let X;_ be i.i.d. exponential variates with parameter A and p.d.f. 


f(vy=hE* 20, ~>0 : AE) 
F (i)=PXsaye f fwduar fe duet-e"* ...(ii) 
Distribution function G( .) of Y; ah (X1, X2, ...,.Xn) is given by: 
Gy.) =P(Yisy)=1-[1-Fiy)] : [ From (8-54) ] 
-1-[1-@-e* ] a1-e" iii) 
[From (1)] 


which is the distribution function of exponential distribution with’ parameter 
nh. Hence Y; =min (Xj, X32, ..., Xn), has exponential distribution with 
parametern aA. 

Conversely, Let Y: = min (Xj,-X2, ..., Xn) ~ Exp (7 A) so that 


P(Yisy)=1-e""* = P(Y;zy=e""* iv) 


=> P min (X1, X9, ..., Xn) zy] wen. 

=> P (Xx; 2y) VC 12y) (Vike (X:, 2 y) = eo nhy 

= I] P (X, 2y) ‘= ett ¥ 
i=] 


G (Xi 2 y) ] mete (+. Xi’sare itd. | 
= P(Xizy)=e"" 
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=> P(Xisy)=1-e" 
which is the distribution function of Exp (A) distribution. Hence X;’s are iid. 
Exp (A). 

Example 8-49. For the exponential distribution f (x) = e&* ,x20; show 
that the cumulative distribution function (c.d.f.) of Xin) in a random sample of size 
n is F, (x) =(1-e7*)" . Hence prove thatasn -» &, thec.d.f. of Xin) — log n tends 
to the limiting form exp [ — (exp (—x)) J, -- <x<™. 

Solution. Here f(x) =e" ,x20; F(x)=P(X¥sx)=1-e” ..(#) 

The c.d.f. of Xin) is given by [From (8°53)] 

Fx @)=P[Xqsx]=[FQ)]'=(1-e°)" — [ From(+)] ... (+8) 

The c.d.f. Gn(.) of X(ny—log n is given by: 

Gn (x) = P [ Xe) — logan sx] 


=P[ Xn sx+logn] 


a [1 ee sren [ From (#4) | 
n n 
; +1" 
lim lim a (ee _o% 
Gn (x) = a-$ | exp[ e 


Bes fos “2 
n 


ane @ 


Example 8:50 Show that for a random sample of size 2 from N (0, 0°) 
population, E (X1y) =— o/Vx (Delhi Uniy. M.Sc. (Stat.), 1988, 1982] 
Solution. For n = 2, the p.d.f. f; (x) of Xi) isgivenby: [ From (8-58) | 


. 1 . 
AQ)= Fa a l1-F @) ISG) =2[1-F@)] fe) 5-2 <xee 


where f(x) = #20 [¥ - , 09] 
2 o 
“ E (Xw) =fx.fr@) de=2f[1-F (x) .xf@) de mA) 


We have: log f (x) = — log (V2 x 0) en 
oO 


Differentiating w.r.t.x we get: 
f'@) x 


fo? 


= Sxf@) de=- 0? ff’ @)de=-o' f() _Aii) 
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‘Integrating (i) by parts and using (ii), we get: 


EXw)=2. [[1-F@)1Co°f@)} -2f Co fO) FQ) ax 


=-20°f [ f(x) | d= —* f etl? dy 


= aye) | Je aad 
=—oO/Vx ; 7 


Example 8-51. Show that in odd samples of size n from U [0,1] popula- 
tion, the mean and variance of the distribution of median are 1/2 and 
1/[ 4(n + 2) ] respectively. 

Solution. We have: f we 1; 0sxs : 


F (x) =P (Xsx)= fu) dum du =x 


Let n=2m+1 (odd), where m is a salle integer = 1. Then median 
observation is Xm 41). Taking r = (m +1) in(8-58), theip.d.f of median X(m 4 1) is 
given by: 

1 


fm+1 (x) = Bim-+i,m+1)- 2" (1 -x)” 
1 va m 
Enea med Se Os) dx 
_B (m +2, m+1) 
B(m+1,m+1)- | 
_ T(m+2). Tims 1) ['(2m + 2) 
~  P(m+3) * Tim+ 1) F(m+ 1b) 
ort as (On simplification) 
1 
Eins =f 2 fms (@) dre es rs S 221 a" dy 


z (m+3,m+ 1) m+2 

Bit, m+1)_ nee) 
. Var (X(m + 1) = =E (Xin 1))- [EXims 1) I 

_m+2_ 1. 1 

2(2m+3) 4 4(2m+3) 4(n+2) 

Example 8-52. Let Xj, X2, ..... Xn be i.t.d. non-negative random variables 
of the continuous type with p.d.f. f (.) and distribution function F (.). 

IfE|X|<, Show that E|Xy)|<&. 
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(b) Write M,=Xin) = max (Xj, X2, ..., Xn). Show that 
E (Mj) =E (Mn-1) #f F*"' (x) [1-F @) Jaen =2,3,... 
0 
(Delhi Univ. B-Sc. (Stat. Hons:), 1999) 
Hence evaluateE (M,) if X41, X2, ...,X, have common distribution function: 
F(x) =x; O0<x<1. 
Solution (a2) E}Xp|= f |x]. fe (x) de 
a 0 
LG - X is non — negative continuous t.v:) 


- fia. Topiary OF OL - Fa) Pr ae 


< v(t). flxtre as 
ene i)é |X| a % 
Hence E |Xq|<0 ifsE|X[<o. 


(b) The p.d.f. f, x), of My=Xn) is given-by: 
fn(x) =n[F (x) J". f@) 


E(M,) =" i X fa (x) de (-X20,a5,) 
: EM =H retires f(x) dx 
Integrating by parts we on 
E (Men, im | x ; ——- @) 4. “ 
Qe ? 0 A 
- = srre)- [Frees 
0 


! 


cs 
"° 


lim | fa oi () : i) de+aF" (a). 


a @® 
0 
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z ae f (1-Fren) ene eatr 
0 . 
ee f (4-F" @) dea (1-F" (a), 
0 
Since E M, exists, (By part (a)), 
a. P(Mn>a)=a[1-P (Mqs.a)]*a[ 1-F"(a) | 
—> 0 as a —> &, 
E(My)=i™ | f (1-F"G)) de=f (1-F"@)) ax ~(#) 
6 0 


tl 


il 
Ooty BF Ooty 8 oS 


( 1—F"'(y).F (x)) dx 


[1-FT(y[1-G -F @))} ] ae 


( = ae (x) dv +f F"-'@) [1-F (x) Jdy 
‘ d ; 


=E(Ma-i\ +f F’o' (w) [1--F (x) J dy | From (« | ..0##) 
0 N 


I¥~V[O,1], then © 5 
f@)=1;,0<x<1 and Fear; 0<x<1 
Substifuting in (**), we get: 


I 
E (Mn) —E (Mp-1) = J x"°' (1x) de 
0 ‘ 
1 


= E (My) ~ E'(My-3) 32 = 


ntl (#**) 
t 
Changing n ton--1,n-2,...,2,1 in (***) we get respectively: 
: a | 1 
EB (Mn— 1) E (Mn -2) 5 


E (M2) —E (A1)) = [ 


E (M,)'- F (My) = 1 a4 a 
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Adding (***) and the above equations and noting that E (Mo) = 0, we get: 
1 n 

cia mae ae 

Example 8-53 (a) Find the p.df. of Xi) in a random sample of size n 
from the exponential distribution: 

fx)sae**a50, x20 
(b) Show that Xp) and W,; = Xs) —X(p), 1 < S, are independently distributed 
(c) What is the distribution of Wi = X31) -—X(r) ? 
x 


Solution. Here F (x) = P(X s x) =fio eo 8" dy=1—-e%* ..(i) 
0 


The p.d.f. of Xi) is given by: 
ij. 5 CF@)TL1-F@r’ se) 


B(r,n-r+1 
1 —-ax i -ax(n-r) -ax 
Baca: (ire as pot 
= 1 ~ax(n-r+1) _ grax je 
B@nerat) 8 [1 e ;x>0 
(b) The joint p.d.f. of X~) and W,s = X(s)-X(r) is given by [From (8-66)] 
- -r-1 
& (X, Wrs) = Crs FF’ (2) £@) | F (e+ wis) -F GT : 


xf (X + We) [1 - F(x + Wee) | 


n! (n—r)! Ser -ar 
‘Caan Gare sie 


s-r-1 n-S$ 
se ferent | xa en kt Mn) x [eer] 


r-1 
‘|For eee (tet) | 


a es ~(n-541) 0% aveeeren|| 7 
od cecrerrsy i (1-e wit 


=> Xi) and W,;, are independently distributed. 
(c) Takings=r+1in (au), the p.d.f. of Wi =X(-41)—Xry) becomes: 


1 -al-rm 
Z (w) BaLn-n "ss 
=(n-roa.e "98"; wi 20 


which shows that W; has an exponential distribution with parameter (nm —r) a. 
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EXERCISE 8 (j) 


1) (@) Obtain the distribution function and hence the p.d.f. of the rth order 
statistic X~) ina random sample of sizen froma population with continuous 
distribution function P(.). Deduce the p.d.f.’s of the smallest and the largest 
sample observations. [Délhi Univ. M.Sc. (Stat.), 1987] 

(b) Let Xi, X2,...,Xn be a random sample of size n from a population 
paving continuous distribution function P (x). Define the rth order statisti¢ X(-) and 
obtain its distribution function and hence its p.d.f. 

[Delhi Univ. M.Sc. (Stat.), 1983] 

2. Define rth order statistic X(-). Obtain the joint p.d.f. of Xr) and X(), 
r<s, ina randonrsample of sizen froma population with continuous distnbution 
function P(.). Hence deduce the p.d.f. of sample range W = Xin) — X(1) . 

(Delhi Univ. M.Sc. (Stat.), 1988, 1982] 

3. Obtain the distribution function and hence the p.d-f. of the sinallest 
sample abservation X(1) Ina random sample of size n from a population with a 
continuous distribution function F (x). Show that for randoni sample of size 2 
from normal population N (0, 0’), E (X(1)) = -—o/Vt 

[Bombay Univ. M.Sc. (Stat.), 1992] 

4. Let X,X2,...,Xn be nm independent variates, X; having a geometric 
distribution with parameter pj, L.e., 

P(Xj=xi) =F! pis i= 1—-paxi=1, 2,3, ... 

Show that X(1) is distributed geometrically with parameter (1 -— q1 q2 ... Gn) 

[Delhi Univ. M.Sc. (Stat.), 1983] 

(b) Let Xi, X2, ....Xn ben independent variates, X; having a geamcetnc 

distribution with parameter pj te. 
P (Xj = xj) = qi! -Pis G=1-—pi, xi = 1, 2, 3, ..., 

Show that X(1) is distributed geometrically with parameter (1 - q; 92 43 
wee Qn) 

5, Forarandomsample ofsizen froma continuous population whose p.d_f. 
p (x) issymmetrical at x = 4, show that 

fr (u +x) =fn—r4i (U—x), 

where f,(.) is the.p.d.f. of X¢). 

Hint. f(w+x)=f{(u— x) 

F(w+x)=P(X¥sp+x)=P(X2p-x) (By symmetry) 
=1-P(Xsp-x)=1-—F(p—x). 

6. LetX}, Xo, ....Xn bea random sample ofsize n froma population having 
continuous distribution function F (x) . 

Define the order statistic of rank k, 1 sk sn. Find its distribution function. 

Show that for the rectangular distribution 

f()=1/82, % —5 92 sx< 0; += 02, 
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7. Let X;,X9, ....Xn be a random sample with common p.d.f. 


foto O<x<1 


0, otherwise 

(i) Find the p.d.f£., mean and variance of X(1) . 

(it) Find the p.d.f., mean and variance of X(q) . 

(ii) Find Corr. (Xv, X(n)) - 

Ans. (i) fi (x) =n(1—-x)""), 05x515E (X= 1/(n+1) 
Var (X(1)) = n/[ (1 + 2) (2 + 1)’ ] 

(ii) fax) =nx’': 05x51; E(X%q)=n/(n4+1) 
Var (X(n)) = n/]- (n+ 2) (n + 1) J 


ee e Cov (Xa Xn)) 
Hint. ; - ov Xa, Xm) _ 6 ¢ Chapter 10 
(iii) Hint 7 (Xa, Xm) = Way. Var Xen (c.f. Chapter 10] 


y 1 y 1 
E (Xu .Xm) =f f xy fin (x, y) de dy =n(n—-1) f f xy (y-x)"~* dea 
0 0 00 


(fin, y= 2-DS@)LF O)-F @) TSO): 0sx<ys1) 
n-2 


yl 
. aac all ue) dx dy 


=n(n—- vf p> v"*1 91 -1)""? dt dy; S| 


=1/(n+ 2 { On simplification ] 
Cov (Xu, Xn) = 1/[ (n+ 1)*. (n + 2) | 
Corr. (X(1), X(n)) = 1/1 


8. Show that the c.d.f. of the mid-point (or mid-range) M = 5 (X(1) + Xn), 


in a random sample of size n from a continuous population with c.d.f. F (x) is: 
m 


F (m) = P(Msm)=n f [ F (2m-x)~F (x) "~" .f (x) de 


—-@ 


9. LetX; (i= 1, 2, ..., 2), be i.id. non-negative .v.’s of continuous type. If 
My = Xn) = Max (X1, X2, ..., Xn), and E (|X !) < oo, then prove that 
E (My) = E (Mn-1)+ f F"~' (x) [1-F (x) ] de 
0 


{Delhi Univ. B.Sc. (Stat. Hons.), 1990] 
Hence find E (M,) if X;’s are i.i.d..exponential variates with parameter A. 
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Ans. E (M,) => beg tg tty | 


10. Show by means of an example that there may exista r.v.X for which 
E (X) does not exist but E (Xi) exists for some r; 


Hint. Let Xj, X2, ...,X_ be a random sample from the poplin. with p.df- 
fix =, l<x<0; F(x)=1 = 
x x 
E (X) does not exist, but E (Xj) exists foranyr<n. 
11. LetX}, X2, ....X, be arandom sample of size” froma population with 


p.d.f. f@)=1, O<x<1 
=(), otherwise 
Show that XY, =XiV/X 27, Y2 = X(x/X 2), «5 0s 


Yn-1=X(n~1)/X(n) and Y, = Xin) are independently distributed and identity 
their distributions. 

Hint. Proceed as in the hint to Q. No. 18 and 19. 

12. LetX), X2,X3 be a random sample of size 3 from exponential distribu- 
tion with parameter A. Show that Y; = X(a) —X(2) and Y2 = X2y are independently 
distributed. 

Hint. n=3,; write joint p.d.f. of order statistics X(2) and Xia) and then 
transform to Y; and Y2. 


13. Let X), X2, ....X2m+1 be an odd-size random sample from a N (p, o*) 
population. Find the p.d.f. of the sample median and show that it is symmetric about 
u, and hence has the mean p. 


14. A random sample of'size n is drawn from an exponential population: 
piane*®; 6>0, x20 


(:) Obtain the p.d.f. of X() . 
() Show that Xi) and W,s = Xs) Xp, r<s are independently 
distributed. } * 
(wt) Identify the distribution of W; = X41) -X() 
(Gujrat Univ. M.Sc. (Stat.), 1991] 


15. Let X;, Xo, ...,X, be a random samp!e of size 7 from a uniform 
population with p.d_f. 


1, if O<sxs1 
f(x) = 0, otherwise 
Show that : 
(a) Xe isaBi(r,n—r+1) variate. 
(b) W,; = Xs) — Xi) also has a Beta distribution which depends only on 
S—rand not ons and r individually. 


= 1 -r-1 -s+r 
Ans. uss wil a n-str, 
ns fovrs) B (s =e r, Aes 1) Wrs (1 Wrs) ? () s Wrs s 1 
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16. In a random sample of size 2 from uniform U[ 0,1 ] population, 
obtain the p.d.f. of Ws = X(5) —X(r) and identify its distribution. 


(Delhi Univ. M.Sc. (Stat.), 1987} 
17. Obtain the p.d.f. of the range in a random sample of size 5 from the 


population with p.d.f. e~*, x > 0. (Meerut Univ. M.Sc. Stat.), 1993} 
18. LetX;, X2, .... Xn be a random sample from continuous population with 
p.d.f. fd=Bpe**; x50, B>0 


=(0, otherwise 

(a) Show that X,) and X(s)—X(y) are independent for any s > r. 
(b) Find the p.d.f. of X41) -Xq 
(c) LetZ;=nXq), Z2=(n—-1) (X(y-X)), 

= (n — 2) (Xx) - Xa), -) Zn = Xn) -Xn- 1) ..(*) 
Show that (Zi, Z2, ..., Zn) and x 1, X2, ..., Xn) are identically distributed. 
Hint. (c) The joint p.d.f. of X(1), X(2y, ..., Xn) is: 
FQ, X2, «., Xn) =! f Orr) f (x2)... f Xn) 

ant pre Pa eo Be oo Bam ..( #4) 

Transformation (*) gives: 


Zi 
X(1) = a? X(2) = he wo #*) 


Z2 Zn-1 Zn 
ne H* ene 2 us 1 
Je 3 (X(1), X(2p --- to )_ 1 
0(Z1, Z2,.-,Zn) nn! 
0<X1)<X<...<Xn<% = 0<Z;<%f=1,2,..,7 
Using (***) and |J |, (**) transforms to 


8 (21, 22, ~2n) = ( B PM) (Be ®*) 6 (BeP™); 0<Zj<0 


=> Z1,Z2,..,Zn are i.i.d. exponential variates with parameter B. 
19. Let .X1, X2, ..., Xn a p.d.f. 


aaa = ~ exp - FS) |sx>0 


=(Q, otherwise 
Show that X(1), X(2) —X(1), X(3) —-X(2), ..-» X(n) —X(n— 1) are independent. 
Hint. Z1=X1)3 22 =X(2) -—X(1),_ «9 Zn = X(n) —X(n- 1) 
=> X (1) = Z1, X(2) = Z1 + Z2, «.., X(ny = Z1 + Z2 +... + Ln 


ye? (X(1), X(2)) «+0» Xen) _ 1 
0 (Z, Yo eee Za) 


Xn) = 


As in above problem 


Rn 
& (21, 22, +) Zn) =n! TT f(xi) |F| 
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- es i _ Flea 
o o G 
x E ev 
oO 


=> Z,Z2,...,Zn are independently distributed. 
20.- Find the p.d.f. of rth order statistic. 
Let X1, X2, ...,.Xn be i.i.d. with a- distribution function 
Fiyely¥ if O<y<1 3 a>0 
0) 0, otherwise 


Xi 


Show that | — ¥.. i=1,2,...,n-—1 and Xn) are independent. 
(n) - 


[Delhi Univ. B.Sc. (Stat. Hons.),.1990] 


21. Let Xi1, Xi2, ......, Xin, U= 1, 2, «...... ,k be k random samples from 
R > ') population. Find the distribution of U = X 1 (n). X2(n) «....- Xk(n), Where 
xi) is the maximum of ith sample. 


(R = Rectangular population) 

(Delhi Univ. M.Sc. Stat.), 1989] 

22. Forthe exponential distribution f (x) = e"*, x 20, find the p.d.f. of the 
range W ina random sample of size n_ and show that 
1 1 1 

EW)el+tstat...t7 

Ans. g(w)=(n-1)e "(1 -e")"~2. w20. 

23. Let X1, X2, ...,X, bea random sample of sizen from U[ a, b ] popula- 

tion. Obtain the p.d.f’s of (i) Xi, (td) Xn) and (iii) joint p.df. of Xa) and 

Xn). 


24. LetX1, X2 bei.id.r.v.’s with p.d.f. 
e* = 


P(X; =x)=——, x=0,132,..5 i= 1,2 


where A >0. Let M = Max. a and N = Min (Xj, X2) 

Find the marginal p.m.f.’s of M and N. 

8-15. Truncated Distributions. Let X bea nde variable with p.d.f. 
(or p.m.f.) f(x). The distribution of X is said. to be truncated at the point X = a if 
all the values of X sa are discarded. Hence the p.d.f. (or p.m.f.) g(.) of the 
distribution, truncated at XY =a is given by: 


g (x) = mae 5 x>a - (8°71) 
-{e. x>a (For discrete rv.) ...-(8°71 a) 


x>@ 
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(x) 


== ; x>a (For continuous rv.) ...(8°71 b) 


f fide 


For the continuous r.v. X, the rb moment (about origin) for the truncated 
‘distribution is given by: 


s f XS @)a& 
= E(X’) =f x8) de = + ..(8°72) 
. f f@ax 


Example 8-54 Let X ~B a Find the mean and variance of the 
binomial distribution truncated at X = 0. 
Solution. Let f(x) be the p.m.f. of X ~ B (a, p) variate. Then the p.m.f. 
2 (x) of the Binomial distribution truncated at XY = 0 is given by: 
ato f (x) f(x) f (x) 
P(X>0) 1-P(X¥=0) 1-f(0) 
1 


a ip |G”: x= 1,2, soe M + *) 
eed 
1 
E(X)= 2 xg (x) =——_ 5 Cc. 
1-¢ x21 P v 
= 1 | nea 
1-q x=0 
= wee aa) v( ¥*) 
E (X’) = ere z = "Cx p* f-* ss 


—— Beveretd 
1-q | x-0 


ett 2.2 
te [me oe] 
| X ~B (np); EX") = Var + (EX)’ = npq + ° p’ | 
2 
Var (X) = EX’ -[ E (X)] 
2,2 Ap 
= npq + n° 
1-7 pq +n p — 1-q' 
Example 8-55 Obtain the mean and variance of a standard Cauchy 
distribution truncated at both ends, with relevant range of variation as (- B, B) . 


(Delhi Univ. M.Sc. (Stat.), 1987] 
Solution. Let f(x) be the p.d.f. of standard Cauchy distribition. Then the 
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d.f. g(x) of the truncated distribution with relevant range of variation as 


(_ B, B) is given by: 
f(x) __f() 


~§ 
1 1 
2 L+x’ __1 1 
B 
5 dx 1+x ltan7? x | 
mI 14x’ z 
-B 
1 a 
SSR wee xs wo * 
oan B (+x) a B (*) 
: ae ee / x 
: —~ dx 
Pan Jiez 


Mean = | x g(x 
J gx) => ee 
Integrand is- an odd function of x ] 


= 0 ise 
B 
Variance = pa’ py) =’ = f x? g (x) dx 
-B 
8 B 
2 x 1 1 
a1 tee Titers 
pee ac eaal eel 
ree |x tan Magers tan p ) 
__B | 


tan} B 
Example 8-56 Consider a truncated standard normal distribution trun- 
cated at both ends with relevant range of variation as (A,B). Obtain the p.d.f., 
mkean, mode and variance of the truncated distribution. 
(Delhi Univ. M.Sc..(Stat.), 1988] 


Let Z~N (0,1) with p.d.f. m(z) and c.d.f. ® (z) = P (Z sz) 


Solution. 
Then the p.d.f. g(.) of the truncated normal distribution is given by 
gy (z 54 
82) =Da oan ® (B) ow" PRICE aezse 4) 
k= (B)-@ (A). 


where 
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B B 
Mean =f zg (2)dz == f z @ (z) dz 
A A ii) 
1 7/7 
We have PE) é 
v2 
d Px 1 -2/2 
=> de ” (2) =e (—z)=—z@(z) 


= fz9@dz=-9() 


Substituting in (ii) 
B 
Mean -7 | —@ (z) 5 = 2 29) =u’ (say) »+( Uti) 


0, if A<0, B>O 
A if A>0O, B>0 


B if A<0, B<0 
Mode 


B 
Variance f 2 g (z)dz-p" 
A 


~ B f . 
+ | z(-9@) b. + f p(2)az|-n" 
A 


==> [Be (8)~A 9 (A)- | (6)- )} ]-2" 


seer 7/) 25.97. 


where p’ is given in (iit) . 
EXERCISE 8 (k) 
i 


Find the mean and variance of the truncated Poisson distribution with 


g (x) = rl 


1. 
parameter A, truncated at the origin. 
+ Ams. pdf.’ . 
P. -hK 4x : ; 2 
en |} eee A 7 a A+A 
»x= 1, 2,3, 5 £ ¢ 2 ;EX ol re 
| | One 4-e% 


1 
1-e* 
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2. Obtain the p.d.f. and the mean of truncated standard normal distribution, 
for positive values only. 


2 
Ans. g(2)=2.5= 7/7 =2p(z);2>03E(Z)= 


3. (2) LetX be normally distributed with mean p and variance 0°. Trun- 
cate the density of XY on the left at a and on the right at b, and then calculate the 
mean of the truncated distribution. [Note that ifa =—c and b= +, then the 
mean of the truncated distribution should equal H. ] 

(Delhi Univ. B.Sc. (Maths. pe ), 1989] 

(b) IfX is normally distributedwith mean p and variance co”, find the 
mean of the conditional! distribution:of X given a < X <b. 

Hint. In fact = in Part (a) is same as in Part (5), stated differently. 


z ~@-w/20°, 
f (x) 5 o e ° Oc xKr< DP 
Mean of truncated distribution is : : 
b b t Pa ‘s 
Sxf)  f &-+ wf @) a 
; a a 
ES = b = 


7) 
ff@ de Sf @) ax 


t 


b 
f @—w) fe) ax 


ra _ 2 [ £@)-fO) 
PES 


where F (x) = P(X sx), is the distribution function of X .. 


|» fi) -stax- a | > + of @) = -W se) 


- 


bot : ; ; 
=f Wf @)de=-07 | fo) | 20? [s(a) -f)1 | 


i) ‘ - 
4. A truncated Poisson distribution is given by the mass function 


' 4 1 ee * : ° ’ 
ed le es xe wm 1253) ASO 


¢ ae * € ‘ 


Find the m.g.f. and bence mean and variance of the distribution, 


ee : 
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S. Consider the p.d.f. 
g(x)= i oo 7)” , X>Xo ...(#) 
where f(x)= (V2x .o)! . exp [ -@-w)'/2.0°],-» <pwp<%,G>0 
Xy 


and = F (x,) = ff@ du 


[(*) is the p.d.f. of N (u, o*) distribution, truncated at the point x = xo ]. 
Show that the first two raw moments can be expressed as: 


Mi =Wtho; M2 =p +AG(xo+M) +0" 
where A [2 —F (xo) ] =f (xo —)/o ] 
6. LetX ~y(@). Obtain the p.d.f. of the truncated distribution, truncated 


at the point x9 and prove that p,’ = EX" for the truncated gamma distribution 
= [E X’ for the untruncated y (a) distribution | 


x [ {1 -Frola+y)}/[ 1 tro (a)}] 


—x x27! 
where /[,, (a) “fé a dx (Incomplete Gamma Integral) 
0 
1 e*.x77) 
Ans. Tobe a| To |i xo 


7. Explain the concept of ‘Truncation’. Fora standard normal distribution, 
truncated at both ends With relevant range of vanation as [ A, B ], obtain mean, 
variance and mean deviation about mean. (Delhi Univ. M.Sc. (Stat.), 1983] 


ADDITIONAL EXERCISES ON CHAPTER VIII 

1. Ifthe random variable X has the density function f (x) = x/2, 0 <x <2, 
find the r* moment of X*. Deduce that Z =X” has the distribution g (y) = 1/4, 
0<y<4, 

Z. Suppose that Y is a random variable for which E (X) =p and V(X) 
=o. Further suppose that Y is uniformly distributed over the interval 
(a, B) . Determine @ and B so that E (X)=E (Y) and V(X)=V(Y). 

3. A boy and a girl agree to meet at a certain park between 4 and 5 P.M. 
They agree that the one arriving first will wait’ hours, 0 s1< 1, for the other to 


arrive. Assuming that the arrival times are independent and uniformly distributed, 
find the probability that they meet. 


Hence obtain the probability that they meet iff = 10 minutes. 
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O tC 1-t } X 


. Shaded Area 
Hint. p=P(|X-Y|s)=7 
= 2[ Area OAB — Area CAD ]|/1 x 1 
21-(1-/)?s2-2 
Ans. ¢=10 minutes, Probability =. 11/36 
4. LetX ~U[0, 1]. Find Corr. (X,Y), where Y=X" 


[Delhi Univ. B.A. (Hons.) (Spl. Course-Statistics), 1989] 

§. (a) LetX, X2, ..., Xn be independent random variables having a com- 
mon rectangular distribution over the interval [ a, b ].. Obtain the distnbution of’ 

Y = max. (M1, X2, ..., Xn) 

(b) Let ¥~U(0,1,2,..., r},r=ab,a>1, b<r, where a and b are 
positive integers. Show that the distribution of X coincides with U + V, where 
Uand V are independent r.v.’s both with uniform distribution on appropriate 
subsets of 0,1,..., ab : (Indian Civil Services, 1988) 

6. IfX1, X2, ...,Xn are mutually independent rectangular variates on [0,1], 
prove that the density function of X) ..X2.....Xn is 

(— log x)"~ : 
f&)=) (n-1)! 
0, otherwise 
7. (a) -Let X; and X2 be independent r.v.’s, each uniform on [0,1]. Show 


,0O<xs1l 


that: 

Y; = V—2 log X, . (sin 2x X2) and Y2 =V-2 log X; . cos (2 m.X2) are inde- 
pendent r.v.’s, and that each is N (0, 1). 

[This is known as Box and Muller transformation .] 

(b) Given a sequence of independent r.v.’s Xi, X2, .... which are uniform 
on [0,1], produce a sequence of independent r.v.’s Yi, Y2,... that are N (0, 1) and 
independent. 


(You may assume that the sum of two independent Normal distributions is 
itself Normally distributed.) 
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8. (a) Assumea random variable X hasa standard normal distribution anq 
let Y=X" 
_ vt 


(i) Show that Fy (f) = f e“““du,t20 
v2x : 


(it) Determine Fy (t) when ¢ <0 and describe the density function fy (f) . 
(b) Let D(x) be the standard normal distribution function and let 


®(x)=f p(u)du 


- © 


Show that 
(i) [ao] =1-90) 57 9),x00 


.. lim x[1-® (x) J. 
pe g (x) 
9. (a) If X; and X, ar independent normal variates with means 


41 and w2 and variance oj and 33, respectively, find’ the relation between a, B, 
y and 8 so that 


»-P (1X1 +02.X2<a)=y and. P(e, X1 + 02X2< B)=h 


(b) IfX and Y are independent normal variates with equal means and 
Standard deviations 9 and 12 yespectively, and if 


P[X+2Y<3]=P[2X-Ye24], 
determine the comnion mean of X and Y. 
10. IfX~N(0,1)andA is constant, obtain the characteristic function of 
(X _A)? , Hence or otherwise prove that 
: Kes 2°! (147A). (r-1),! 
where x, is the rth cumulant of X. 


Ans. Px-a)'(t) = (1—2 it). exp'{ it a*/(1 2 it) } 


=1 


11. (a) IfX is a standard normal variate and @ is a small number, prove 


8 a? 1 o@ 
that Potsalnit gg (e-store | 


(b) Show that 


ff oF de=xe™ "hig 22) 4 5 (ar) +. +} 


‘ 0 
12. LetX ve log-normal] variate with p.d.f. 


ae 


f(x, p, 0”) = <= exp | - (log x — u)?/2.0}, x>0, o>0,|p|< 
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Show that e7X° has a lognormal p.d.f. f(x,a+ bp, 60). If X1, X2, ...,Xn 
are # independent observations on X, then show that G = (X| X2...X,)" also 
has a log-normal p.d.f. f(x, p, o/n) . 

13. (a) The distribution 


dF “a= ve(-28 } —@O<X< 0 
‘5 transformed by the transformation X =.a log. (Y- b) + C . Find the distribution 


of Y. Evaluate the mean, the mode and the median of this distribution of (Y) ‘and 
arrange them in order of magnitude when b>0. 


(b) If Y=alog(X¥—5b)+C bas normal distribution with mean zero and 
nit variance, obtain the distribution of X and evaluate its mean, median and mode. 


14. Astandard variable X is transformed to Y by the relation 
X=+ [logio Y-a] 


2 2 : 
with m =e" © and . = logio e ; show that for the transformed variable 


B, =m? (m+ 3)—4 and Bo =m? (m+ 2m +3) —3 

15. IfX and Y are independent normal variates with zero expectations and 
variances Of and 0% show that Z = XY/VX"+Y" is normal with variance 
o7 = | (1/03) + (1/63) 

16. IfX;;i=1,2,...,n is a random sample of size n from a normal popu- 
lation wilh mean p and variance o’, obtain the joint distribution of 

n n 
U= = a;X; and V= = DX; 
is Tal 

where a;’s and b;’s are arbitrary constants. 

Hence or otherwise show that the necessary and sufficient condition that U 
and V are independent is that a;b;=0. 

17. LetX,,X2 and X3 be three independent normal variates with the same 
mean and variance 0’. Let 
X1 7X2 y. _ Xi 2h2 + X3 d Y. _X1 + X2 + X3 
v2 0 GO 
Show that Y;, Y2 and Y3 are independent normal variates. 
Show that 


Yi = 


3 
Yi+Y3= 5 (X;—X)’, where yates 
aA 


18. A random variable X has probability density function 
f(x) = Cp (x), x2 k 
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where k is a given number, C is a constant chosen to ensure that f(x) is q 
probability density function and 


p(x) =. exp {-! 


i.) 


If g (k) =f gy (x) dx, show that the arithmetic mean and the variance of X 
k 


(k) (k) (k) 

are-respectively ary and 1+ a) kT 

19. Three independent observations X,, X2, X3 are given from a univariate 
N.(m, 07) . Derive the joint sampling distribution of : 

(a) U=X,-X3; (b) V=X2-X3 

(c) W2X; +X2+X3-—3m 

Deduce the p.d.f. of Z = U/V. Show that mode Z=1/2 and obtain the 
significance of this modal value. (Indian Civil Services, 1986) 

20. Neyman’s Contagious (Compound) Distribution. Let X ~ P (AX y) 
where y itself‘is an observation of a variate Y~ P(A;). Find the unconditional 
distribution of X and show that its mean is less than its variance. 

21. Jf X1,X2, ....X_ are independent random variables, having the prob- 


— a 4% 


ability law _p (xj) -<—* ; i= 1,2,...,”,x;=0, 1,2, ..., © 


& 


and if X= =X; and’ Az ip, 
i=! iol 


then under certain conditions to be specified clearly, 


X--_ ]_ 1 6 en = 
Pr | ae dt as no, 


-~ © 


22. Prove that 


“ fe -h4r 
+ fetxde= »y aie -n=0,1,2,... 
n! r! 
rn r=0 
Using the above, write down the relation connecting the distribution func- 
tions of a Poisson and a Gamma variate. 
23. A two-dimensional random variable (X, Y) has the joint p.d.f., 
1 -1 v-1 - 
(x, y) = —______—_ x" ax) oe? 
JoM"Tarm* % 
in the x —y plane where 0 < x < y < & and zero elsewhere. Show that the marginal 
distributions of X and Y are gamma distributions. 
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24. Starting from a suitable urn model, deduce the differential equation of 
the Pearsonian curves in the form 


1 dy E atx 
yd bot bixthyx? 
Also discuss the limitations of ranges in the solution of such differential equations. 
25. Karl Pearson showed that the differential equation 
d[ f(x) ] 7 d-x dx 
f@) a+bx+cx 

yields most of the important frequency curves when appropriate values of 
a,b,c, andd are chosen. Show that 


(.) whend=0 and a=c«=0 as well as b>0, the differential equation 
yields exponential distribution, 


(a) whenb=c=0 and a> 0, the differential equation yields normal dis- 
tribution, and 

(ui) whena=¢=0, b>0O and d>=~b, the differential equation yields 
gamma distribution. 


26. Find the m.g.f. of the distribution with p.d.f. 
1/2 


2 
fee) «(55 | exp ers ,x>0,y>0,p>0 
Also show that 71h cumulant is given by: 
K, =13.5... (2r—3) p¥ Px 
(*) is the p.d.f. of Standard Inverse Gaussian distribution. 
27. Obtain My (t), when X ~ P(A). Find the limit as 1 —> © of m.g.f. of 


(X~A)/Vi and interpret the result in the context of C.L.T. Also prove that: 
6 -r 4k 6 
lim e°.X 1 1 2 
sn. [a | -veeL oe (22) a 


azAtavn, PzrAtbva 


Show that 2X is not a Poisson variate. Give a set of conditions under which 
X + Y too is a Poisson variate. (Indian Civil Services, 1983) 


28. The random variables Xj (k= 1, 2, ...,) are independent and have the 
Cauchy distribution with p.d.f. 


1 n 
,-~m<x<om, Let 1-7 s Aj. 


inl 


1 
x)=—. 
F@) mo 14x 
Examine whether the sequence Yn obeys the weak law of large numbers. 
29. LetXj, X2, ...,Xq,.... be independent Bernoulli variates such that 
P(X =1)=p,, P(X =0) =a = (1—px), k=1,2,..., 0,.... 
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n 


Examine whether the sequence | S Xi } follows the central limit theorem, 


iol 
OBJECTIVE TYPE QUESTIONS 
1. Choose the, correct answer from B and match it with each item inA . 
A B 

(a) B2 fora Normal distribution (1) 3 o* 

(b) Bi fora Normal distribution (2) 0 

(c) 3 fora Normal distribution (3) 3 

(d) 4 for a Normal distribution ipe-t' 0/2 
(4) e 

(e) Characteristic function for Normal distribution (5) $0 

(f) Moment generating function of Normal distribution (6) 0 

(g) Mode of Normal distribution (7) ett oh 


(hk) Mean deviation from mean for Normal distribution (8) 
11. Match the distributions: 


(a) Uniform distribution (1) f@)= —-O<X¥< 


1 
Pa —p)? +a}? 


(b) Normal distribution 2) fae d'? -weree 


(c) Exponential distribution (3) f(x) = ,asxsb 


(ad) Beta distribution (4) f(x)= a”! a-x)""O0sxs1 


B a n) 
(e) Cauchy distribution (SY f(X)= x e*" y20 


WN. if; (¢ = 1, 2, 3, ...,.2) are independent N (0, 1), write (without proof), 
the distribution of 
x 


wn X2 eae 
(1) X;-2X2+X3, (it) Xs’ (tit) = Xi (wv) = - - 


n 3 
(v) = a (iv) ded (i) EX (ii) inj 


q 
IV. Ineach case, in the distribution for which: 
(t) Moments do not exist. 


(ti) Mean = variance. 
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(iii) Mean < variance 
(iv) Mean > variance. 
(v) 6 (= elt-? 
(vi) > (t)= en!!! 
V. State the conditions under which 
(i Binomial distribution, 
(ii) Poisson distribution 
tends to Normal distribution. 
VI. (é) Give two examples of variates which you expect to be distributed 
normally. 
(i) Give two examples of variates which you expect to be distributed 
exponentially. 
VII. State which of the following statements are TRUE and which are 
FALSE. In case of false statements, give the correct statement. 
(i) For normal distribution mean deviation about mean ts greater than 
quartile deviation. 
(ii) X is a random variable following Cauchy distribution, for which 
mean does not exist but variance exists. 
(iii) -In case of normal distribution B, = 3, B2 = 0, 
(iv) IfX and Y are two independent normal variates, then X — Y is also a 
normal variate. 
(v) Binomial distribution tends to normal distribution as n — 9. 
(vi) For normal distribution, mean = mode = median. 
(vii) It is possible to reduce every normal distribution to’ the standard 
normal distribution by a transformation. 
(viii) In uniform distribution, the percentile points are equi-spaced. 
(ix) Normal distribution is symmetrical only for some specified values 
of the mean and variance. 
(x) Normal distribution can be obtained as a limiting case of Poisson 
distribution with the parameter A — 9. 
VIII. Give the correct answer to each of the following : 
(i) The mean and variance of Normal distribution 
(a) are same, (b) cannot be same, (c) are sometimes equal, 
(d) are equal in the limiting case, as n — 9, 
(ii) The mean and variance of Gamma distribution 
(a) are same, (b) cannot be same, (c) are sometimes equal, 
(d) are equal in the limiting case, as n — ©. 
(iii) X is normally distributed with zero mean and unit variance. The 
variance of X? is 
(a) 0, (b) 1, (c) 2, (d) 4 
(tv) The points of inflexion of Normal curve are 


(a)mtzo0, (b)mt20 (c)mt30 (d)m+£o 


8-164 


~» 


Fundamentals of Mathematical Statistics 


(v) The moment generating function of gamma distribution is 
(a) (1+), (+) 1-0, (c) G-9"*, @ +9" 

(vi) The characteristic function of Cauchy distribution is 
ae’, well Me, @e" 

(vii) Area ic the right of the point x; is 0-6 and to the left of the point 
X2, is 0-7. Which is the correct: 
(1) xj >xX2, (i) x1 <x2 or (iii) x1 = x2? 

(viii) The standard normal distribution is represented by 
(a)N (0,0), (6) N(1,1), (c) N(@,1), @ NGI,0). 

(ix) Fora normal distribution, quartile deviation, mean deviation, standard 
deviation are in the ratio 


(a) 3:2: 1, (by 3:4: 1, (c) 12:4, (d) 2 1 
(x) The normal distribution is a limiting form of binomial distribution if 
(a)n > @, p+0, (b) n+0,p>q, (c)n> pn, d) n> 
and neither p norq is small. 
(xi) Normal curve is 


(a) very flat, (b) bell shaped symmetrical about mean, (c) very 
peaked, (d) smooth. 

(xit) The normal distribution is a limiting casé of Poisson’s when 
Ya A+0, WAG, (QA+%, Marco. 

(xiii) In a normal curve the number of observations Jess than mean are 
included in the range 
(a)x#30, (b)X¥+196, (c)x2#20, (d)x+0670 

(xiv) IfX isa standard normal variate, then 4X” is a 


es 


(a) Gamma variate with parameter>, (b) anormal variate, (€) a 
Poisson variate. 
(xv) The range of the beta variate is 
(a) (0,), (b)%,%), (€) (0,1), (@)(-1,+1) 
IX. Fill in the blanks :— 
(t) The mean deviation of normal distribution is... 
(ii) The p.d.f. of Gamma distribution is... 
(iii) The relationship between Beta distributions of first and second 
kind is... 
(tv) The nomiai distribution is a limiting form of binomial distribu- 
tion if... . 
(v) Mean= variance for ... distribution (continuous). 
(vi) For the normal distribution : 
) Bi=.. 
B2=.. 


: Mean deviation =... 
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Quartile deviation = ... 
(viz) The characteristic function of a Gamma distribution is ... 
(viit) The points of inflexion for a‘normal curve are ... 
(ix) For the normal distribution with varaince 0”, 
Hoy =... 
M2ar+1=.. 
(x) A normal distribution is completely specified by the parameters... 
(xi) For normal distribution 
S.D.:M.D.:Q.D.:: 
(xii) IfX~N(p, 0”) then 
P(pno<X<pUt+o)=... 
P(p-20<X<p+20)@... 
P(p-30<X<p+30)=... 
(xiii) If X is a random variable With distribution function F then 
F (X) has ... distribution. ‘ 
(xiv) If ¥~N (0, 1), then X°/2 has ... distribution with parameter ... 


X. Random variables X; are independent and all of them have the same 
distribution defined by 


{(@)= a= exp |- -(x- 1/8}, —-O<X<0 


Find the ieiuibaae 
10 


(i) = X;/10' and (i) X, —2X7 + X3 
iol 
2 
Ans. (1) N|1,= 5 } 
XI. The random variables X;,i= 1,2, ... are independent and all of them 
have the same distribution — by 


(ii) N (0, 24) 


eo IS. 


f=ae e —-O<xX¥<O®, : 
Find the seo of 
1 
(a) 5 5 Xp. 


ie] 


(b) 3X —X2 + 2X3. 


XII. Find the mean and standard deviation of a probability distribution 
whose frequency function is given by 


fx) = =Ce — (1/24) (x" = ~& +9) wex<m 
where C is a cosntant. (Delhi Univ. B.A. (Stat. Hons.), 1986] 


- 4 
Ans. Mean 3. 0° = 12, C = a= 
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XI. (a) Iff(x)= ke t© ww exree, 
obtain the Pas of k, w and o’. 
Ans. k=V(3/x) €°; p#=1, o =} 


(6) If X is anormal variate with mean p and variance o”, find the 
distribution of Y=aX+b. 


Ans. Y~N(ap+b, a’ 0”). 


(c) IfX is distributed normally with mean p and standard deviation o, 
write down the distribution of U=2X—3 and find the mean and 
variance of U. 


Ans. U~N (2-3, 40°) 


XIV. If.X, is normally distributed with a mean 10 and variance 16 and 
X» is nromally distributed with a mean 10 and variance 15 and if W =X, +X2, 
what will be the values of the two parameters of the distribution of the variate 
W ? (Assume that X; and X2 are independent). 
(c) LetX and Y be independent and normally distributed as N (11, o3) and 
N (112,03) . Find the mean and variance of ; (X + Y) 


XV. : Write a note on the role of Normal distribution in Statistics. 
XVI. IfX;, G= 1, 2, 3,...,”) are iid. standard Cauchy vatiates, Write the 
distribution of X g-1 5 2 AGS 


ie’ 
Ans. Standard Cauchy. 
XVII. Is the sum of two independent Cauchy variates a Cauchy varaite? 
If X;. (¢= 12,3, 4) are independent standard normal variates, what is the 
Xi X3 
distribution of X + X.. ? 
Ans. Cauchy. 
XVII. ‘‘The role of Cauchy distribution in statistical theory often lies in 
providing counter examples.’’ Elucidate. 
XIX. Write a note on the role of Central Limit Theorem in Statistics. 


XX. IfX;, (= 1, 2,3, 4) are t..d.N 1), write the distribution of: 


(i) X, -X2 (u) Xi + X2 (iii) “ YG > (iy x — ' 
4 
(v) X-X2+X3—Xq (vi) Ly (vii) Lage 
ree < X3 xt 
(viit) moa (ix) z (x) Yen 


Ans. (i) N (0,2); (i) N (0,2); (ut) Standard Cauchy ; - (iv): ae 
(v).N (0,4); (vi) ¥G); (vit)y(1); (vidi) Bi GD); @) Be Gp); (x) Bz G, 1) 


4 


CHAPTER NINE 
Curve Fitting and Principle of 
Least Squares 


9-1. Curve Fitting. Let (x, y)3!=1,2,...,2 be a given sct of n pairs 
of values, X being independent vanable and Y the dependent variable. The 
general problem in curve fitting is to find, if possible, an analytic expression 
of the form y =f (x), for the functional rclationship.suggested by the given data. 

Fitting of curves to a set of numerical data is of considerable importance- 
theoretical as well as practical. Theoretically it is useful in the study of correla- 
tion and regression, e.g., lines of regression can be regarded as fitting of linear 
curves to the given bivariate distribution (c.f. § 10-8-1). In practical statistics it 
enables us to represent the relationship between two variables by simple al- 
gebraic expressions, é.g., polynomials, exponenual or logarithmic functions. 
Moreover, it may be used to estimate the values Of one variable which ‘would 
correspond to the specified values of the other variable. 

9-1-1. Fitting of a straight line. Let us consider the fitung of a straight 
line 

Y=a+bx (9-1) 
to a set of n points (mu, y;); i= 1, 2, ..., 2. Equation (9-1) represents a family of 
straight lines for different values of the arbitrary constants ‘a’ and ‘b’. The problem 
is to determine ‘a’ and ‘b’ so that the line (9:1) is the line of ” best fit”. 

The term ‘best fit’ is interpreted in accordance with Legender’s principle 
of least squares which consists in minimising the sum of the squares of the 
deviations of the actual values of y 
from their estimated values as given 
‘by the line of best fit. 

Let Pi (xi, yi) be any general 
point in the scatter diagram (§ 10-2). 
Draw P; M L to x-axis meeting the 
line, (9-1) in Hj. Abscissa of H; is 
x; and since H;, lies on (9-1), its. or- 
dinate is a+ bxi. Hence the ‘co-or- 
dinates of H; are (%, a + bxi). 

P; Hi=PiM-HiM 
= yj — (a + bx), 
is called the error of estimate .or the 
residual for y;. 
According to the principle of, 
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least squares, we have to determine a and 5 so that 
2 OA 2 n 2 
E= PH = (y,-a-bx) 
i= t= 


is minimum. From the principle of maxima and minima, the partial derivatives of 
E, with respect to (w.r.t.) a and 6 should vanish separately, i.e., 


a OE : 

om =0=-2 2 (y; -a-bx;) and — 0=-2 be x; (yj — a —bx;) ---(9°2) 

=> 2 y,=na+b > x; and z xy, =a Ex, +b x? ..(9-2a) 
j=] j=]. is i= i=) 


Equations (9:2) and (9-2a) are: known as the normal equations for estimating 
a and 3b. 


n n n n 
All the quantities = x, © x7, Ly, and = x;y;, can be obtained from 
i= i= i= 


i=l 
the given set of points (x, y,); i = 1, 2, ..., 2 and-the equations (9-2a) can be 
solved for a and b. With the values of a and b so obtained, equation (9:1) is 
the line of best fit to the given set of points (x, y,); i = 1, 2, ..., 7. 

Remark. The equation of the line of best, fit of y on x ts obtained on 
Climinating a and 6 in (9-1) and (9-2a) and can be expressed in the determinant 
form as follows: 


Y > ae | 


rs a (92) 
Uxy LX, Lx; 


9-1-2. Fitting of second degree parabola. Let 
Y=at+bX + cX? (93) 
be the second degree parabola of best fit to set of m points (x, y,); i = 1, 2, ... 
n. Using the principle of least squares, we have to determine the constants 4, 
6 and c so that 


n 4.9 
E= & (vy; —a—bx; — cx; ) 

ty . ° ‘= 

is nunimum. 


Equating to zero the partial derivatives of E with respect to a, 6 and ¢ 
separately, we get the normal equations for estimating a, 6 and c as 


OE =Q=-—2 LU; —a-—bx, —cx?) 
Oa 

OE Q=-—2 Xx; -a=-bx; ~cx?) 
Fi) (94) 


—=0=-2 E x? (y, -a—bx, cx?) 
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Dy,enatbEx, +ceLx? 


= Ux Y= aLxFbL x7 +0) ~ ...(9-4a) 
Laxey =alxretb Exec d xA, 
summation taken over i from | to 7. pote 


For given set of points (x, y,); i= 1, 2, ..., 1, equations (9-4a) can be solved 
for a, 5 and c, and with these values of a, 5 and c, (9-3) is the parabola of best 
fit. 

Remark.. Eliminating a, 6 and c in (9-3) and (9-4a), the parabola of best fit 
of y on x is given by 

y x X ] 
Ly, Ex xy on -, 
Ixy; Cx? Lxp Ux, (9°46) 


Lx; y, =x ox Dx | 


9-1-3, Fitting of Polynomial of Ath Degree. If 
Y=a,+a,X+a,X?+..+a," ...(9°5) 
is the &” degree polynomial of best fit to the set of points I) Teds 
n, the constants ay, a). 4, ..., a are to be obtained so that 


= | 2 ky2 
E> 2 Yj ~ 4g — Xj — 4g Xj — 1 Oy XY” 
is minimum. Thus the normal: equations for estimating ay @,, .... a, are obtained 


on equating to zero the partial derivatives of E W.r.t. Ag, 4), ..., a, Separately, 
Le., 


OE 2 k 
Oa 
OF 2 k 
anon te L xX; (Vj — 9 — 4 Xj — Ay Xj — 1. — AyX; ) 
1 ...(9°6) 
OE k 2 k 
Oa, 
=> 
‘ 4 a 
Ly, = nay +a, Xv; tay Ux7 ++++ + ak ¥ xt 
mee 3 ~ k+l 
LU X;Y; = ay UX; +4, Uz +a, Up +++ + ak Ux; (96a) 


4 ” 
Ex y, =ay xf +a, i +a, Se Sr Hok Dee 


> 


Summation extended over / from 1 to n. These are (k + 1) equations in (A + 1) 
unknowns ay, 4), 45, ..., 4, and can be solved with the help of algebra. 
Remark. It has been found that in all the above cases, the values of the 
CE OE 
Baz’ da? 


eee 


second order derivatives, viz., come out to be positive at the 
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points do, 4;, ..., ay, the solutions of the ‘normal equations’. Hence they 
provide minima of E. For proof see Remark | to § 10.7.1-Lines of Regression. 
Example 9-1. Fit a straight line to the following data. 


X: l 2 3 4 6 8 
ae 2-4 3 3-6 4 5 6 
Solution. Let the line be Y=a + bX 
X Y xX? ale XY 
] 2-4 l 2-4 
2 3-0 4 6:0 
3 3-6 9 10-8 
4 4.0 16 16-0 
6 5:0 36 30-0 
8 6-0 64 48-0 
Total 24 24 130 113-2 


Using normal equations (9.2a), we get 
24 = 6a + 246 and 113-2 = 24a + 1306 
Solving these equations, we get a = 1-976 and b = 0-506. 
Example 9-2. Fit a parabola of second degree to the following data : 
Ae 0 1 2 3 4 


y | 1-8 1:3 2:5 6-3 
(Delhi Univ. B.Sc., Oct. 1992) 
Solution. Let Y= a+ bX + cX* be the second degree parabola. 


Using normal equations (9-4), we get 
12-9 = 5a + 106 + 30c ; 37-1 = 10a + 305 + 100c ; 
130-3 = 30a + 1006 + 354c 


Solving these equations, we get a = 1-42, b = — 1-07 and c 0-55. Thus the 
required equation of the second degree parabola is 
Y =1-42-1-07X + 0-55X? 

Remark. If the values which X and Y take are large, the calculation of 


Ex, LX x2, Lx y, ..., becomes qtite tedious and the solution of the normal 
equations, is also quite cumbersome. In this case arithmetic is reduced to a great 
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extent by suitable change of origin in X or (and) in Y. 

9-1-4. Change of origin. Let us suppose thal the values of X are given 
to be cquidistant at an interval of A, i.e., X takes the valucs, (say), a,a+h, 
at 2h,... In is odd, i.e., n= 2m+ 1 (say), we take 

X -—(middictcrm) X—-(a+mh) 
C= SS ES Se 
Interval h 

Now U takcs the values — m,- (m- 1),...,- 1,0, 1,...,(m—- 1), m, so that 
LrU=LU 3=0,. 

If n is even, 1.e., 2 = 2m (say), then there are two middle terms, viz., mth 
and (m+ 1)th terms which are a+(m~ 1)h and a+mh. In this case,we take 


1 
y = X= (mean of two middle terms) _*~ 14 +3 @m~ DA 


+ (interval) - 1 (h) 
_ 2X =2a- (2m - lA 
° h (9-7) 


Now for X=a, ath,...,at+t(2m-—1)h; U takes the values 
—(2m-1), -(2m-3),...,.-3,-1, 1, 3,....,(2m-3),(@m-1). 

Again we sée that [U = £U>=0. 

Example 9-3. The weights of a calf taken at weekly intervals are given 


below. Fit a straight line using the method of least squares and calculate the 
average ratc of growth,per week. 


Age(X) : 1 2 3 4 § 6 7 8 9 10 
Weight(Y): 52-5 $8:7 65:0 70:2 75-4 81-1 87-2 95-5 102:2 108-4 
Solution. Let the variables age and weight be denoted by X and Y 
respectively. 
Here n= 10, é.e., even and the values of X are equidistant at an interval 
of unity, i.e., A= 1. Thus we take 


y=X- ts 6/2} oy 
2 


Let the least-square line of Y on U be Y=a+ BU. 
The normal equations for estimating a and 6 are 
LY =na+bQ and LUY=akU + b£LU? 


X Y U u? UY 
52:5 -9 81 ~ 472.5 
2 58:7 ay 49 ~410-9 
3 65:0 -5 25 ~ 325-0 
4 70-2 23 9 -210-6 
5 75-4 ~] 1 ~ 75-4 
6 81-1 1 1 81-1 
7 87-2 3 9 261-6 
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8 95.5 5 25 477-5 

9 102.2 7 49 715-4 

10 108-4 9 81 975-6 

Total 796-2 0 330 1016-8 


Thus the normal equations are 
796-2=10a+Oxb and 10168=ax0+ 3306, 


which give a=79-62 and es = 3-08 ( approx ). 
The least square linc of Y on U is 


; Y = 79-62 + 3-08U 
Hence the line of best fit of Y on X is 
Y= 79-62 +3-08(2X-11) = Y=45-74+6-16X 
The weights of the calf (as given by the line of best fit Y= A+ BX ) after 
1, 2, 3, ... weeks are A+ B,A+2B,A+ 3B,..., respectively. Hence the average 
rate of growth per week is B units, i.e., 6-16 units. 


EXERCISE 9 (a) 


l. (a) (%, vi); 2=1,2,..., 2, give the co-ordinates of n points in-a plane. 
lt is proposed to fit a straight line Y = aX + 6 to.those points such that the sum 
-of the squares of the perpendiculars from those n points to the line is a mini- 
mum. Find the constants a and b. Use the above method to fit a straight line 
to the following points : 

Ks 0 1 2 3 4 
Ye l 1-8 3-3 4:5 6-3 
Ans. Y = 0-72 + 1-33X 
(b) Fit a straight line of the form Y= AX +B to. the following data: 


> 0 5 10 15 20 25 30 
Y: 10 14 19 25 31 36 39 
2. Show that the line of best fit to the following data is given by 
=—0:5X +8 


xX: 6 7 7 8 8 8 9 9 10 
Y: 5 5 4 5 4 3 4 3 3 


3. (a) How do you define the term “line of best fit". Give the normal 
equations gencrally used to obtain such a line. Fit a straight line and parabolic 
curve to the following data :X :1-01-52-02-53-03-54-0 

ee 1-1 1-3 1-6 2-6 2-7 3-4 4-1 
Ans, Y= 1-04 - 0-20X + 0-24X’ 
(b) Fit a straight line to the following data. Plot the observed and the ex- 
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(hb) Fit.a straight line to the following data. Plot the observed and the ex- 
pected values ina graph and examine whether the straight line gives an adequate 
fit 

5 ae | Z 3 4 5 6 7 8 
y... 55 465 40 38 33 30 29 30 


wv 


4, An experiment ts os to ven the law of falling under gravity 
expressed by =t x gt? 


where S is the distance fallen at time ¢ and g is a gravitational constant. The 
following results arc oblaincd : 


¢( seconds ): ] 2 3 4 5 
S(fect)  : 15 70 140° 250 380 
Taking § as the depenent variable, fit a straight line to the data by the 
method of Icast squares in a manner that you can cstimate g. What is the estimate 
of g? 
5. (a) Explain the method of fitting a second degree parabola by using the 
principle of least squares. 
(b) Fit a parabola Y = a + bx + cx” to the following data : 


X; 1 2 3 4 5 6 7 
Y: 2:3 5:2 9-7 165 29-4 35:5 54-4 

6. Fit a sccond degree parabola to the following data taking X as the 
independent variable : 

X* | Z 3 4 5 6 7 8 9 
Y: 2 6 7 8 1) vu Il ll 10 9 

Ans. Y=-143-55X-0-27X* 

7. In a spectroscopic method for determining the per cent X of natural 
rubber-contcnt of vulcanizates, the variable Y used is 1 + logior, where r is the 
ratio of transmission at two sclected wavelengths. In order to establish a relation- 
ship between X and Y, the following data were obtaincd : 

X: 0 20 40 60 80 100 
Y: 2-19 2-65 3-16 3-57 3-93 4-27 
Using Icast square method, fit a parabola. pone on your resulis. 

8. Fit a second degree curve Y = a + bX + cX” to the following data relat- 

ing to profit of a certain company.. 


Year : 1980 1982 1984 1986 1988 
Profit in lakhs ofrupees: 125 140 165 195 230 
Estimate the profit in the year 1995, 

Ans. Y=1144-7-2X +3-15X?* 


9. Explain the method of Icast squares of fitting a curve to the given mass 
of data : 


Xs. —2 -| 0 ] 2 
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Y; yi y2 y3 4 ys 
Fit a parabola Y = a+ bX + ¢ (X* — 2), by the method of least squares and 
show that 


= l I 
a=y, baa 2y1 — yo + ya + 2ys), C= 44 (2y1 — ya + ya + 2ys) 


10. Show that the best fitting linear function for the points (x1, y:), 
(X2, ¥2), +s (Xas Ya) = May be expressed in the form 
x y I 
9 L Xi Ly; n = 0, (=1,2,..,7). 
Ex? Luy Lx 

Show that the line passes through the mean point (x, y). 

9-2. Most Plausible Solution of a System of Linear Equations. 
Method of least squares is helpful in finding the most plausible values of the 
variables satisfying a system of independent linear equations whose number is 
more than the number of variables under study. Consider the following set of 
m equations in n variables X, Y, Z, ..., T: 

ayX + bY + caZt eee ki T= l, 
a2X + boY + 22+... +hkT= b (98) 
AmX + bmY + CmZ +... + kmT = lm 
where aj, bj, ..., l;; i=-1,2,...,m are constants. 

If m= an, the system of equations (9-8) canbe solved uniquely with the help 
of algebra. If m>n, it is not posible to determine a unique solution X, Y, Z, ..., 
T which will satisfy the system (9-8). In this case we find the values of X, Y, 
Z, ..., T which will satisfy the system (9-8) as nearly as possible. 

Legender’s principle of least squares Consists in minimising the -sumi of the 
squares of the ‘residuals’ or the ‘errors’. If 

E;=aX+b0Y+cZ2+...4+ &T-—Uj; t=1,2,..,m 
is the residual for the ith equation, then we have to determine X, Y, Z, ..., F so that 


m m 
U= LEP= L GiXt+b¥+eZ+...+kT- ly : 
t=] i=] 
is minimum. 
Using the principle of maxima and minima in differential calculus, the par- 
tial derivatives of ‘U’ w.r.t. X, Y, Z, ..., T should vanish separately. 
Thus 
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aU mn : Pe 

—=()= FF a(axthY+aZ2t+...+kT—- lh) 

aX i=1 

aU . Tl) 

ay 70 LY b(aXt+bhY¥+oZ+...+kl - ly (9-9) 
i=} 

aU : aT 

a7 797 L ki(aX+bY¥+ciZ+...+ kT — li) 


i=l 

These are known as the normal equations for X, Y, Z, ..., T respectively. 
Thus we have n — normal equations in n unknowns X, Y, Z, ..., T and their 
unique solution gives the best or the most plausible solution of the system (9-8). 

Here we see that the normal equation for any variable ts obtained by mul- 
tiplying each equation by the coefficient of the variable in that equation and 
then adding all the resulting equations. 

Example 9-4, Find the most plausible values of X and Y from the 
following equations : 

X-5Y+4=0, 2X -3¥+5=0 
X+2Y-3=0, 4X +3Y+1=0 

Solution. Normal equation for X 1s 

1.(X —SY¥+4)4+2 (2X - 3V¥ 45)41.(X¥ + 2Y¥-3)4+4/4X +3Y4+1)=0 

=> 22X +3Y¥+15=0 A *) 

Normal equation for Y is 

~5(X -SY + 4)-—3 (2K -3Y¥ +5) +2 (X + 2Y —3)+3 (4X +3Y 4+ 1)=0 

=> _ 3X+47Y-38=0 .a(**) 

Solving (*) and (#**), we get X =— 0-799 and Y=0-86. 

Hence the most plausible values of X and Y are X = - 0-80 (approx.) and 
Y = 0-86 (approx.) 


EXERCISE 9 (b) 

1, Find the most plausible values of X and Y from the following equations: 
(i) X+Y=3-01, 2X — Y=0-03, 

xX + 3Y= 7-03, 3X + Y= 4-97. 
Ans. X= 1-0003, Y=2-0007. 
(ii) X+Y=3, XaY=2. 

X+2Y=4=0, X= 2Y+1. 
2. Find the most plausible values of X, Y and Z from the following 

equations : 


X-Y¥+2Z=3, 3X +2Y-SZ=5, 4X+Y+4Z=21 and -X + 3Y + 3Z= 14 

Ans. X=2-47, Y= 3-55, Z= 1-92. 

9-3. Conversion of Data to Linear Form. Sometimes it may happen that 
the original data is not in a lifi¢ar form but can be reduced to linear form by 
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some simple transformation of variables. We will illustrate this by considering 
the following curves : 
(a) Fitting of a Power Curve. Y = aX® (9 10) 


to a Set of n points. 
Taking logarithm of both sides, we get 
log Y=loga+ blogx 
— U=A+bvV 
where U=log Y, A=loga and V=logX. 
This is a linear equation in V and U. 
Normal equations for estimating A and B are 
LU =nA+b2ZV and LUV=ALV + bxIV? (9: 10a) 
These equations can be solved for A and b and consequently, we get 
a=antilog (A) : 
With the values of a and b-so obtained, (9-10) is the curve of best fit to 
the set of n points. 
(b) Fitting of Exponential Curves. (i) Y = ab* , (ii) Y = ae™ 
to a set of n points. 
(i) Y =ab™ 9 11) 
Taking logarithm of both sides, we get ; 
log Y= log a+X log b 
=> U=A+BxX = 
where U=log Y, A=loga and B= log b. 
This is linear equation in X and U. 
The normal equations for estimating A and B are 
ZU = nA + BEX and XXU = AXX + BIX? (91a) 
Solving these equations for A and B, we finally get 
a =antilog (A) and B = antilog (B) 


With these values of a and b, (9-11) is the curve of best fit to the given 
set of n points. 


(ii) Y =ae™ (9-12) 
log Y = log a + bX log e= log a+ (6 log e) X 
=> U=A+BX 


where U=log Y, A=loga and B=b loge. 
This is linear equation in X and U. 
Thus the normal equations arc 
LU = nA + BEX and LXXU = ALX + BEX? (9-124) 
From thesc we find A and B and consequently © . 
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a = antlog (A) and h= 
oe loge 


Example 9-5, Fit an exponential curve of the form Y=ab* to the 
following data : 


Xx: I -2 3 4 5 6 7 8 
Y: 1-0 1-2 1-8 2-5 3-6 4.7 6-6 9-1 
Solution. 
X Y U = logY XU x2 
———T To 0:0000—“(i«‘ SHGSs“‘i” 
2 }-2 0-0792 0- 1584 4 
3 1-8 0-2553 0-7659 9 
4 2:5 0-3979 1-5916 16 
5 3-6 0-556 2:7815 25 
6 4-7 0-6721 4-0326 36 
7 6-6 0-8195 5:7365 49 
8 9-] 0-9590 7-6720 64 
Otal 36 ()-5 3-7 eye 


(¥-1 1a) gives the normal equations as 
3-7393 = 8A + 36B 
and 22-7385 = 36A + 204B 
Solving, we get = 
B = 0-1408 and A =-0-1662 = 1:8338 
“, b= Antilog B = 1-383 and a= Antilog A = 0-6821 
Hence the equation of the required curve is 
Y = 0-6821 1-38)" 
Example 9-6. Derive the least square equations for fitting a curve of the 
type Y= aX + (b/X), toa Set of n points (xi, yj); i= 1,2,..., 2. 
Solution. The error of estimate E, for the ith point (x, yj) is given by 
b 
E=[ ae | . 
According to the principle of least squares, we have to determine the values of a 
and 5 so that sum of the squares of errors E, viz., 


n n b 2 
ES > E? = »y (x. ax-< | 


i=] i=] ' 


iS Minimum. 
Consequently, the normal equations are 
dE - b 
<==) =]=9 ‘| y;-axy—-— 
da : : ! ° ( ae 
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which on pines seare give 
n 


L xXUYi=a 5 x + nb 


i= | i=] 
and E 2 | ana b E S| 


Example 9-7, Three independent measurements on each of the 
three angles A, B, C of a triangle are as follows : 


A B C 
39-5 60-3 80:1 
39-3 62-2 80:3 
39-6 60:1 80-4 


Obtain the. best estimates of the three angles taking into account the relation that 
the sum ofthe angles is equal to 180°. 

Solution. Let the three observations on A be denoted by x1, x2, x3, on B 
by yi, ¥2, y3 and on C by 21, 22, z3. Let @;, 82 be the best estimates for A and 
B respectively. 

According to the principle of least squares, our problem is to estimate 
8; and 62, so that 

E = ¥ (x; — 01)? + £ (yi— 2)? + E (2; — 180 + 0) + 2)” 
is Minimum, summation being taken over i from 1 to 3. 

Equating to zero the partial derivatives of E w.r.t. 0; and 92, thé. normal 

equations are 


oe =02-E(n- 01) + L (z;-— 180 + 8; + 82) 4 

oe 0=- 2 (yi — 82) + Z (z;- 180 + 8; + 82) 

00> = i 2 Z; l 2 8) 
From (*) and (#*), we get 

30; —Lxi+ Dz -— 540+ 30; + 302 =0 

302 - Lyi t+ Lz; — 540 + 38) + 302. =0 .(*#**) 


But L x, = 39-5 + 39-3 + 39-6 = 118-4 
L yi = 60-3 + 62-2 + 60-1 = 182-6 
2 z; = 80-1 + 80-3 + 80-4 = 240-8 
Substituung in (#**), we get 
60; + 30> — 417-6=0 and 30, + 60> — 481-8 =0 


A = 6) = 39-27, B = 6 = 60-66 and = 180 — 6; - 82 = 80-07 
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9-4, Selection of Type of Curve to be Fitted. The greatest limitation of the 
method of curve fitting by the principle of least squares is the choice of the 
mathematical curve to be fitted to the given data. The choice of a particular 
curve for describing the given data requires great skill, intelligence and expertise. 
The graph of the given data enables us to have a fairly good idea about the 

pe of the curve to be fitted. The graph will clearly reveal if the trend is linear 
(straight line) or curvilinear (non-linear). [If the graph exhibits a curvilinear trend 
shen further approximations to the type of trend curve can be obtained on 
plotting the data on a semi-logarithmic scale. A careful study of the graph 
obtained on plotting the data on an arithmetic or semi-logarithmic scale often 
provides adequate basis for selecting the type of the curve. The various types 
of curves that may be used to describe the’ given data in practise are: [If v,. is 
the value of the dependent variable corresponding to the value x of the 
independent variable| 


(i) A straight line: Vy = at bx 
(ii) Second degree parabola: y,= a+ bx + cx? 
(iii) kth degree polynomial: y,= ay ta, x ta, pe aeaneat a, xk 
(iv) Exponential curve: y, = ab* 

= log v, = log a + x log b = A + Bx, (Say). 
(iv) Second degree curve fitted to logarithms: 

yy = ab*® cx? 
= log y, = log a + x log b = x? log c 


= 4+ Bx + Cx’, (say). 
(vi) Growth Curves: 


(a) y, = a+b (Modified Exponential Curve) 
(b) vy, = abc* (Gompertz Curve) 
=> log v, = log a +c’. log-b =A + Be", (say) 
i 
k 
= ———>———-.5 < 0 (Logistic Curve 
©) My 1 +exp (a + hx) (08 ) 


For decideing about the type of curve to be fitted to.a given set of data, 
the following points may be helpful: 

(’) When the v, series is found to be increasing by equal absolute amounts, 
the straight linc curve is used. In this case, the graph of the data will give a 
straight line graph. 

(ii) The logarithmic straight line (exponential curve y,. = ab*) is used when 
the serics is increasing or decreasing by a constant percentage rather than a 
constant absolute amount. In this case, the data plotted-on a semi-logarithmic 
scale will give a straight line graph. 

(iii) Second degree curve fitted to-logarithms may be tried if the-data plotted 
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on a semi-logarithmic scale is not a straight line graph but shows curvatuie, being 
concave.either upward or downward. 
For further guidelines, the following statistical tests based on the calculus 
of finite differences [c.f Chapter 17] may be applied. 
We know that for a polynomial y, of nth degree in x, 
A’ So constant, r=” 
=0 \ ,r>n 
where A is the difference operator given by A yz= yx+h— yx, A being the interval 


of differencing and A‘ y, is the rth order difference of yx. 
I. IfAy,= constant, use straight line curve. 


2. If A? yx = constant, use 4 second degree (parabolic) curve. 
3. IfA (log yx) = Constant, use exponential curve. 


4. If A? (log y,) = constant, use second degree curve fitted to logarithms. 
5. IfA yx tends to decrease by a constant percentage, use modified ex- 
ponential curve. 
6. If A yx resembles a skewed frequency curve, use a Gompertz curve or 
Logistic curve. 
7. The growth curves, viz., modified exponential, Gompertz and Logistic 
curves, Can be approximated by the constancy of the ratios 
A yx A log yz A (1/yx) 
A yx~1 : A log yx-1 A (1/yx-1) : 
respectively for all possible values of x. 
EXERCISE 9 (c) 
1, Describe the method of fitting the following curves : 
(i) Y=ae™, (ii) Y= aX? 
2. (a) Fit an equation of the form Y= ab” to the following data : 
oN OX: 2 3 4 5 6 
Y: 144 172-8 207-4 2488 298-6 
Ans. Y = (101-3) (1-196) 
(b) Fit a curve of the type Y= ab™ to the following data : 
x 3 4 55 6 
Y: 8-3 18-4 33-1 65:2 127-4 
Esumate Y when X = 4-5, 7 and 3-5. 


3. Fit a curve of the' form Y = bé* to the following data : 


Year (X):. 1951 1952 1953 1954 1955 1956 1957 
Production 
in tens (Y): 201 «-263—C'—s314. 395 427 S504 «ss «612 


4. In an experiment in which the growth of duck weed under certain con- 
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ditions was measured, the following results were obtained : 


weeks (X)... Oo 1 2 3 4 2 6 7 8 
No. of friends (Y)... 20 30 52 77 135 211 326 550 1052 


Assuming the relationship of the form Y = ae, find the best values of a 
and 6 by the method of least squares. 


5. For the data given below, find the equation to the best fitting exponential 


curve of the form Y= ae. 


xX: 1 2 3 4 > 6 
y: 1-6 4-5 13-8 40-2 125-0 300-0 
Ans. Y =(0-557) er 
6. Fit the curve Y = aX* + (b/X) to the following data : 
X: 1 2 3 4 
) —1-51 0-99 3-88 7-66 
7. The following table gives corresponding values of two variables 
X and Y. 


xX: 1 2 3 4 5 
Y: 1-8 5-1 8-9 14-1 19-8 
It is found that they are connected by a law of the form Y = aX + bX”, where a aid 
b are constants. Find the best values of a and b by the method of least squares. 
Calculate the value of Y for X = 2. 

Ans. a= 1-521; b=0-49 ; 5-006 

8. The following pairs of observations were noted in experimental work on 
cosmic rays. Find, by the method of least squares, the best values of @ and b 
for the equation logR=a-—bC which fits the data and estimate the most 
probable value of R for C = 20-7. 
Cc: 14 15 16 17 18 
R; 24-1 20-5 14-0 73 5-0 

9. (a) Explain the principle of least squares and describe its applications. 
in fitting a curve of the form Y =a exp (6X + cX’). 

(b) Fit an indifference curve of the type XY=5+ aX to the data given 
below: ‘ 
Consumption of Commodity X : l 2 3 4 

Consumption of Commodity Y : 3 1-5 «6 7-5 
Hint. y = a + (6/x). Now proceed as’in Example 9-6. 
Ans, XY=1-3X+1-7 
10. (a) Show that the parabola of best fit for the points 
(x1, y1) 3 (X2, y2) 3... > (X2n+1, Y2n+1) 


where the values of x are in A.P. with common difference unity and x = 0, can be 
expressed in the form 
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2 
x 


x 1 
2, satGert) ; ae 
4 i 0 n(n+1)(2n+1) 0 =0 
, > > 3 n(n+1)(2n+1) : 
L Xi Yi n° (n+1) n 3 
2 


(Delhi Univ. B.A. (Pass), 1984] 
Hint. Use (9-45), with 
=a+i; i=1,2,...,(2n+1). Since x=0, 
Sx=(2n+Da+Li = 0= n+ tar At UCn* 
=> a=-—(n+1) 
Lx72=L (ati =(2nt l)a’+ Li7+2%Li 
and so on, for Ex; and Dx; , 
2 
¥ ta ment NQm+ 1). 5 2 =| mnt) | 
, 2 
=] 


i=] i 


i=l 
(6) When do we prefer logarithmic curve to ordinary curve ? 


and 5 if = 35 (mn + 1)(2m + 1) + 3m- » 


9-5. Curve Fitting by Orthogonal Polynomials. Suppose that the 
pono of pth degree cf Y on X is 
Y=agt aX+arX7+...+aX? (9-13) 
The normal equations for ssieriining the constants a;’s are ‘obtained by 
the principle of least squares by minimising the residual or error sum of squares 
E=2(y~ag—ayx—a2x? -... - agx? P (9-14) 
summation being extended over the given set of observations. The normal equa- 
tions are : 


a. 
yn = = e009 ) 
aa 0, (j= 0,1, 2,....D 
i.e, Ex! (y~ao—ayx~a2x°—...-apx’)=0, [j=0, 1, 2, 0p) ...(9-15) 


Assume that X and Y are measured from their means (and this we can dc 
without any loss of generality) so that 


Mr = [ty = HE(X')=— 5 x’ 
and write, 


] °t 
Ha = Day; 
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where N is number of observations taken on cach of the variables X and Y. 
Hence (9-15) gives 


Hit ~ oH, — 41 y+ — G2 Wy+2—--. — Ap Mysp =O; f=0, 1,2... ei 
= GoW, + i Wj+1 t+ dz bs2+... + Gp Uysp=Mn ss =O, 1, 2... 
Putung s=9, 1, 2, ...,p, we get respectively 
GoWot+ afi +42M2 +... +apUp =Mo 
eve =a +. i ae ae tl ..A9-16) 


Aes aes se. Gs ies ane 
Solving (9-16) for ao, @:,...,ap in terms of the moments p,’s and p's, 
j=0,1,2,...,p and substituting in (9-13) we get the required curve of best fit. 
Let | ) 
[Uo Ly M2 =... Sp 
AY) = Mi Ha M3... Hp +) (9-17) 

: hip p+ Mp+2 +. Wap 
and A ©) be the determinant obtained on replacing (j+ 1)th column of A” by 
the column : 


boi 
Hi” AY) 

then aj = Am (9-18) 
[U1 


The required curve of best fit is the eliminant of a;’s in (9-13) iand (9-16) 
and is given: by 
yt Kk X wan X? 
Hor =o M1 M2 wwe Mp 
Mir py [2 M3. Mpa | = O 9-19) 


Mp) Up Up+) Up+2 + 2p 

The use of equation (9-19) is subject to one serious drawback. If ‘we have 
a set of data and apart from inspection if there is no guide regarding the order 
of the polynomial to be fitted, the only way left to us is to try curves of order 
1, 2, 3, ... until we reach the point where further terms do not improve the fit. 
Every time we add a new term, the a, ’s given by (9-18) change and accordingly 
the determinantal arithmetic has to be done afresh. For example, if we want to 
fit a polynomial curve of third or higher degree to the same data then we cannot 
use the coefficients which we computed while fitting a second degree parabola. 
To overcome this drawback Prof. R.A. Fisher suggested a method which 
involved the fitting of Orthogonal Polynomials by the principle of least squares, 
So that each term is independent of the other, f.e., each of the coefficients in 
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the polynomial ts independent of the other so that each of them can be calculated 
independently. In this method, the coefficients computed earlier remain the same 
and we have to compute the coefficient only for the added term. 
95-1. Orthogonal Polynomials (Def.). Two polynomials Pi(x) and 
P(x) are said to be orthogonal to each other if 
x P1(x) P2(x) = 0, (9-20) 
where summation is taken over a specified set of values of x. If x were a 
continuous variable in the range from a to 6, the condition for orthogonality gives 


b 

| Pics Pox) dx =0 

a ..(9-20a) 
For example, if we take 
Po= 1, P(x) =x-4, Pox) = x7 — 8x4 12, P(x) =x? - 12x7+ 41x - 36 


.-(9°20b) 
then these are orthogonal to each other for a set of integral values of x from 1 to 
7 as explained in the following table. Other examples of orthogonal polynomials 
are Hermite polynomials, Gram Charlier’s polynomials, Legender’s polynomials, 
etc. 


ORTHOGONALITY OF POLYNOMIALS DEFINED IN (9-205) 


x PoP; PoP.  PoPs P,P PiPs - PoP 
1 2 -6 -15 18 -30 
2 2 0 6 0 -12 0 
3 =| ~3 6 3 ~6 -18 
4 0 -4 0 0 0 0 
5 1 ~3 ~6 3 -6 18 
6 2 0 ~6 0 -12: 0 
7 3_ 5. 6 15. 18 30 
Total 0 0 ) U0 U U 


9-5-2. Fitting of Orthogonal Polynomials. The pth degree polynomial 
(9-13) can de rewritten as 
Y = boPo + biP) + b2 P2+...+ bp Pp (9-21) 
where P’s are polynomials in x,P; being a polynomial of degree 
j, G =9, 1,2, ..., p). We shall determine P ’s so that they satisfy the condition of 
orthogonality, viz., 
eo. ~ = * = ‘ is 
LP Pe. & Pix) Px) =O; ;j#k 02) 
the summation being extended over the observed values of x. The normal equations 
fOr estimating the constants 5; ’s are obtained on minimising 


E = (y— boPo— bP — ... — bp Pp)” (9-23) 
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and are given by ' 


=> L Pj (y — boPo- bP, - ... --bp Pp) =0:7=0, 1, 2, ..,p. 
Simlifying and using (9-22), we get 
UP;.y-bj UP? =0 

=> b= 22 jJ=0,1,2,...p. (9-24) 

xP; 

Thus 6; is determined by Pj. If having fitted a curve of order p we wish 
to go a Step further by adding a term bp +1 Pp +1, the coefficients already obtained 
in (9-24) remain unaltered. 

Moreover, the use of orthogonal polynomials will give us a very convenient 
method of determining, step by step, the goodness of fit of the polynomial 
curve. For pth degree polynomial (9-21), the error sum of squares is [c.f. (9-23)} 


E = (y— bo Po- bP} —... — bp Pp)” 
=Zy? +b LPG + by LPL +... + bp BPs 
—2 boLyPo - 2 biLyP1 —...-—2 bp XyPp, 


other terms vanish because of orthogonality conditions (9-22). Using (9-24) we 
finally obtain 
E = Zy*— bo’ © Po’ - by E Pi? ... — by” E Pp” (9-25) 
Thus the effect of adding any term 6; P; is to reduce the error (residual) 
sum of squares E by b? xP? and we may examine the effect of this term on 
E separately. If we find that the addition of any term b, Pp does not reduce E 
significantly, we may conclude that it is not desired (as far as the representation 
of the given data by a polynomial curve is concerned). 


9-5-3, Finding The Orthogonal Polynomial P,, in Fitting a Polynomial 
of Degree p. Let Pp, the polynomial of degree p in x be given by 
Pp : 
Pp= cae Cpj x! .-(9-26) 
j= 


This contains (p+ 1)-unknown constants Cpo, Cpl, «.-, Cpp- Hence in all the 
polynomials in (9-21) up to and including those of pth order, there are 


1+243+...¢(n+ I= 22 UO*? | 


unknown constants. The orthogonality conditions 
LPP;=0,i#j=0, 1,2,...,p, 


: +1] Bs 
provide ?*!C = ore conditions on the c’s so that there are 


(p+ l)p+2) _ (e+ Np _ 
i ile Aes 
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constants which can be assigned arbitrarily. We will take one for each polynomia) 
P, (j = 0, 1, 2, ..., p) and assign it such that the coefficient of » in P, is Unity 
i.e., 


Cy = 1,/= 0, 1, 2, ..., Dp, (9.27) 

In particular Cog Po = |. The orthogonality conditions give: 
= PLP =0,j<p=0,1,2,...,p- 1 --(9:28) 
j =0, gives 2 P, Py = 0 => 2XP,=0; (+ Py = 1) -~(*) 


j=0, gives P,P;=0 => EP,=0,@+h)=0 

— a P, xtked P, = 0 

=> > x * = : (#4) 
P 

=> DP, = (x7 + k,x + k) = 0 [Using (+) 

= x: a = 0 [Using (*) and (**)) 

Similarly proceeding, we shall get in general 


j= 2, gives 2 P, P= 0 


=p, =0, r=0,1,2,...,p-1 (9°29) 
=> x § Ca -* ae 0 
=> AC Ex!*" | = () 

j=0\ % x 


Dividing both sides by N, the number of observations on each of the 
variables X and Y, we get. 


P 
- Cy Hypa, = 0;r=0, 1, 2,.., @- 1) (930) 
where x is assumed to be measured from mean. Putting r = 0, 1, 2, .., (p- 1) 
in (9°30), we respectively 


tot seep 
a + ee +: a Pid Fe Fey yp 1 Upa * pp Up=0 
po HY ie Cp H+ ae eae +e ee 
C0 “Bp —1 * Spt Mp °F Spy Mes pa Fo + Sp p_ 1 Mop_2 F Spy Ma'p- 1 =0 
Noting that Cop = 1, solving the above equations for cs, we get 
Ho ‘ eee “Hp oe H p-1 


Hy Hea wee “Moar ee Hey 


(9°31) 
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where A” has been defined in (9 17) and A”), is the minor of the element 


in the last row and (j + 1)th column in A®. Substituting this valuc of cp, in (9-26), 
we gel ' 


p p - A”, 
j=0 jeo APY 
p 
=— rE AY x! 
AP ) JF 
i AM) 9+ x AP 1+... +x? A, 
Mo pi 2 bp 
1 Hi H2 H3~ UUp+) 
= P, = aon * : - i, 5 (9-32) 
A -1 Up Up+i W2p-1 
l x x? 


In particular if po = 1, 1 = 0 and po = 1, i.e. if x is a standardised variate 
then the orthogonal polynomials are given by 


Po=1 (9-33) 
el 
1 x 
Pi) = ————sx (9-330) 
Mo pi H2 
i p2 H3 
a, : 9.335) 
PXx)= =x°- -] (9: 
(x) va X°— [3x ( 
by b2. (*.° fo = 1, wy = 0, pe = 1) 
eee ae a i 
Pxx) =) 7! + | Mr be ps A9-33¢) 
Ho 3 Ha ps 
11 sx x? x? M2 Us py, 


and so on. 

If we further assume that x isa standard normal variate so that p13 = Us 
=... =2+1=0, then the above orthogonal polynomials are called Hermite 
Polynomials and are given by 

Po=15 Pix) =x; Pox) =x2- 13 Pax) = 22 3x Pax) = x4 — 6x7 43; 
and so on, where x is a continuous r.v. taking values from — © to ©, ...(9-34) 

Remark. Hermite Polynomials defined in (9-34) are orthogonal w.r.t. the 
weight function 
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Q(x) = ex ee * —o <X< 00 
Te | =}: 


ee [p Pi(x) P(x) ox) =0; i#j | 
a ..(9-35) 
where P(x), P2(x), Ps(x), Pa(x) are defined in (9-34). 
9-5-4, Determination of the Coefficients 57s in (9-21). From (9-24), we 
get 
= LyPp/Z P,? ...(9-36) 
Now EP, = P, Pp 
=Z Pp [Cpo t CpiX + Cp2X7 + ... + CopX? | 
x 


=> Ps.x", 
x 
on using (9-29) and the fact that cpp ='1. 
cepa & E cy [x= £ o[ #2?” | 
x j=0 x 


A®) : 


P 
=N Eco tpsi=N ET ues) —— (From(931) 
Mo fs ... Pp 
N 1 Mow. Mpa ; 


~Ae-) |” 
4 Mp1 Mp ss Map=1 


Mp = Hp+1 +++ Hap 
[Proceeding exactly as we obtained (9-32)] 
N A” 


~ Ae-D . (9°37) 
bag Po A Dj 
Similarly, Ly Pp=N roe ye-0 Hi 


N a . [2 eee [Up +1 


~ Aen) | 
Up-1 Hp ee H2p-1 
Hor = bln «s- Mp 
N.A 
~Ae= ...(9-38) 


where A® and A®); are‘defined in (9-17). Substituting in (9-36) we get 
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...(9-39) 
If the variable x takes the intcgral values 1, 2, ..., NV, then the first seven 
of these orthogonal polynomials P;’s j=0, 1, 2,3,...,6 are given by : 
Po(x) = 1, Pi(x) =A1.€ 


2 


P3(x) =A3) § 


‘ Bp eae | 
Pa(x) = Aa) & 565, (N? - 1)(N?- 9) 


Ps(x) = anaes urn 8 


ge _ 26 N?2— 31) &* +— 


(15N4 — 230N? + 407) 5 


Pana he (SN* — 110N? + 329) £2 


"i 
oa ae 2_ 2_ a | 
aag WN? 1YIN?— 9)(N? — 25) 
and so on, where = x—% so that  & = 0 and A;’s are arbitrary constants. 
If y=bo+ biP\(x) + b2 Po(x) +... +b, P,(x); is the orthogonal polynomial 
fitted to the given data then, using (9.24), we gel 


zyPo_Zy 
bo = = ; ( Po= 1) 

xPy ON 

Ly P. | (9-40) 
6; = 2 »@=1, 2, »P) 


The ongin of P;’s is so chosen that 2 P; = 0. 
If N, the number of observations is odd, then we take 


E= 3j-A 
h 
and if N is even thn we take 
_ xi Aj 
5-72) 
where * h= length of the :nterval (for values of x) 
A= middle value (item) of the data 
and Al= Anithmeuc mean of two middle values of the data. 


The values of P;’s and ,’s are obtained from. ‘Statistical Tables’ by 
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R.A. Fisher for the values of N from 3 to 75. In these tables the orthogona) 


polynomials P;’s are denoted by 6,’s. We reproduce below these tables for 
N=3toN=6. 


TABLES OF ORTHOGONAL POLYNOMIALS 


N=3 N=4 N=5 
1 2 } $2 $3 1 =f $3 4 
| l 3 1 -| —2 2 -| } 
-2 —} ~| -3 -} -] 2 4 
1 l l -l -3 0 2 0 6 
3 ] ] 1 -\ —2 -—4 
2 2 | l 
; 2 6 20 4 20 10 14 10 70 
2 4; 10 5 35 
ru l 3 2 1 3 l 1 6 22 
N=6 
91 p2 3 4 p5 
-5 5 -5 1 ~1 
-3 -] 7 -3 5 
—| -4 4 2 -10 
] —-4 -4 —2 10 
3 -1 ~7 -3 —5 
5 5 b) 1 ] 
r oF 70 84 180 28 252 
: 3 a 21 
Vv 2 2 3 12 10 


Example 9-8. Fit a straight-line y = a+ bx...(#) to the following data by 
using orthogonal polynomials. 


Solution. Here N=5. Let us transform to the variable 
p= 424 =x-2 sothat LE=0 


Let the orthogonal polynomial form of straight line (*) be 
y = bot biP1(x) = bo + b191(x) (##) 


curve bitting and Principle of Least Squares 925 


x G=x-2 y cy ¥ 1 
ae . = 2 2 
1 18 -1 18 
2 0 33 0 0 
3 1 45 1 4:5 
4 2 63 2 126 
Total 169 0133 


The values of o) are noted ‘from .the tables for N= 5. From tables we also 
find 


E617 = 10, A= 1 
ing (9- _»y¥_ 169 440. , 2 ym _ 133 _ |. 
Now using (9-40), bo N 5 3-38 3; b= or 10 7 1-33 


d(x =A F=1.(x-2)=x-2 
Substituung in (+#*), the required stratght line is 
y = 3-38 + 1-33 (x - 2) 
=> y = 1-33x + 0-72 
Example 9-9. Fit a second degree parabola to the following data, using 
the method of orthogonal polynomials. 


05 10 1:5 20 25 340 


y 72 110 158 214 290 380 


Solution. Let the second degree parabola be 


y=atbx+cx? .(*) 
and its orthogonal polynomial transform be : 
~y = bo + bi Oi (x) + be 2(x) A **) 
Here we have N=6. Let us transform to 
x-3(1-5+20) 
= = 4 (x - 1-75) = 4x - 7, 
3 (0:5) 


so that & ¢ = 0. From Fisher’s tables we note the values of @1 and » (as given in 
the following table) and also 
E17 = 70, E on? = 84 5 Ay =2, An = 3/2 


x §=4x-7 y ry b> y or y 2 
0-5 5 ! 72 5 5 -360 360 
1-0 as 110 2 -| -330 = -110 
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1-5 -1 158 -1 -4 -158.  - 632 
2-0 ! 214 i -4 214. - 856 
2-5 3-290 3 -1 870 290 
3-0 5 380 5 5 1900 1900 
Total 1224 2136-372 
_ Ey 1224 nay. ,, hy or _ 2136 _ 30, 
bom eg MM: bia Tp = Mag: = 305 


« 


pp 2? 2445s Qe dees 2dr TS ee 14 


2 
_ y eb 2 es 2_ 36-1 
x)= | 12 |-3[@ ary) | 


=3 | 16x? + 49 - 56x — 55 |= 24x? - 84x+ 69-125 


Substituting in (**), we get 
y = 204 + 30-51(8x ~ 14) + 4-43 (24x? — 84x + 69-125) 
= 106-32x7- 128-04x + 83-08 


which is the required second degree parabola of best fit. 


CHAPTER TEN 


Correlation and Regression 


* 


10-1. Bivariate Distribution, Correlation. So far we have con- 
fined ourselves to unvariate distributions, i.e., the distributions involving only 
one variable. We may, however, come across certain series where each term of 
the series may assume the values of two or more variables. For example, if we 
measure the heights and weights of a certain group of persons, we shall get what 
is known as Bivariate distribution—one variable relating to height and other 
variable relating to weight. 


In a bivariaté distribution we may be interested to find out if there is any 
correlation or covariation between the two variables under study. If the change in 
one variable affects a change in the other variable, the variables are said to be 
correlated. If the two variables deviate in. the same direction, i.e., if the increase 
(or decrease) in one results in a corresponding increase (or decrease) in the other, 
correlation is said to be direct or positive. But if they constantly deviate in the 
opposite directions, i.e., if increase (or decrease) in one results in corresponding 
decrease (or increase) in the other, correlation is said to be diverse or negative. 
For example, the correlation between (i) the heights and weights of a group of 
persons, (ii) the income and expenditure is positive and the correlation between 
(i) price and demand of a commodity, (ii) the volume and pressure of a perfect 
gas, is negative. Correlation is said to be perfect if the deviation in one variable 
is followed by a corresponding and proportional deviation in the other. 

10-2, Scatter Diagram. It is the simplest way of thé diagrammatic 
representation of bivariate data. Thus for the bivariate distribution (x,, y,); i= 1, 
2,...,, if the values of the variables X and Y b= plotted along the x-axis and 
y-axis respectively in the xy plane, the diagram of dots so obtained is known as 
scatter diagram. From the scatter diagram, we can form a fairly good, though 
vagne, idea whether the variables are correlated or not, é.g., if the points are 
very dense, i.e., very close to each other, we should expect a fairly good 
amount of correlation between the variables and if the points are widely 
scattered, a poor correlation is expected. This method, however, is not suitable if 
the number of observations is fairly large. 

10-3. Karl Pearson Coefficient of Correlation. As a measure of 
itensity or degree of linear relationship between two variables, Karl Pearson 
(1867—1936), a British Biometrician, developed a,formula called Correlation 
Coefficient. 

Correlation coefficient between two random variables X and Y, usually 
denoted by 7(X, Y) or simply ryy, is a numerical measure of linear relationship 
between them and is defined as 

HX, ¥) = =) (10-1) 


Oy Oy 
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If (x y;) ; i= 1,2, ..., n is the bivariate distribution, then 


Cov (X,Y) =EU{X- EX) ((Y-E(y)}} 
=* 2x1 -X) OF -Y) =H 


Gy? = E(X-BO)?=*5 (,- 3) |, 10.2) 
oy? =E(Y-B))*=" E05 - 5? 


the summation extending. over i from | to n. 


Another convenient form of the formula (10-2) for computational work is as 
follows : 


l os as A Bis - 
Cov (X,Y) =7 2 i -*) OF - Y= OETA —¥ it XY) 


Ly; +x y 


_ | _1 
> a 


a I Pe 
Cov (X, Y) = Ex yi-* y, GP = Eh xp-x? 


and oa Ey? -F? .. (10.2a) 


Remarks 1. Following are the figures of the standard data for r > 0, <0, 
=0, andr=t 1. 


oo 
(r>0) = (t'< 0) a (r =0) ‘ (r=+1) a (r=--1) : 


2. It may be noted that r (X, Y) provides a-measure of linear relationship 
between X and Y. For nonlinear relationship, however, it is not very suitable. 
3. Sometimes, we write : Cov (X, Y) = Oyy 
4, Karl Pearson’s correlation coefficient is also called product-moment 
correlation coefficient, since 
Cov (X, Y)-= E([{X ~ E(X)} (Y- E(Y))) = wa. 
10-3-1. Limits for Correlation Coefficient. We have 


I = 
r(X, ¥) = Cov (X,Y) _, Pi Cee #2 ey) 
Ox Gy ! 


i eats ~ ye 
ye Oi *) ree > (i - | 


fl 
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r(X, Y) = 2 + where _ ses) 
(a,P(Lb;? ) b; =y;-yp 


We have the Schwartz inequality which states that if a;, bj, i= 1,2,...,n 
are real quantities then 


(E abr <( % a?) E bP) 


the sign of equality holding’if and only if 


Using Schwartz inequality, we get from (*) 
r2(X, Y)s lice, Ir(X, Yi<1 => “1 <r(X, Y)<1 _...(10-3) 
Hence correlation coefficient cannot exceed unity numerically. It always 


lies between ~1 and +1. If r = +1, the cérrelation is perfect and positive and if 
r=-l, correlation is perfect and negative. 


Aliter. If we write E(X) = ly afid E(Y) = tly, then we have 


; : 2 
Ox Oy 
7 e(* - nyse =a ELK Hd (¥ - Hy), 5 
Ox Oy Ox Oy 
=> 1#1+27(X,Y20 
=> -1 <'7(X, Y) <1. 


Theorem 10:1. Correlation coeffi cient | is independent of change of-origin 
and scale. ; : 


+. - 


Proof. Let y =A vets , so thatX =a+hU andY=5+kV, 


where a, b, h, k are constants; h > 0, k > 0. " 
We shall-prove that r(x; Y) = r(U, V) -{ 
Since X = a +,hU and Y= 6 + kV, on taking expectations, we get 
EQ) =a+hE(U) and E(Y)=65+kKEV) oe 
X -E(X) =h{U-E(U)} and Y- E(Y) = kV E(V)] 
= Cov (X, Y) =EL{X -EQ)}{Y - E(y)}] ok 
= Efh{U-EWU)) {kK{V-E(V)}} 
= hk E[((U— E(U)}{V —- E(V))}] = kk Cov(U,V)_ ...(10-4) 
is = El{X - E(X)}"\ = Elh{U — E(U)}9) = Woy? 
=> =hoy,(h>0) . ...(10-4a) 
and of = E[{Y -E(¥)}4] = EV - E(V))2) = Ro? | 
=> = koy ,(k >0) .10-4b) 


® 
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Substituting from (10-4), (10-4a) and (10-45) in (10-1), we get 
HX, Y) = LOVELY) _ hk. Cov (U,V) _ Cov U,V) 


Oy Oy hk.oy Sy Oy Oy 


= r(U, V) 


This theorem is of fundamental importance in the numerical computation of 
the correlation coefficient. 


Corollary. /f X and Y are random variables and a, b, c, d are any numbers 
provided only that a #0, c #0, then 
ac 
r(aXx+b,cY+d)= las 
- Proof. With usual notations, we have 
Var (aX + b) = a’oy”; Var (cY + d) =c’oy’; 
Cov (aX + b, cY + d) = acoyy 


= Cov (aX +b,cY+d) . 
r(QX +b, cY¥ + d)=" A (ax +b) Var (cY + d))i2 


_ ac Oxy -  _ _ ac 
“talleloy oy lac r(x, Y) 
If ac > 0, i.e., if a and c are of same signs, then ac/lacl = +1 
If ac <0, i.e., if a and c are of opposite signs, then ac/lacl = -1. 


r(X, ¥) 


Theorem 10-2. Two independent variables are uncorrelated. 
Proof. If X and Y are independent variables, then 


Cov (X, Y) =0 (cf. $64) 
r(X, Y) aa =0 


Hence two independent, variables are uncorrelated. 


But the converse of the theorem is not true, i.e., two uncorrelated-variables 
may not be independent as the following example illustrates : 


Cov (X, Y) 
r(x, = = 
(1) Ox Oy : 
Thus in the above example, the variables X and Y are uncorrelated. But on 
careful examination we find that X and Y are not independent but they are 
connected by the rélation Y = X2. Hence two uncorrelated variables need not 


that r(V, Y) = 0, mercly implics the absence of any linear relationship between 
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the variables X and Y. There may, however, exist some other form of 
relationship between them, e.g., quadratic, cubic or trigonometric. 


Remarks. 1. Following are some more examples where two variables are 
uncorrelated but not independent. 


(i) X ~N(O, 1) and Y =X? 
Since X ~N (0, 1), EX) =0=E(X?) 
Cov -(X, Y) =E (XY) - E(X) E(Y) 


by 


= E(X*) — E(X) E(Y) =0 (.. Y =X) 
_ HX, Y) LUN 9 
Oy Oy 


Hence X and Y are uncorrelated-but not independent. 
(ii) Let X be ar.v. with p.d.f. 


fix)=},-1S5x <1 


and let Y = X2.. Here we shall get 
E(X) =0: and E(XY) = E(X3)=0, => r(X, Y) =0 
2. However, the converse of the theorem holds in the following cases : 
(a) If X and Y are jointly normally distributed with p = p (X, Y) = 0, then 
they are independent. If p = 0, then [c,f. § 10-10, Equation (10-25)] 


1, 5 Sy 1 - } all 
,y) = exp|- = oT <= exp | — = 

fix, y) Oy Von o| > ( Ox ) Oy V2n s 2 ( a 
fix ») =fi@) AO) : 

=> X and Y are independent. 

(b) If each of ‘the two variables X and Y takes. two values, 0, 1 with 
positive probabilities, then r (X,Y) = 0 => X and Y are independent. 

Proof. Let X take the values 1 and 0 with positive probabilities p, and q, 
respectively and let Y take the values 1 and 0 with positive probabilities p. and 
Gz respectively. Then 

r{XX%,Y) =0 >» Cov &%, Y=0 
=> 0 = E(XY) - E(X)E(Y) 
=1.P(X = 1QNY =/1) -[1. P(X) =1)x1.P(¥ =1)) 
= P(X = 10Y = 1) -pypr 


=> PX = 1NY=1)= pip. = P(X =1).PVY=1) 
=> X and Y are independent. 
10-3-2. Assumptions Underlying Karl Pearson’s Correlation 


Coefficient. Pearsonian correlation coefficient r is based on the following as- 
sumptions : 


(i) The variables X and Y under study are linearly related. In other words, 
the scatter diagram of the data will give a straight line curve. 
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(ii) Each of the variables (series) is being affected by a large number of 
independent contributory causes of such a nature as to prodice norma] 
distribution. For example, the variables (series) relating to ages, heights, 
weight, supply, price, etc., conform to this assumption. In the words of Kar] 
Pearson : 

“The sizes of the complex of organs (something measurable) are determined 
by a great variety of independent contributory causes, for example, climate, 
nourishment, physical training and innumerable other causes which cannot be 
individually observed or. their effects measured.” Karl Pearson further observes, 
“The variations in intensity of the contributory causes are small as compared 
with their absolute intensity and these variations follow the normal law of 
distribution.” 

(iii) The forces so operdting on each of the variable series are not 
independent of each other but are related in a causal fashion. In other word, cause 
and effect relationship exists between different forces operating on the items of 
the two variable series. These forces must be common to both the series. If the 
operating forces are entirely independent of each other and not related in any 
fashion, then there cannot be any correlation between the variables under study. 

For example, the correlation coefficient between, 

(a) the series of heights and incomes of individuals over a period of time, 

. . () the series of marriage rate and the rate of agricultural production in a 
country over a period of time, 

(c) the series reiating to the size of the shoe and intelligence of a group of 
individuals, ‘ 
should be zero, since the forces affecting the two variable series in each of the 
above Cases are entirely independent of each other. 


However, if in any of the above cases the value of r for a given set of data is not 
zero, then such correlation is termed as chance correlation of spurious or non- 
sense correlation. 


Example 10-1. Calculate the correlation coéfficient for the following 
heights (in inches) of fathers (X) and their sons (Y): 
} a 65 66 67 67 68 69 70 72 


» 


Ys 67 68 65 68 72 72 69 71 
Solution. 
CALCULATIONS FOR CORRELATION COEFFICIENT 
xX Y x y2 — XY 
65 67 4225 4489 4355 
66 68 4356 4624 4488 
67 65 4489 4225 4355 
67 68 4489 4624 4556 
68 72 4624 5184 4896 
69 72 4761 5184 4968 
70 6S 4900 4761 _ 4830 
72 71 5184 5041 §112 


Total 544 552 37028 38132 37560 
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r(X, Y) = Cov (X, Y) _ - 


OyOy 


* 37560 — 68 x 69 


Es (68) | 38182 _ (69y2| 


= 4695 — 4692 = > = 0-603 
V (4628-5 - 4624) (4766-5 ~ 4761) V4.5 x '§-5 
Aliter. 
(SHORT-CUT METHOD) 


% 
S 


U=X -68 V=¥ -69_ 


1 


4 
1 
6 
1 
9 
9 
0 
4 


COoOoOWN Core aN A 


Cov (U,V) = SUV-T Vax =3 
Oy? = 2 DU? - (Tj2= 9x 36 = 45 


2 1 1 yy (Vyt=d x 4A. 5.5 


Cov, V)_ 
r(U,V) = = ——3 = 0.603 = r(X, 
SySy = V4.5 x 5-5 = 


Remark. The reader is advised to calculate the correlation coefficient by 
arbitrary origin method rather than by the direct method; since the latter leads to 
much simpler arithmetical calculations. 
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Example 10-2..A computer while calculating correlation coefficien: 
between two variables X and Y from 25 pairs of observations obtained the 
following results : 


n= 25, YX = 125, DX? = 650, XY = 100, XY? = 460, YXY = 508 
It'was, however, later discovered at the time of checking that he had copied 


down two pairsas X | Y while the correct values were X | Y 
6 14 8 | 12 
& | 6 618 


Obtain the correct value of correlation coefficient. 
[Calcutta Univ. B.Sc. (Maths. Hons. ), 1988, 1991] 


Solution. 
Corrected 2X =125 - 6- 8+ 8+ 6=125 
Corected YY=100 - 14 - 6+ 12 + 8=100 
Corrected DX?=650 - @& -— 8 + 82 + 6&=650 


Corected LY? =460 - 17 - @ + 127 + 8= 436 
Corrected “XY = 508 -—- 6x14 —- 8x6 + 8x12 + 6x8 = 520 


¥ =f yx 52x 12555, ¥=? SV =3ex 10024 
Cov (XY) =* EXY ~#¥ 235% 520-5x4=4 
Ox = 1 EX? X2=3-x 650 - (5)? = 
oy? ny DY? ~ Yong x 436 - 16 = 38 
‘4 
Corrected r(X, ¥) SO 8737 067 


Example 10-3. Show that if X’,Y’are the deviations of the random 
variables X and Y from their respective means then 


ae A 
(i) rel -wE (5 - st} 
1. (Xi. Yi)" 
(ii) ale, Carel 


Deduce that-1<r<s+1. 
Pere on B.Sc. Oct. 1992; Madras Univ. B.Se., Nov. 1991) 


Solution. (i) Here X; = (x;. -X) and Y; = (Y;- Y) 


, 1of(ii vi 
RHS. =1-g72($i - =a 
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=i oy [AP 4 XB 2x 
~ 0,7 Oy =o x0y 


1 | > l ? 2 Yep?’ 
=1—-—— [oO X? + LY? - X iY; 
reese : op 2 oy = 


1 ps a ry. 
1-Alas za ~X)2+4 sph Wi- Y)2_ yo) LX; X)(Y; Y)) 


w1-$/5.0 Bi die acy 2_—2_ rey 
2la°°* “Ge ~ oyay®  *"* 
e1-2[1+1-2J=r . 
(if) Proceeding similarly, we will get 
RHS. =-1+5(1+142/) 27 ‘ 


X; . ¥; aoe a 
Deduction. Since a a) being the square of a real quantity is 


always non-negative, XI * or) also non-negative. From part (i), we get 


r= oe a non-negative quantity) = r<l suc”) 
Also from part (ii), we get ’ 
r =—] + (some non-negative.quantity) => <~l<r ac*) 

The sign of equality in (*).and (**) holds. if and only: if 

X; YY; 

Ox - Oy -™ Vi=1,2 

x; -_ i=1,2,...,7” 
and Ox Oy 7 

respectively. 
From (*) and (**), we get 
-l<ersl 


Example 10-4. The variables X and Y are connected by the equation 
aX + bY +c =0. Show that the correlation between them is —1 if the signs of 


aand b are alike and +1 if they are different. _ ‘ 
[Nagpur Univ. B.Sc. 1992; Delhi Univ. B.Sc. (Stat. Hons.) 1992) 


Solution. aX + bY+c=0 => aE(X) + bE(Y) +c =0 
: a{X —E(X)) + b(Y -E()) =0 


= (X-E00) =-% (¥-£) 
Cov (X, ¥) = El (X -EQ))(Y-E@}] 
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E(X-EO))? = Se 1(Y-£0))1 = 3. 0? 
by b 


D Re 
.o --6o 
coe a * 
' [P 1b: ‘ 
Voy ~j re ra. Cy” 


_ |+1, if b and a are of opposite signs. 
~ \-1, if b and a are of same sign. 


Example 10-5. (a) If Z= aX + bY and r is the correlation coefficien 
between X and Y, show that 


Gz’ = a’6,” + bay” + 2abroy Sy 


(b) Show that thé correlation coefficient r between two random ee x 
and Y is given by 


r= (Gy? + Gy? — 6x _y*) / 20x6y 
where Oy, Gy and Oy_y are the standard deviations of X, Y and Moe Y 
respectively. 
[Calcutta Univ. B.Sc., 1992; M.S. Baroda Univ. B.Sc. 1992) 


Solution.. Taking expectation of both sides of Z = aX + bY, we get 
E(Z) = aE(X) + bE(Y) 
Z-E(Z) =a(X - E(X)) + O(Y ~E(Y)} 
Squaring and taking expectation of both sides, we get 
67 =a’d;* + bo; + 2ab Cov (X, Y) 
= a0,’ + b*oy* + 2abroy oy 
(b) Taking a= 1, b =-1 in the above case, we have 
Z=X-Y and oy _y = oy? + of — 2ro; oy 
Ox” + Oy? - Sx_ yr” 
"20x 8y 
Remark. In the above example, we have obtained _ 
V(aX + bY) = a? V(X) + b? V(Y) + 2ab Cov (X, ¥) : 
Similarly, we could obtain the result, 
V(aX' — bY) = a? V(X) +b? V(Y) = 2ab Cov (X; Y). 
The above results are useful in solving theoretical problems. 
Example 10-6. X and Y are two random variables with variances oy: 
and Oy respectively and r is the coefficient of correlation between them. If 
U =X +kYand V =X + SEY, find the value of k so that U and V are 


uncorrelated. [Delhi Univ. B.Sc. 1992; Andhra Univ. B.Sc. 1993] 
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= 
Solution. Taking expectations of U = X + kY and V =X + s Y, we get 


BU) = E(X) + REC) and E(V) = BG) +2 EW) 
U -E(U) = (X- nerenre E(Y)) and 
V -E(W) = (X-'BQ)) += (¥ -£)) 
Cov (U, V)= EL(U - EU) (V -FV))] 
= B[ (X - EQ) + k(Y - BON) x (XK - B00) + 3 (Y- £0) ] 


, eo} 
= 0x7 + 2% Cov (X, Y) +k Cov (x, YN) +k a. oy 


= lox? + kox0r | + E + k| Cov (x, Y) ° 


Ox: + kOy 
Oy 


= Ox (Gx + kay) + Cov (X, Y) 


“= (Gy + koy) E S Co = (6x + ky) (1 + oy 


U and V will be uncorrelated if 
r(U,V)=0 => Cov(U, V)=0 
ie., if . (Sx + koy) (1 +r) oy =0 
=> : Ox +koy =0 (-: Oy #0, r #-1) 
=> k= ~ Sy 


Example 10-7. The ‘random Variables. x and Y are jointly normally 
distributed and U and V are-defined by 


, U=X cos arty sina, 
V=¥cosa-X sina 
Show thai U and V will be uncorrelated if 


=r 
n2a = ee | 
lar - Oy’ 


where r = corr. (X, Y), 0x? = my a) and i Var (VY). Are U and V then 
independent ? 
[Delhi Univ. B.Sc. (Stat. Hone.) 1989; (Maths. Rons.), 1990] 
Solution. We have 
Cov (U,V) =E[(U - E(U)) (V~ E(V))] 
= E[[(X - E (X)) cos a + (¥ ~E(Y) sin 
x [LY - E(Y)} cos a — {X - E(X)} sin a} ] 
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= cos? o Cov (X, Y) — sin a cos a.0,7 
+ sin & cos 0..6y” — sin? o (Cov (X, Y) 
= (cos? @ — sin? a) Cov (X, Y) -sin a cos @ (Oy? - Gy’) 
= cos2a Cov (X, Y) - sin & cos a (Gy? - ay’) 
U and V will be uncorrelated if and only if 
r(U, V)=0, ie. iff Cov(U, VY) =0 
i.e., if cos 2a Cov (X, Y) ~ sin a cos a (Gy? — dy”) = 0 


or if cos 2a r OyOy = sin 20 . (Bx? — 0) 
e ‘“ - By Ox; Y 
or if tan 20 = 3,2 - oy? 


However, r(U, V) = 0 does not imply that the variables.U and V are 
independent. [For detailed discussion, see Theorem10-2, page 10-4.]. 
Example 10-8. If X, Y are standardized random variables, and 
r(aX + bY, bX + a¥) = ++ 72? aC) 
find r(X, Y), the coefficient of correlation between X and Y. 
[Sardar Patel Univ. B.Sc., 1993; Delhi Univ. B.Sc. (Stat. Hons.), 1989) 


Solution. Since X and Y are standardised random variables, we have 


E(X) = E(Y) = 
and Var (X) = Var (Y)=1-> wee E(Y) = 1 a 
and Cov(X,Y) = E(XY) = E(XY)=r(X,Y).cyoy| 7") 


= r(xX,Y) 
Also we have 


r(aX + bY, bX + aY) 


7 E[(aX + bY)(bX + aY)] —- E(aX + bY) E(bX +:aY) 


[Var (aX + bY). Var (bX + aY)]'/2 


a E[abX2 + a2 XY + b2 YX + abY?] -0 

~ {[a? Var (X) + b? Var (Y) + 2ab Cov (X,Y)] 
x [b? Var (X) + a2 Var Y + 2ba Cov (X,,Y)]}1/2 

. ab.1 + a2r(X, Y) +:b2 r(X, Y) + ab.1 
~ {[a? + b? + 2ab r(X, Y)I[b? + a? + 2ba r(X, Y)]} 
[Using (**)] 
_ 2ab + (a* + b?). r({X, Y) 
~ a2 + b2 + 2ab. r(X, Y) 
From (*) and (**), we get 
142ab (a? +b). r(X, Y) + 2ab 
a+b? ~ a2+ b2 4 2ab.r(X, Y), 
Cross multiplying, we get 
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(a? + b?) (1 + 2ab) + 2ab. r(X, ¥) (1 +2ab) = (a2 + b*)2. r(X, Y) + 2ab (a? + 5%) 


= 
= 


=> 


(a4 + b* + 2a2b? ~ Zab — 4 ab?) r (X, Y) = (a? + b) 
[(a? ~ b*)2 — 2ab] r(X; Y) = a? + b? 

__ a+b? 
r (X, Y) = (2 52)2 — 2ab 


Example 10-9. If X and Y are uncorrelated random variables with means 


zero and variances 0,” and 0,7 respectively, show that 


U=X cosa+Ysina,V=X sin a-Ycosa 


have a correlation coefficient p given by 


oO 17 ~ 02? 
= [(0,2 — 6,7)? + 40,202? cosec? 2a)!” 
Solution. We are given that 
r(X, Y) =0 = Cov (X, Y)=0, 6,? = 6)? and oy = op? .. (1) 
We have 
Oy? = V(X cos a + ¥ sin a) 
= cosa, V(X) + sin’a V(Y) + 2 sin a cos @ Cov (X, Y) 
= cos’ 6? + sin2a 622 [Using (1)] 
Similarly, 
oy = V(X sin a — Y cos @) = sin?a.o,2 + cos?a.o,2 


Cov (U,V) =E[(U-EU)} (V-E(V)}] 


= E[ ((X - EX) cos a + ((¥ -E(Y)) sin a} 
x ((K - E(X)) sin o - (¥ - E(Y) cos a} | 
= sin & cos a V(X) — cos? Cov (X, Y) 
+ sin? a ‘Cov (X, Y) - sin a cos @ V(Y) 


= (0,7 - 0,2) sin a cos o [Using (1)] 
Now 9? = [Cov (U, V)? 
, Sy2y? 


where Oy’0v' = (cos*a 0,2 + sin*a 0,7) (sih2a 0,2 + cos"a 0”) 


= sin? cos?0(o,4 + 624) + 0,205? (cos* o + sin*ta) 
= sin*a. cos*a(o,4 + 62") + 0,7077[(sin2a + cos’)? — 2 sin’: cos*a) 
= sin’a cos?a (6,4 + G24 — 20,2057) + 0)702 
= sin?ar cos?@ (6,2 — 62)? + 0,20" 
pi (0,2 — 6,2)? . sin?a& cosa 
~ 61269? + sin2a cos?a(0,2 - 67)? 
4 (0,2 - 62)? sin? 20 


0;20,2 + sin? 2a. 4 (6; — 6,7)? 
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=> 
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_ (6? — 02”)? 
~ 40,7o6,? cosec?2a + (6)? -- op)? 
e) 2 = C2” 
~ [(6,2 — 62)? +.40,20,2 éosec22a]!2 


Example 10-10. /f U = aX + bY and V = cX + dY, where X and Y are 
measured from their respective means and if r is the correlation coefficient 
between X and Y, and if U and V are uncorrelated, show that 


OyGy = (ad — bc) oyoy (1 — r?)!2 


[Poona Univ. B.Sc., 1990; Dethi Univ. B.Sc. (Stat. Hons.), 1986) 


Solution, We have 


Cov (X, Y) 5 [Cov (X, Y)}? 
r= => IL-r=t- 5 
Cy Oy Gy? oy” 
(1 - r2) 6? oy? = 0? oy? — [Cov (X, Y)]? .(*) 
[This step is suggested by the answer] 
U =aX+bY ,V=cX+dY 


Since X, Y are measured from their means, 


fs 


E(X) =0=E(Y) = E(U)=0=E(V) ies 
= E(U2); oy? = E(V2) =") 
aX+bY-U=0 and é¢X+dY-V=0 
a Se, ae 
-bV+dU emcee ad -—bc 
X =U - bv) 
aC) 
Y= ps al adibaade 


Var (X) “ole Oy” + b? 0,7 - 2 bd Cov (U, V) 


a | ; 
= (ad — bape Su’ + F Gy’) 
[Since U, V are uncorrelated = Cov (U, V) = 0] 


Similarly, we have 


Var (Y) reas (c? Oo? + a* Ov”). 


Cov (X;'Y) — E(X) E(Y) = E(XY) [-» E(X)=0=EM)) 


“(ad - aoe ad bat EL (dU - BV) GcU + avy] [From (***)] 
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= oo [-cd 6,7 - ab oy7] 
[Using (**) and Cov (U, V) = 0, given] 
= ae [cd Oy? + ab oy’) 
Substituting in (*), we get 
(1-r*) Ox’ oy “aa x [@? oy? + b? oy”) (c? Gy? + a? oy’) 


— (cdo? + ab Gy*)*| 
1 
~ (ad — bc)4 
x [c2d? a4 + a2bo¥4 + (ad? + bc) Gy? G7 
~ 22 Poy a? B a4 - 2abcd oy? 0,7] 


: any [a’d? + bc? —2abed] oy? 6/7 


= ara (ad — bc)? oy o/ 
oy’ oY ¢ 
~ (ad — be)? 
Cross multiplying and taking square root, we get the required result. 
Example 10-11. (a) Establish the formula : 
NrOxOy = Nj) Ox Oy, + N2r20x, Oy, + n,dx,dy; + n2dxody2 (10-5) 
where n,, Nz and n'‘are respectively the sizes of the first, second and combined 


sample : (X1, 1), (Xa, Y2), (X, y), their means r;, 72 and r their coefficients of 
correlation; (Ox, Oy,). (Ox;, Oy,), (Ox, Oy) their standard deviations, and 


dx, =X, —-x ? dy, =y1 -y 


dig=X.-X , dyo=y2-y 
(b) Find the correlation co-efficient of combined sample given that 


Sample I Sample II 
Sample size 100 150 
Sample mean (x) 80 72 
Sample mean (y) 100 118 
Sample variance (6,2) 10 42 
Sample variance (6,*) 15 18 
Correlation coefficient 0-6 0-4 


Solution. (a) Let (41 yi) 3 i = 1, 2, ..., my and (x2), ya); jf = 1, 2,» 
No, be the two samples of sizes n, and nz respectively from the bivariate 
population. Then with the given notations, we have 
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nytn, ° 7 nN, + nz 
NOx? = Ny (Ox,?:+ dx?) + nz (Gx,” + dx2”) ! 
nCy* = n, (Gy,? + dy,”) + nz Gy,” + dy2”) 


— MX, +N2X%. — my +n2y2 
-- (1) 


ny, x a ie _ ee 
2 (x15 — 1) Ou - Yd z (x2; - X2) Oa -— Y2) 
at? Se —————— 
a n,Ox,Oy, » n20x,0y, --@) 
For the pooled sample, we have 
m 2 " m = ie 
a (x1; -X) Oui-y) + a (x2; - X) O25 - y) 


i 


ye fae »+ (3) 
Now 


my he | 
2 Gir *) Ou-y= 2 { { (x1; -%1) + @ -X)} LOu- WD + (9: -y)}} 
Ry _ = a et ny _ 
= 2% ii —%1) Ou-yW+(yr-y) Z (1; - *1) 
_ => ie! — ow -_ -_ 
+ (tx) 2 On1i- ¥1) + m1 (41 ~%) M1 -) 
m my 
But 2% Gui -x,) = 0 and 2 Ou-y) =, 


being the algebraic sum Of the deviations from the mean. 


” - = . 
2 (1i- x) Qri- Y) = M17) Ox, Oy, +m dedy, [Using (2)] 


Similarly, we will get 
m si as 
y (3; ~ X) (¥oj — Y) = nore Ox, Gy, + Nz dxz dy2 
J =z 


Substituting in (3), we get the required formula. 
(b) Here we are given : 
nh = 100, xX; = 80, yi = 100, Gx,” = 10, Oy,” = 15, r; = 0-6 


No = 150, X2 = 72, y2= 118, 0,7 = 12, Gy," = 18, ro= 0-4 


— _ 1X1 + M2 X%2_ 100 x 80 + 150 xX 72 | } 
ET Reh. i00+150 7? 
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— _MmyYi + M2y2_ 100 x 100 + 150 x 118 | : 
Yee 100+150 ~ 08 


dx, =X, -X = 48, dy, =y; -y= 10-8 
dx» =X -x= 3-2, dy2=y2-y= 7-2 
nox’ =n, (Ox,” + dx;”) + ny (Gx,” + dx”) = 6640 
noy’ =n, (Gy,? + dy;”) + n2(Gy,” + dy,”) = 23640 
Substituting these values in the formula and simplifying, we get 
Ny, Ox Gy, + No20x. Sy, + n,dx,dy, + Nz dX2 dy 
se ee OG IKE 
nOxOy 
Example 10-12. The independent variables X and Y are defined by : 
f(x) = 4ax,O0sxS8r fy) 4by ,OSySs 


= 0 , otherwise = 0 , otherwise 
Show that: i 
b-a 


Cov (U, V) =", 


where U=X+Y and V=X-Y 
(1.1.7. (B. Tech.), Nov. 1992] 


Solution. Since the total area under probability curve is unity (one), we 
have : 


Tr 


[ fedde = 4a fade = 1 => 2r=l > a=a5 ..(8) 
0 0 
[ Paddy = 4b [yay =1 => 2bst=1 => b=55 (ii) 
0 0 


2 (@=4axe— »,OSxSr; and fly) = Aby =, Osyss_...(iii) 
Since X and Y are independent variates, 

(X¥,Y=0 => Covi, Y=0 .. (iv) 
Cov (U, V) = Cov (X + Y,X -Y) 


= Cov (X, X) - Cov (X, Y)'+ Cov (Y, X) — Cov (Y, ¥) 


= Oy" — 0/7 (Using (iv)] 
Var (U) = Var (X + Y) = Var (X) + Var (Y) + 2 Cov (X, Y) 
= Oy” + oy” (Using (iv)] 


Var (Y) = Var (X = Y) = Var (X) + Var (Y) - 2 Cov (X, Y) 
= Oy? + oy? [Using (¢v)] 
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Jes 2 
r(U,V) = Cov (U,. Vv). os Ox" = Oy 


2 2 + (v) 
We have: ° ‘ ee ox en 
j 
E(X) = a x f (x), dx =% | xt dx = a [From (iii)] 
0. 0 ° 
2 2) ape 
E(X*) =| f(x) dx= | * dx = 7, 
0 0 7 
4r2 pe | 
. Vat (X) = E (X*) - (EQ) => a 18 = 36a [From (i) 


Similarly, we shall get 
EY) = 7, E(Y) a Vane 
: 3” 2 “18 36b 
Substituting in (v), we get 
1/(36a) — t/(36b) _ b-a 
(U,V) =1136a) + 1366) "b+ a 
Example 10°13. Le? the random variable X have the marginal. density 


fi® = 1,-t. ee 


| 2 a2 
and let the conditional density of Y be 
fylx) =Ix<y<xt+l,-37< x<0 (*) 


2 
= l-w<y<1-x,0<x<5 a 


Show that the variables X and Y are uncorrelated. 
Solution. We have 


! 
2 
1/2 
x? 
xf; (x) dx = | x.L.dx =|] = 0 
1 1 - 1/2 
72 7) ° 
If f(x, y) is the joint p.d.f. of X and ¥, then . 


fx y) =f If, @) =f GO ha). (¥*) [i f, @) = 1] 


E(X) = 


Lx 
O x+l se, © 
E(XXY) = | | xy f (x, y) dx dy + | : f(x, y) dx dy 
so, 0 Ly 
0 «+t ‘1 -Xx 


i 
2 

= | «| y dy acs | x | y dy dx {From (*) and 
0 


—t 
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1 


0 2 
le. 1 
=> [ 0x4 Idx +5 fe (1 =2x) dx 


2/12 8 12° 8 


Cov (XY) = E(XY) -E(X) E(Y)=0 = 7(X, Y)=0 
Hence the variables X and Y are uncorrelated. 


2 
=2| 12 1 41, s|*° 


EXERCISE 10(a) 


1. (a) Show that the co-efficient of correlation r is independent of a change 
of scale and origin of the variables. Also prove that for two independent 
variables r = 0. Show by an example that the converse is not true. State the 
limits between which r lies and give its proof. 

[Dethi Univ. M.Sc. (O.R.), 1986] 


(b) Let p be the correlation coefficient between two jointly distributed 
random variables .X and Y. Show that | p |< 1 and that | p | = 1 if.and only if X 


and Y are linearly related. [Indan Forest Service, 1991] 
2. (a) Calculate the coefficient of correlation between X and Y for the 
following : 
» Ore 1 3 4 5 7 8 10 
Vx 2 6 8 10 14 16 20 


Ans, r(X, Y) = +1 

(b) Discuss the statistical validity of the following statements : 

() “High positive coefficient of correlation between increase in the sale of 
newspapers and increase in the number of crimes leads to the conclusion that 
newspaper reading may be responsible for the increase in the number of crimes.” 

(it) “A high positive value of r between the increase in cigarette smoking 
and increase in lung cancer establishes that cigarette smoking is responsible for 
lung cancer.” 

(c) @) Do you agree with the statement that “r = 0-8 implies that 80%. of 
the data are explained.” 

(ii) Comment on the following : 

“The closeness of relationship between two variables is proportional to r”. 

Hint. (a) No (6b) Wrong. 

(d) By effecting suitable change of origin and scale, compute the product 
ay correlation coefficient for the foHowing set of 5 Observations on 

xX: -10 -5 0 5 10 

| ae 5 9 7 11 13 

Ans. r(X, Y) = 0-34 
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3. The marks obtained by 10 students in Mathematics and Statistics are 
given below. Find the coefficient of correlation between the two subjects. 
Roll No. 1 2 3 4 5 6 7 8 9 10 
Marks in 
Mathematics : 75 30 60 80 53 35 15 40 38 48 
Marks in ; 
Statistics : 85 45 54 91 58 63 35 43 45 44 


, 
4, (a) The following table gives the number of blind. per lakh of population 
in different age-groups. Find out the correlation between age and blindness. 


Age in years : 0—10 10—20 20—-30 30—40 40-~50 
Number of blind 
per lakh : 55 67 100 111 150 
Age in year : 50—60 60—70 70—80 
Number of blind 
per lakh : 200 300. 500 

Ans. 0-89 


. (6) The following table gives the distribution of items of production and 
also the relatively defective items among them, according to size-groups. Is 
there ay correlation between:size and defect in quality ? 


Size-Group  : 15—16 16—17 17—18 18—19 19—20 20—21 
No. of Items : 200 270 340 360 400 300 
No. of defective 

items 150 162 170 180 180 120 


Hint. Here we have to find the correlation coefficient between the size- 
group (X) and the percentage of defectives (Y) given below. 


xX 15-5 16-5 17-5 18-5 19-5 20:5 
Y 75 60- 50 50 45 40 
Ans. r= 0-94, 


5. Using the formula 
Oy _y2= Oy? + Of? — 2 7(X, Y) Oy Oy 
obtain the correlation coefficient between the heights of fathers (X) and of the 
sons (Y) from the following data : ° 
P Ge 65 66 67 68 69 70 71 67 
Y : 67 68 64 72 70 67 70 68 


6. (a) From the following data, compute the co-efficient of correlation 
between X and Y. 


X series Y series 
No. of items ‘15 15 
Arithmetic mean 25 i8 
Sum of squares of deviations 136 138 


from mean ' 
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Summation of product of deviations of X and Y series from the respective 
arithmetic means = 122. 
Ans. r (X, Y) = 0-891 ; 
(b) Coefficient of correlation between two variables X and Y is 0-32. Their 
covariance is 7-86. The variance of X is 10. Find the standard deviation of Y 
series. | 
(c) In two sets of variables X and Y with 50 observations each, the 
following data were observed : 


X = 10, oy = 3, Y =6, oy = 2 and r(X, ¥) = 03 

But on subsequent verification it was found that one value‘of X (= 10) and 
one value of Y (= 6) were inaccurate and hence weeded out. With the remaining 
49 pairs of values, how is the original value of r affected ? 
(Nagpur Univ. B.Sc., 1990) 

Hint. LX =nX = 500, LY=nY=300 

=X? = n(Gy? + X*) = 5450, XY? = 50(4 + 36) = 2000 
OXY vey 


ro0xOy = Cov %, Ya XY 
+» 03x3x2 =2X¥ _10x6 
a SEXY =50(1-8 + 60) = 3090 


After weeding out the incorrect pair of observation, viz., (X = 10, Y = 6), 
the corrected values of }X, LY, /X?, YY? and LXY for the remaining 50 -1 = 
49 pairs of observations are given below : 


Corrected Values : 
> X = 500 - 10 = 490 sx SY = 300 - 6 = 294 
Y XY = 3090 - 10 x 6 = 3090 — 60 = 3030 
YX? = 5450 — 10? = 5350, LY? = 2000 — 62 = 1964 


pe Corrected Cov (X, ¥) _ 9/49 _o.3 
"*" “(Corrected Oy) x (Corrected) oy je, a 
a9 * 49 


Hence the correlation coefficient is invariant in this case. 

(d) A prognostic test in Mathematics was given to 10 students who were 
about to begin a course in Statistics. The scrores (X) in their test were 
examined in relations.to scores ¢Y) in the final examination in Statistics. The 
following results were obtained :— 

~X=71, DY =70,> X2 = 555, d Y? = 526 and © XY = 527 

Find the coefficient of correlation between X and Y. 

(Kerala Univ. B.Sc., 1990) 

7. (a) X, and X, are independent variables with means 5 and 10 and standard 
keviations 2 and 3 respectively. Obtain r(U, V) where 

U = 3X, + 4X» and V= 3X, —X> 
Ans, 0 (Delhi Univ. B.Sc. ,1988) 


10-22 Fundamentals of Mathematical Statist, 


(b) If X and Y are normal and independent with zero means and standard 
deviations 9 and 12 respectively, and if X + 2Y and kX — Y are non-correlated, 
find k. 

(c) X, Y, Z are random variables each with expectation 10 and variances |, 
4 and 9 respectively. The correlation coéfficients are 

r(X, Y) =0,r(Y, Z) =r (X, Y) = 1/4 

Obtain the numerical values of : 

(i) E(X + Y-2Z), (ii) Cov & +3, Y +3), (éii) V(X -—2 Z) and 

(iv) Cov (3X, 5Z) 

Ans. (t) = 0, (i) 0, (it) 34, and (iv) 45/4. 

(d) X and ¥ arc discrete random variables. If Var (X) = Var(Y) = 07, 


Cov (X,Y) = x find (i) Var (2X — 3Y), (ii) Corr (2X + 3,'2Y — 3). 
8. (a) Prove that : 
V(aX tbY) = a*V(X) + b°V(Y) + 2ab Cov (X, Y) 
Hence deduce that if X and Y.are independent 
V(X + Y) = V(X) + V(Y) 
(b) Prove that correlation coefficient between X and Y is positive or 
negative according as 
Oxy+y>Or<Oy_y 
9. Show that if X and Y are two random variables each assuming only two 
values and the correlation co-efficient between them is zero, then they are 
independent. Indicate with justification whether the result is true in general. 
Find the correlation coefficient between X and a —X, wheré X is any 
random variable and a is constant. 
10. (a) X; (i =1, 2, 3) are uncorrelated variables each having the samé¢ 
standard deviation. Obtain the correlation between X, + X2 and X2 + X3. 
Ans. 1/2 
(b) If X; (i=1, 2, 3) are three uncorrelated variables having standard 
deviations 0,;, 0, and 6, respectively, obtain the coefficient of correlation 
between (X, + X9) and (X, + X3). 


Ans. 6,2/V (6,7 + 07) (o + 037) 

(c) Two random variables X and Y have zero means, the same variance 0 
and zero correlation. Show that 

U=Xcosa+Ysina and V =X sina-Ycosa 
have the same variance o? and zero correlation. 

(Bangalore Univ. B.Sc., 1991) 

(d) Let X and Y be uncorrelated random variables. If U =X +Y and 
Vv =X -Y, prove that the coefficient of correlation between U and V #8 
(Sx? - 5 y2) (ox? + Oy*), where oy? and Gy" are variances of X and! 
respectively. 

(e) Two independent random variables:X and Y have the followiné 
variances : 6? = 36, oy” = 16. Calculate the coefficient of correlation betwee 


U=X+YandvV=xX-Y 


Correlation and Regression ve 
(f) Random variables X and Y have zero means and non-zero variances 0,” 
and oy*. If Z= Y —X, then find oz and the correlation coéfficient p(X, Z) of X 
and Z in terms of Gy, Gy and the correlation coefficient p (X, Y) of X and Y. 
(g) If the independant random variables X,, X2 and X3 have the means 4, 9 
and 3 and variances 3, 7, 5, respectively, obtain the mean and variance of 
(i) Y=2x, - 3X + 4X3, (ii) Z=X, + 2X» — X3, and 
(iii) Calculate the correlation between Y and Z. 
[Delhi Univ. M.A.(Eco.)., 1989] 
11. (a) X;, X2, ..., X, are uncorrelated random variables, all with the same 
distribution and zero means. Let X = DX; /n 
Find the correlation co-efficient between (i) X; and X and (ii) X;- X and X. 
(Delhi Univ. B.Sc. (Stat. Hons.), 1993} 
= o2/n 1 
Hint. r(X;, X) oo. oe! 
Vo2.07%/n Vn 
Cov(X;- X,X ) = Cov (X;,X)- Var(X) 
= (67/n) — (9?/n) = 0 
ee r(X;~X,X)=0 
(b) X,, Xa, ... «.. , X, are random variables each with the same expected 
value }1 and s.d. o. The correlation coefficient between any two X's isp. Show 


that (i) Var f) == 4 { _- 1 }Po* 


1 
n-1 


WH) BE ,- X )2=(n- 1)(1 — p)o?, and (iii) p > = 


12. (a) If X and Y are independent random variables, show that 
r(X + Y,X-Y=r(X,X +Y)—r?2 (Y,X +Y), 
where r(X + Y, X —Y) denotes the co-efficient of correlation between (X + Y ) 
and (XY), (Meerut Univ. B.Sc., 1991) 
(>) Let X and Y be random variables having mean 0, variance 1 and 
“orrelation r. Show that X — rY and Y are uncorrelated and that X'-7rY has 
zero and variance 1 — r2. 
13. X, and X,-are two variables with zero means, variances 6,2 and 6,2 
ppectively and r is the: correlation coef.icient between them. Determine the 
ues of the constants a and b which are independent of r such that X, + aX, 
nd X1 + bX, are uncorrelated. 
14. @) If X, and X, are two random variables with means 1, and py, 
cg 6,”, 62? and correlation coefficient 7, find the correlation co-efficient 
n 
U=a,X, + a2X> and V= bX, + 2X2, 
"here a), a, and b,, by are constants, 
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(b) Let X,, X_ be ped slau random variables with means [Hy {lz and nop, 
z6ro variances 0)”, >? respectively. Let U = X, — X_ and V= X, X>. Find the 
at relation cocfficient, between (i) X, and U, (ii) X, and V, in terms of t1,, I, 
0,2 ; O>?. 

15. (g) If U = aX + bY and V= bX —aY, where X and Y are measured fron 
their respective means. and if U and V are uncorrelated, r the co-efficient of 
correlation between X and Y is given by the equation. 

Oy Oy = (a? + b?) oy Gy (1 — r?)!2 (Utkal Univ. B. Sc., 1993) 

(b) Let U = aX + bY and V = aX — bY where X, Y represent deviations fron, 
the means of tWo measurements on the same individual. The coefficient og 
correlation between X and Y is p. If U, V are uncorrelated, show that 

Oy Oy = 2aboy Gy (1 — r*)!” 

16. Show that, if a and b are constants and r is the correlation coefficien, 
between X and Y, then the correlation coefficient between aX and bY is equal to 
rif the signs of a and 6 are alike, and to — ¢ if they are different. 

Also show that, if constants a, b and c are positive, the correlation 
coefficient between (aX + bY) and cY is equal to 


(aroy + boy) /V(a2o,2 + boy? + 2abroyo;) 

17. If X,, X. and X3 are three random variables measured from thei 
respective means as origin and of equal variances, find the coefficient of 
correlation between X, + Xz and X2 + X3 in terms of 12, 7,3 and r23 and show 
that it is equal to *. 

(jy Lif r13 = r3 = 0, and (ii) = aoe if ry3 = 123 = 1 

18. (a) a weighted distribution (x;, w;), (f= 1, 2, ...., 2) show that the 
weighted arithmetic mean x,, = £ w;x,/Z w; > or < the unweighted mean, 
x = Lx,/n according as r,, > or <0. 

(b) Given N values x), Xo, ..., Xy Of variable X and weights wy, to, ...., Wy, 
express the coefficient of correlation between X and W in terms involving the 
difference between the arithmetic mean and the weighted mean of X. 

19. (a) A coin is tossed n times. If X and Y denote the (random) number ol 
heads and number of tails turned up respectively, show that r (X, Y) =~=1. 

Hint. Note thatX + Y=n=> Y =n-X 

: r(x, Y=r(X,n-X)=r(X, -X)=-r (CX, X)=-. 

(6) Two dice are thrown, their scores being a and b. The first die is left on 
the table while the Second is picked up and thrown again giving the score ¢. 
. Suppose ‘the process is repeated a large number of times. What is the.correlation 
coefficient between X=a+band Y=a+c? 


Ans. r (X, Y)=5 


20. (a) If X and Y are independent random variables with means 1, and 2 
and variances 6,2, 2? respectively, show that the correlation coefficient betweet 


U=X and V=X—Y in terms of ty, ty, 0,2 and 62 is o\/ VO ,2 + O22 . 
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(6) If X and Y are independent random variables with non-zero variances, 
show that the correlation coefficient between U = XY and V =X in terms of 
mean and variance of X and Y is given by 


‘P20; /: N6,20%7 + [1209 + [2207 
(Dethi Univ. B.Sc. (Stat Hons.), 1987] 
21. If X;, Y;and Zj are’ all independent random variables with mean zero 
and unit variance, find the ee coefficient ia 
¥ m 
U = Exedy, andV = 3. X;+ Pe? 
‘= ial 
Ans. r(U, V) = m/(m + n) (Bombay Univ., B.Sc, 1990) 


22. (a) Find the value of / so that the correlation coefficient between 
(X — lY) and (X + Y) is maximum, where X, Y are independent random 
variables each with mean zero and variance 1. [Ans. / = —1] 


Hint. U =X - ly ;V=X +Y. Now find / so that r (U, V) = 1. 


(b) If U =X +kY and V=X + mY andr is the correlation coefficient 
between X and Y, find the correlation coefficient between U and V. Show that 


U and V are uncorrelated if gE = —Ox(Ox+rm oy) 
Oy (r Oy + mOy) 


and further if m =", then k =- ©. (Gujarat Univ. M,A., 1993) 
Y Y 
23. X,, X2, X4 are three variables, each with variance o? and the correlation 


coefficient between any two of them is. If X = (X, + Xz + X3)/3, show that 
= 2 - 
Var ®)=F + 2r) 
Deduce that r 2-1/2. 
24. (a) If U = aX + bY and V = 6X —aY, show that U and V are 
uncorrelated if ae Je 5 = —POxSr- Sy 
—b Sy” -90 y’ 
where p is the coefficient of correlation between X and Y. Show further that, in 
this case 
Gy? + OY = (a? + b*)(Gy? + ay’) and Gy oy = (a? + b?) ay oy V1 - p2 
(b) If u = aX + bY, v=cX + dY, show that 
var(1) cov (u,v) a b |2|var(X) cov (X, Y) 
cov (u,v) var. (v) c ld cov (X, Y) var (Y) 
25. If X is a standard normal variate and Y = a + bX + cX2?, 
where a, b,c are constants, find the correlation coefficient between X and Y. 


Hence or otherwise obtain the conditions when (i) X-and Y are uncorrelated and 
(ii) X and Y are perfectly correlated. 


26. (a) If X ~N (0, 1), find corr (X, Y) where Y= a + bX + cX?. 
(Dethi Univ. B.Sc. (Maths. Hons.), 1985] 
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Vb? + 2c? os 
(b) If X has Laplace distribution with parameters (A, 0) and 
Y=a+ 6X + cX2, find p(X, VY) 
[Dethi Univ. B.A. (Stat. Hone. Spl. Course), 1989) 
riint. px) = ; dX exp [-Al xl], 0 <x <oe, 


E (X**1) = 0 = poe; EX) = py = (2K)! / A* 


Ab 
PAY pa? + 10c? 

27. Ina sample of n random observations from exponential distribution 
with parameter 1, the number of observations in (0, 1/A) and (1/A, 2/A), 
denoted by X and Y are noted. Find p(X, Y). 

t/a 


Hint. p,=pO0<X<i1f)= [ Ae ai= € 
0 


Ans, r(X, Y) = 


- 1 
e 


2fr 
Pr= pil < ¥ <2) = 
1A 


Then (X, Y) has a trinomial distribution with parameters (n = 3, py, po, 
p3 = 1—p; —p2). 
Hence we have 
e- 1 


nt real as 
(1 = py) — p2) Vez-e41 


28. Prove that : 
r(x, ¥+Z)=— 5 re YN+— ne!) 


29. If X and Y are beacon ac random pois find Corr(X, XY). 

Deduce the value of Corr(X, X/Y). 
Ans. 7(X, XY) = Oy My/lOy707? + py? Oy? + Hy? 0x7]! 

30. Prove or Disprove : 

(a) r(X,¥Y)=0 => r(iX |, Y) =0 

(b) r(X%,Y=0,r(Y¥,2Z=0 = r({X,ZF =0. 

Ans._(a) Falsé, unless X and Y are independent. 

(6) Hint. Let Z=X, andX and Y be independent. Then 
(X,Y) =0= r(Y, Z). Butr(X, Z) =r(X, X)= 1. 

31, Let random variable X have a p.d.f. f(.) with distribution functiort 
F (.), mean" and variance o?. Define Y=a+8X, where o and B are 
constants satisfying — 0 < a <«, and B > 0. 

(a) Select o and B so that Y has mean 0 and variance 1. 

(b) What is the correlation coefficient between X and Y ? 
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32. Let (X, Y) be jointly discrete.random variables such that each X and Y 
have at most two mass points. Prove or disprove : X and Y are independent if 
and only if they are uncorrelated: 

Ans. True. 

33: If the variables X,, X>, ..:, Xz, all have the same variance o? and the 
correlation coefficient beeen X; ant Xj G4 iD has the same value, show that 


the correlation between z X; and = Xi is given by [np/{1 + (n — 1)p}]. 


34, The means of caleaten r.v'’s X;,X>, ...,X, are zero and variances 
are equal, say unity. The corrélation coefficients between the sum of selected t 
(<n) variables out of these variables and the sum of all n variables are found 
out. Prove that the sum of squares.of all these correlation coefficients is *'C,.;. 


[Burdwan Univ. B.Se. (Hons.), 1989) 


35. Two variables U and V are made up of the sim of a number of terms 
as follows : ; ; 


U =X) +Xo+ o +X, +Y, + Y, +... +Y, ; 
V=X,+Xo+...+¢X,4+2Z,+2,+...+Z,, 
where a and b are all: suffixes and where X’s, Y’s and Z’s are all uncorrelated 
standardised random variables. Show that the correlation coefficient between 


U and V is ————" ——. Show further that 
V(n + a)(n + b) 
ee ea) (*) 
and n=aVin+b)U-V(n+a)V - 
are uncorrelated [South Gujarat Univ. B.Sc., 1989) 
36. (a) Let the random variables X and Y have the joint p.d.f. 
fy) =18; &, y) =, 0), (1, 1) (2, 0) 
Compute E(X), V(X), E(Y), V(Y) and r(X, Y). Are X and Y stochastically 
independent ? Give reasons. 
(b) Let (X, Y) have the probability distribution : 
fC, 0) =.0-45, fC, 1).= 0-05, fC, 0) = 0-35, f(1, 1) = 0-15. 
Evaluate V(X), VY) and p(X, Y). 
Show thatwhile X and Y are correlated, X and X—SY are ‘uncorrelated. Are 
X and X — SY iridependent ? 
(c) Given the bivariate probability distribution : 
f£C1,0) = 1/15, fC1, 1) = 3/15, f(-1, 2) = 2/15 
(0,0) = 2/15, f(0,1) = 2/15, f(0,2) = 1A5 
= 1/15, f(,1) = 1/15, f(Q,2) =2/15 
f (%, y) = 0, elsewhere. 


PA 


(i) The inarginal distributions-of X and Y. 
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(ii) The conditional distributions of Y given X = 0. 
(iti) E(YIX = 0). 
(iv) The product moment correlation coefficient between X and Y. 
Are X and Y independently distributed ? 
37. If X and Y are standardised variates with correlation coefficient p, 
prove that E [max (X?, Y2)]<1+ V1- p? 
Hint. max (X?, ¥2) = 51X%-Y21+1Q24+¥2) = (4) 


E(X) = E(Y) = 0; E(X) = E(¥?) = 1; EQY) =p 
and [EIX-YI.IX+YIPSE(X-Y)?.EX+Y) 
(By Cauchy-Schwartz Inequality) 
38. The joint p:d.f. of two variates X and Y is given by 
fix. y) = Kiet y)-(C2 &y)150<, y) <1 
= 0, otherwise. 
Show that X and Y are uncorrelated but not independent. 
39(a). If the random variables X and Y have the joint p.d.f., 
x+y;0<x<10<y<] 
FG y= : 
0, elsewhere 
then show that the correlation coefficient between X and Y is ~ + ; 
[Madras Univ. B.Se., Oct., 1990] 
(b) The density function f of a random variable X is given by 


fn { kx?, if- hex's 1 
; nae | 0, otherwise 
(i) What is the value of k ? What is the distribution function of X ? 
(ii). Obtain the density function of the random variable Y = X?. 
(iit) Obtain the correlation coefficient between X and Y. 
(iv) Are X and Y independently distributed ? 
40(a). lef (x, y) ===; o<x52,25y <4, 
find ~(i) Var (X), (i) Var (Y) Gii) r &, ¥). 
wal oe dle ok ie 
Ans. (i) 36 » (ii) 36” (iit) a 11° | 
(b) Given the joint-density of random variables X,.Y, Z as : 
f@ y,z)=kxexp + (y +2)),0<x<2,y 20,220 


=Q, elsewhere 
Find 


(i) k, 
(ii) the marginal density function, 
(iii) conditional expectation of Y, given X and Z, and 
(iv) the product moment correlation between X and Y. 
[Madras Univ. B.Sc. (Main Stat.), 1988) 


.Correlation and Regression 10-29 
(c) Suppose that the two dimensional random variable (X, Y) has p.d_f. 


given by Ff ( y)=ke? ,0O<x<y<l 
= 0, elsewhere 
Find the correlation coefficient ryy. [Delhi Univ. M.CA., 1991] 


41. The joint density of (X, Y) is: 
fix, =i +y), O<x<2,0<y <2. 
Find ’,,=£E (X” Y*) and hence find Corr (X, Y). 


4 = OF eee) a 7 sce 
AMS. [t',,= 2 | (r +2) (s + 1) o (r + 1)(s + 2) |r 11° 
(b) Find the'm.g.f. of the bivariate distribution : 
f(x, y) =1,0<@,y)<1 
= 0, otherwise 

and hence find r (X,.Y). 

Ans..M (t,, t2) = (e4t ~'1) (e2 - I(t, &); 4, #0, 2 40,7 X,Y) =0. 

42. Lét (X, Y) have joint density : 

es y) = ex + yy (0, «) (x) I (0, 0) 

Find Corr (X, Y). Are X and Y independent.? 

Ans. Corr (X, Y)=0: X and Y are independent. 

43. A bivariate distribution in two discrete random variables X and Y is 
defined by the probability generating function : 

exp [a(u -— 1) + b(v—-1)+cQu-1)(W-)), 

simultaneous probability of X = rnY = s, where r and s are integers being the 
coefficient of u’v*. Find the correlation coefficient between X and Y. 

Hint. Put u =e"! and, v = e'2 in exp [a(u — 1) + b(v- 1) + c(u—-1) (v- 1), 
the result will be the m.g.f. of a bivariate distribution and is given by 


M(t, t2) = exp [a(e" — 1) + b(e2 — i) + c(e’t- 1) (e2 - 1)) 


0M , | 2M = 
Wehave | Sit at, leans =d, t,2 bas = a(a + 1) 


ou | oe am) 4 [2]. 
Ot Oto, \.. 0 = ab +C, Oty |, =0 =b, Ot" i = 0 = b(b +; 1) 
lo = =0 t= 0 t, = 0 


So we have 
E(X) =a, E(X*) = a(a + 1), E() =, E(¥*) = b+ 1) and E(XY)=ab+c 
r(X, Y) = E(XY) — E(X) E(Y) Cc’ 


VIER) - fEO)P11EG) —(ECX))) Vab 
44, Let the number X be chosen at random from among the integers 1, 2, 
3, 4 and the number Y be chosen from among those at least as large as X. 
Prove that Cov (X, Y) = 588. Find also the regression line of Y on X. 
[Delhi Univ. B.Sc. (Maths. Hons.), 1990] 


Hint. PX =k)=isk=t,2, 3,4 and Y 2X. 
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P(YsylX= agsy= 1, 2, 3,4¢. y¥2x);3 
p (Y=y¥ pe 2) = hye 2, 3, 4 


P(Y=yIX=3)=5,y=3,4;P(VsylXe4) ety a4, 


The joint probability distribution can be obtained on using : 
P(X=x,Y=y) =P(X=x).P(Y=ylX=x). 


Cov (X, 5/8 15 
rX, Yen ee 
Sx Oy = (5/4) x (41/48) 
oO 
Regression line of Yon X: Y-E(Y)= = [X-E (X)] 
X 


45. Two ideal dice are thrown. Let X; be the score on the first dice ad X9, 
the score‘on the second dice. Let Y = max {X,, X>}. Obtain the joint-distribution 
of Yand X, and show that 


Corr (CY, X}) == 
2VRB 


46. Consider an experiment of tossing two tetrahedra.-Let X be the number 
of the down turned face of-first tetrahedron and Y, the larger of the two numbers. 
Obtain the joint distribution of X and Y and hencé-p (X, Y). 

Cov (X, Y) 3/8 ee 
Ans. p (X, Y) = = = 
Pee ox Oy Vga. V55I64 VII 
47. Three fair coins are tossed. Let X denote the number of heads on the first two 
coins and let Y denote the number of tails on the last two coins. 

(a) Find the joint distribution of X and Y. 

(b) Find the conditional distribution of Y given that X = 1. 

(c) Find Cov. (XxX, Y) 

Ans. Cov. (X, Y) =—-1/4. 

48. For the trinomial distribution of two random variables X and Y: 


fy) = - eee 


xiy!(n-x-y 
for x,y =0,1,2,...,nandx+ySn,p20,q20andp+q<S Il. 
(a) Obtain the marginal distribution of Y 

(b) Obtain E(XlY = y). 

(c) Find p (X,Y). 

Ans. (a) X'~ B (n, p), Y~ B (n, q) 


E Jey) ~; -y, P-) 
() (XY =y)~B(n-y, pho 
(Note : p+q# 1) 


 EX|Y=y)=(n-y) ( re 
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Ps hpi ee eee! Se 
(c) Cov (X, Y) = npg; p(X, Y) = Pence 


OBJECTIVE TYPE QUESTIONS 


I. Comm :nt on the following : 
(i) ryy =O = X anid ¥ are. mee pentent 
(ii) If ry > O then ry, _y >0, ry “y>O.and r_ xX, -Y >0 
(iii) ryy>0. = E(XY) > E(X) E(Y) ; 
(iv) Pearvon’s coefficient of correlation i is independent of origin but not 
of scale. 


(v) The numerical value of product moment correlation coefficient ‘r 
between two variables X and Y cannot exceed unity. 


Avi) If the correlation coefficient between ‘the. variables X and Y is zero 
then the correlation coefficient between X? arid Y? is also zero. 
(vii) If r > 0, then as X increases,_Y also increases. 
(viii) “The closeness, of relationship. between two variables is proportional 
tor.” 
(ix) r measures-every type of relationship between the two variables. 


II. Comment on the following values of ‘r’ (correlation coefficient) : 
1, — 0-95, 0, -1-64, 0:87, 0-32, ~1, 2:4. 
Il. (i) If pyy = —0-9, then for large values of X, what sort of values do we 
expect for Y? 
(it) If pyy = 0, what is the value of cov (X, Y) and how are X and Y related ? 


IV. Indicate the correct answer : 

(i). The coefficient of correlation will have positive sign when 
(a) X is increasing, Yy is decreasing, (b) both X and Y are increasing, 

(c) X is decreasing, Y is increasing, (d) there is no change in X and Y. 

(ii) The coefficient of correlation (a) can take any value between —1 and +1 
(b) is always less than ~1, (c) is always more than +1, (d) cannot be 

Zero. : 

(iii) The coefficient of correlation (a) cannot be positive, (6) cannot be 
negative, (c) is always positive, (d) can be both positive as well as 


negative. 
(iv) Probable error of r is 
] ~ r? 1+r 
(a) 0-6475 » (b) 0-6754 : 
Vn Vn 


(d) 0.67544 


(v) ‘The coefficient of correlation between X and Y is 0-6. Their covariance 
is 4- a La variance ae X is 9. Then the S.D. of Y is 
3 4:8 
Onn ok O7Ee3 + 78x 06° 9x 0-6" 
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(vi) The coefficient of correlation is independent of (a) change of scale only, 
(b) change of origin only, (c) both change’ of scale and origin, 
(d) neither change of scale nor change of origin. 
V. Fill in the blanks : 
@) The Karl Pearson coefficient of correlation between variables:X and Y 


(iz) Two independent variables are... ... 
(iii) Limits for correlation coefficient are... ... 

(iv) If r be the correlation coefficient between the random variables X and Y 

then the variance of X + Y is... ... 
() The absolute value of the product moment correlation coefficient-is less 
(vi) Correlation coefficient is i3nvariant.under changes of .. and’... ... 
Vi. How can you use scatter diagram to obtain an idea of extent and nature 
(direction) of the correlation coefficient ? < 


10-4. Calculation of the Correlation Coefficient for a Bivari- 
ate Frequency Distribution. When the data are considerably largé, they 
may be summarised by using a two-way table. Here, for each variable a suitable 
number of classes are taken, keeping in view the same considerations as in the 
univariate case. If there are n classes for X and m classes for Y, there will be in 
all’ 7m <n cells in the two-way table. By going through the pairs of values of X 
and Y, we can find the frequency for each cell. The whole set of cell frequencies 
will then define a bivariate frequency distribution. The column totals and row 
totals will give us the marginal distributions of X and Y. A particular column 
or row will be called the conditional distribution of-Y for given X or Of-X for 
given Y respectively. 


Suppose that the bivariate data on X and Y are presented in a two-way 
correlation table (shown on page 10-33) whéré there are m classes of Y placed 
along the horizontal line and » classes of X along a vertical line and f, is the 
frequency of individuals lying in the (, /)th ceil. 


Here LL, y) = 8) 
is the sum of the frequencies along any row and 
XS, y) =f). 
is the sum of the frequenciés along any coluran. We observe that 


a y) = UE, y) = Efe) = 280) = N 


t . ‘ 


Thus ' 
=~ 3 EExvt9= Meu 5 ») |= =y YX fe) 
Similarly : 
y =yLyAxy y=yby, gy), 
of EE Ex fx y¥t=h patsy -F2 
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BIVARIATE FREQUENCY TABLE (CORRELATION TABLE) 


t£@2 as 


Classes 


Mid Points 
x2 cow Xpeee 


v 


a 


Calculate the correlation-coefficient. 
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Solution. 
CORRELATION TABLE 


eae 
4 4 
35 | 30—40 . 


Let U = x ~ 19. va{(Y- 35/10) 
4=— Ni LU fle) = jog = 0-68, vay v a(v) = = 0.25 
Cov (u,v) “+ D> uvf(u, v) -—u "00" 52 ~ 0-68 x 0-25 = 0-35 
u v 
oy =x re eer 102 _ (0-68)? = 1-1576 
o/ =x v2 9(v) — y2= 12 _ (0.25)? = 1.6075 
(U,V) = are V) _ 0:35. =025 a 


Sy Ov ~ 41-1576 1-6075 : Zs 
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Since correlation coefficient is independent of change of origin and scale, 
r(X, Yy=r (UU, V) = 0-25 
Remark. Figures in circles in the table on page10-34 are the product 
ferms uvf(u, v) : 
Example 10-15. The joint probability distribution of X and Y is given 
below : . 


Find the correlation coefficient between X and Y. 
Solution. 


COMPUTATION OF MARGINAL PROBABILITIES 


3 
8 
2 
g 


E(X) = 2 xp(x) = 3, 5_1 
aL 


EX?) = Lx2p(x) = (-1)?x 34 12x 221 
Var (X) = E(X?) - [EQ)P =1-7= 7 
E(Y) = Ly gy)=0xi+ ] x E55 
E(¥*) = Ly’ g(y) = 0? x 3+ 12 x 4a} 
Var (Y) = E(Y) ~ (EQ)? = 1-1-4 
E(XY) = 0% Cl) xg+0% 1x 3+ l x (-1) x E+ 1x1 x? 
=-5+2=0 


Cov (X, Y) = E(XY) - E(X) E(Y) =0 ~ixis-i 
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: 1 : 
X,Y) =LN OO) Bad ad : 

rm! Oy Oy afi VIS GE 


EXERCISE 10(b) 


1. Write a brief note on the correlation table : 


The following are the marks obtained by ‘24 students in a class test of 
Statistics and Mathematics : 


Role No. of Students : 1 2 3 4 S$ = 6 7 8. 9 1011 13 
Marks in Statistics, > 15 0 1 3 16 2 18 53 4 #17 6 19 
Marks in Mathematics :13 1 2 7 § 9 12 9 17 16 6 18 


Roil No. of Students > 13 14 15 6 17 18 19 20 21 22 23 24 
Marks in Statistics °-14 9 § 13 10 13 11 21 #12 18 9 7 
Marks in Mathematics : 11 3 5 4 10 11 14 7 18 1515 3 

Prepare a correlation table taking the magnitude of each class interval as 
four marks and the first class interval as “equal to 0 and less than 4”. Calculate 
Karl Pearson’s coefficient of correlation between the marks in Statistics and 
marks in Mathematics from the correlation table. 

Ans, 0-5544.. 

2. An employment bureau asked applicants their weekly wages on jobs last 
held. The actual wages were obtained for 54 of them; and are recorded in the 
table below; x represents reported wage, y actual wage, and the entry in the table 
represents frequency. Find the correlation coefficient and comment on the 
significance of the computed value. [Four figure log table may be used]. 


[Calcutta Univ. B.Sc. (Maths. Hons.), 1986) 
3. Calculate the correlation coefficient from the following table :-— 
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4, (a) Find the correlation coefficient between age and salary of 50 workers 
in a factory : 


Age Daily pay in rupees 
(in years) 


L 160—169 170—179  180—189 190—-199 200—209 


(b) Fnd the coefficient of correlation between the ages of 100 mothers and 
daughters : 


Age of mothers Age of daughters in years (Y) Total 
in: years (X) - 5—10 10—15 15—20 20—25 25—30 


: [Madras Univ. B.Sc. (Main Maths.), 1991] 
5. Given the following frequency distribution of (X, Y) : 


NED 
[el [= 
Boon 
rel [= [om 


6. (a) Find the coefficient of correlation between X and Y for the following 
table : 


find: the frequency distribution 
of (U, V), wheré 


2:5 3 — 
What shall be the relationship 


between the correlation coefficients 
between X, Y, and U,V? 
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Calculate E(X), Var (X), 
Cov(X, Y) and r (X, Y). 


[Delhi Univ. M.A. (Eco.), 1991) 


(c) Let (X, Y) have the p.m.f. 
p(0, 1) = p(1,0)=3; pO, -1)=p(-l, 0)=¢. 
Find r(X, Y). Are X and Y independent ? For what values of k, X + kY and 


kX + Y are uncorrelated ? 


10-5. Probable Error of Correlation Coefficient. If r is the 
correlation coefficient in.a sample of n pairs of observations, then its standard 
e 1 _ yr? 


error is given by S.E(r) = aes 
- n 


Probable error of correlation coefficient,is.given by 


2 
P.E.(r) = 0.6745 x S.E. = 0674s (106) 
n 

Probable error is an old measure for testing the reliability of an observed 
€orrelation coefficient. The reason for taking the factor 0-6745 is that in a 
rormal distribution, the range pp) +0-6745 o covers 50% of the total area. 
According to Secrist, “The probable error of the correlation co-efficient is an 
amount which if added to and substracted from the mean correlation coefficient, 
produces amounts within which the chances are even that a coefficient of 
correlation from a series selected at random will fall.” 

If r < P.E.(r), correlation is not at all significant. If r > 6 P.E.(r), it is 
dctinitely significant. A rigorous method (¢-test) of testing the significance of an 
udserved corrclation coefficient will be discussed later in “tests of significance” 
m sampling (c.f. § 144-11]. 
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Probable error also enables us to find the limits within which the 
pulation correlation coefficient can be expected to vary. The limits are 
r+P.E(7). 
10°6. Rank Correlation. Let us suppose that a erollp of n individuals 
is arranged in order of merit or proficiency in possession of two characteristics A 
and B. These ranks in the two characteristics will, in general, be different. For 
example, if we consider the relation between intelligence and beauty, it is not 
necessary that a beautiful individual is intelligent also. Let (x;, yj); = 1,2,. 

n be the ranks of the ith individual in two characteristics A and B respectively. 
Pearsonian coefficient of correlation between the ranks x;’s and y,’s is called the 
rank correlation coefficient between A and B for that group of individuals. 

Assuming that no two individuals are bracketed equal in either 
classification, each of the variables X and Y takes the values 1, 2, ..., n. 


n+l 


Hence X= ya (142434. +ny= 


_ n(n + 1)(2n+1) n+ 1Y_ n2-T 
= 2 12 


2 
Ox 12 oy 
In general x; # y; . Let d; = x; - y; 


; d;=(x;-x)- Qi -y) (x =y) 
Squaring and summing over i from 1 to n, we get 


Ld? = X(%;-x)-O;-y)}? 


= L(x, -x ? + LOi,-y * -22G; =x); - 
Dividing both sides by n, we get 


1 Sd? = oy? + oy? - 2 Cov (X, ¥) = 0,2 + 0}? - 2p oxy/ 


where p is the rank correlation coefficient between A and B. 


lye 2 2» d/ 
di = 26;*-2p0,;7 => 1- —P * nay? 
rd 6 Dr 
=> p=1- oe = l n(n? - 1) .--(10-7) 


which is the Spearman’s formula for the rank correlation coefficient. 
Remark. We always have 


Ld;=Z(ej-y) = Lx; Ly =X - ye 0 (- % =) 


This serves as a check on the calculations. 
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10-6°1. Tied Ranks. If some of the individuals reccive the same rank in a 
ranking of merit, they are said to be ticd. Let us suppose that m of the 
individuals, say, (kK + 1)", (k + 2)", ...., (kK +m)" are tied. Then cach of these 1 ° 
individuals is assigned a common rank, which is the arithmetic mean of the 
ranks k.+ 1, k +2, ...., 4k + mm. ‘ 

Derivation of p (X, Y). We have : 


(XxX - X)(¥- ¥) Exy 
Ct) = 
[z(x-X)2. 5 (Y - yyz]!'2 VEx?. Ly? 
where x=X-X,y= Yay, 
If X and Y each takes the values 1, 2, ...., n, then we have 
X = (n+ 1)/2= Y 
2 2.< 
and nox? = Xx? Me = and no} = Ly? sa 30a) 
Also Td =E(X-Y)P=LD[(X- X)-(Y- YP aT (x-yy? 
=> Ld? = Ux? + Ly2-22 xy 
=> Lxy =} [2x2 + Ly? - La?] aC") 


We shall now investigate the effect of common ranking, (in case of ties), on 
the sum of squares of the ranks. Let S? and S,? denote the sum of the squares of 
untied and tied ranks respectively. 

Then we have : 

S2 =(k +1)? +(k +2)? +...4(k +m) 
= mk? + (12 +22 4... 4m?) + 2k.(1+2+..4m) 


m (m+ 1) @m + I) 
= mk? +fie* 7. = + mk (m+ 1) 


S;2 =m (Average rank)? 


=m [EEN 4 hs 2st ks yy ° 


Mn 


2 2 
=m(k+™+ *) = mio 4 NY im km +1) 


| ee 
S?- $2 =m te (2m +1)-3 (m+ 1] = nord 


Thus the effect of tying m individuals (ranks) is to reduce the sum of the 
squares by m (m* — 1)/12, though the mean value of the ranks remains the same, 
viz.,(n+ 1/2. © 

Suppose that there are s such sets of ranks to be‘ tied in-the X-serics so that 
the total sum of squares due to them is 

AY AY 
> >» m; (m7? —- 1) =15 » (m — m,) = Ty, (say) ... (10-74) 
i=] ! 


4 
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Similarly suppose that there are ¢ such sets of ranks to be tied with respect 
to the other series Y so that sum of squares due to them is : 


t t 
a, 2 mj. (mj? - 1) = 2, 2 (m; ° — m;') = Ty , (say) ---(10-76) 


Thus, in the case of ties, the new sums of squares are given by : 


2_ 
n Var’(X) =D xt -Ty = _ 7, 


24 
n Var’(Y) = Ey Ty MY 7, 


and n Cov’(x, Y) =3(2 x? -Ty + Ly? -—Ty - La?} [From (#4) 
_1| a(n? - 1) n(n2 — 1) 
=| ve Sane) ed 
2 =D faye Tp +E] 


24 
n= _ L [tT + Ty+ Za] 


ae -1) i" ae -1) i 
- Ty — Ty 
12 12 
2_ 
nr [sa iy aes Ty | 


s 2_ 1/2 7 12 
jae 1) _ 27] [aca 1) 9 ry] 


p(X, Y) = 


...(10-7c) 
where Ty and Ty are given by (10-7a) and (10-76). 

Remark. If we adjust only the covariance term i:e., Xxy and not the 
variances Gx? (or 2 x*) and Gy* (or Ly) for ties, then the formula (10-7c) 
reduces to : 

2 
ae) (Xd? + Ty + Ty) 


n(n? — 1)/€ 
6 [Xd?2 + Ty + Ty] 
n(n? — 1) : 


a formula which is commonly used in praotice for numerical problems. For: 
illustration, see Example 10-18. 


Example 10-16. The ranks of same 16 students in Mathematics and 
Physics are as follows. Two numbers within brackets denote the ranks of the 
students in Mathematics and Physics. 

(1,1) (2,10) (3,3) (4,4) (5,5) (6,7) (7,2) (8,6) (9,8) 
(10,11) (11,15) (12,9) (13,14) (14,12) (15,16) (6. 13). 


p(X, Y) 


=1- ...(10-7.2) 
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Calculate the rank correlation coefficient for proficiencies of this group in 
Mathematics and Physics. 


Solution. 


Ranks in 
Maths. (X) 


123 4 5 6 7 8 9 10 11 12 13 14 15 16] Tota, 


Ranks in 
Physics(Y) 


d=X-Y 


1103 4 5 72 6 8 11:15 9 14 12 16 13 


0-8 0 0 0-1 5 2 1 -1 -4 3-1 2-1 3 


06400 0125 4 1 214169 «14 «21 «<9 


Rank cortclation coefficient is given by 
6 > a2 6 x 136 1 4_ 
p= thn? - 1) 17 16x 255~' 575798 

Example 10-17. Ten competitors in a musical test were ranked by the 
three judges A, B and C in the following order : 
Ranks by A: 1 6 5 10 3 2 4 9 7 8 
Ranks byB: 3 5 8 4 7 #+J10 2 | 6 9 
Ranks byC: 6 4 9 8 41 2 #3 #10 #5 7 

Using rank correlation method, discuss which pair of judges has the neares, 
approach to common likings in music. 

Solution. Here n = 10 


Ranks | Ranks | Ranks 
by B| by C| -d d, d, d? 
' (Y) (Z) | = X-Y| =X-Z |] =Y-Z 


by A 
(X) 


a|aci 


1 3 25 9 
6 5 4 1 
5 | al 9 1 
i | a | es 16 
3 7 1 36 
2 10 2 64 
4 2 3 1 
9 |] 1 | 10 81 
7/61] 5 i 1 
gs |o|7 2 1 4 
|| Paepare Pataca 
6%, 6x200 . 40. 7 
p% Y= 1-72 ay) =} -10x 99 = 1-33 = 733 


3 
_, 62d? | 6x60 _ , 4 7 
Pw 2a1- vo n=! -lox99e iE 
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§> d;* 6 x 214 49 
pY, Z)= 1 Fz y= | -i0 x 99 =~ 165 

Since p(X, Z) is maximum, we conclude that the pair of jydges A and C 
has the nearest approach to common likings in music. 

10-6-2. Repeated Ranks (Continued). If any two or more 
individuals are bracketed equal in any classification with respect to characteristics 
A and B, or if there is more than one item with the same value in the series, 
then the Spearman’s formula (10-7) for calculating the rank correlation 
coefficient breaks down, since in this case each of the variables X and Y does 
not assume the values 1, 2, ..., n and consequently, x # y. 

In this case, common ranks are given to the repeated items. This common 
rank is the average of the ranks which these items would heve assumed if they 
were sightly different from each other and the next item will get the rank next to 
the ranks already assumed. As a result of this, following adjustment or 
correction is made in the rank correlation formula [c.f. (10-7c) and (10-7d)). 


2 
In the formula, we add the factor mor) to > d?, where m is the 


number of times an item is repeated. This correction factor is to be added for 
each repeated value in both the X-series and Y-series. 


Example 10-18. Obtain the rank correlation coefficient for the following 


data: 

», ¢ > 68 64 75 SO 64 80 75 40 55 64 
Y > 62 S8 68 45 81 60 68 48 50 70 
Solution. 


CALCULATIONS FOR RANK CORRELATION 
Rank X Rank Y 


»¢ Y (x) (y) d =x-Yy d2 
68 62 4 5 = | 1 
64 58 6 7 ~1 1 
75 68 2:5 3-5 ~1 1 
50 45 9 10 =] 1 
64 81 6 1 5 25 
80 60 1 6 -5 25 
75 68 2-5 3-5 -1 1 
40 48 10 9 1 1 
55 50 8 8 0 0 
64 70 6 2 4 16 
Yd=0 Yde=72 


In the X-series we see that the value 75 occurs 2 times. The Common rank 
given to these values is 2-5 which is the average of 2 and 3, the ranks which 
these values would have taken if they were different. The next value 68, then 
gets the next rank which is 4. Again we see that value 64 occurs thrice. The 
common rank given to it is 6 which is the average of 5, 6 and 7. Similarly in 
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the Y-series, the value 68 occurs twice and its common rank is 3-5 which is the 

a‘*rage of 3 and 4. As a result of these common rankings, the formula for ‘p’ 
ns. 

has to be corrected. To ¥ d? we add nin = for each value repeated, where 


m is the number of times a value occurs. In the X-series the correction is to be 
applied twice, once for the value 75 which occurs twice (m = 2) and then for the 
value 64 which occurs thrice (m = 3). The total correction for the X-series is 


2(4-1) | 3(9-1)_5 
12 io. 2 


Similarly, this correction for the Y-series is “+=? =1 | as the value 68 


12 2 
occurs twice. 
5 1 
of Eat + 5) + A 
Thus p=l1- = 1 802 +3) «| 0.545 


n(n2 — 1) 10 x 99 
10:6°3. Limits for the Rank Correlation Coefficient. 
Spearman’s rank correlation coefficient is given by 


6x d? 


ix 1 


p11 
Rn 
‘p’ is maximum, if + ad? is minimum, i.z., if each of the deviations d; is 
i= 1 


minimum. But the minimum value of d; is zero in the particular case x; = y,, 
i.e., if the ranks of the th individual in the two characteristic are equal. Hence 
the maximum value of p is + lie, p< 1. 


Rn 
‘p’ is minimum , if % d? is maximum, i.e., if each of the deviations d; 


ix 
is maximum which is so if the ranks of the n individuals .in the two 
Characteristics are in the opposite directions as given below : 


ses(®) 


Case 1. Suppose n is odd and equal to (2m + 1) then the values of d are : 
d:2m,2m -2, 2m -4,..., 2, 0, -2, —4, ... —- (2m — 2), —2m. 


.. x d?2 =2{(2m)2 + (2m —2)? +... + 42422} 


= 8{m?+(m-1)?+...+ ?} = 8m ma an tt 
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6 2 d? 8 + 1)(2m + 1 
2 ged _, _ _ 8m(m + D@m+1)__ 
Bee Ob Ge iy a...” 


8m(m + 1) Ss Sm(m + 1) _ 


rs (4m2+4m)”~ ~~ 4m(m + :1)— 
Case II. Let n be even and equal to 2m, (say). 
Then the values of d are 


(2m — 1), (2m — 3), ..., 1,1, -3, ...» —(2m — 3), —(2m — 1) 
Y dz =2{(2m - 1)? + 2m - 3)? +... + 12} 
= 2[ {(2m)* + (2m — 1)? + (2m — 2)? + ... + 27 + 17) 
—~{(2m)* + (2m — 2)? + ... + 42 + 27} 
= 2[12 + 27 +... + (2m)? ~ (22m? + 22(m —1)? + ... + 273] 


ke (2m + 1)(4m + 1) mim + 1)(2m + un 
=2 6 -4 6 


=m [(2m + 1) (4m + 1) — 2(m + 1)(2m + 1)] 


-*n [ (2m + 1)(4m + 1 -2m - 2)] 
-m (2m + 1)(2m ~ 1) =?mGer LD) 


= Oedi* 8 44m? = 1) _ 
pth? —1) > 7 Um(4m? — 1) > 
Thus the limits for-rank correlation coefficient are given by -1 < p < 1. 
Aliter. For an altemmate and simpler proof for obtaining the minimum 
value of p, from (*) onward, proceed as in Hint to Question Number 9 of 
Exercise 10(c). 
Remarks on Spearman's Rank Correlation Coefficient. 


1. Sd = x- Ly = n(x ~—y) =0, which provides a check for numerical 
calculations. 

2. Since Spearman's rank correlation coefficient p is nothing but- 
Pearsonian correlation coefficient between the ranks, it can be interpreted in the 
same way as the Karl Pearson’s correlation coefficient. 


3. Karl Pearson’s correlation coefficient assume that the parent population 
from which sample observations are drawn is normal. If this assumption is 
violated then we need a measure which is distribution free (or non-parametric). A 
distribution-free measure is one which does not make any assumptions about the 
parameters cf the population. Spearman’s p is such a measure (i.e., 
distribution-free), since no strict assumptions are made about the form of the 
population from which sample observations are drawn. 


4. Spearman’ formula is easy to understand and apply as compared with 
Karl Pearson’s formula. The value obtained by the two formulae, viz., 
Pearsonian r and Spearman's p, are generally different. The difference arises due 
to the fact that when ranking is used instead of full set of observations, there is 
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always some loss of information. Unless many ties exist, the coefficient of rank 
correlation should be only slightly lower than the Pearsonian coefficient. 

5. Spearman’s formula is the only formula to be used for finding 
correlation coefficient if we are dealing with qualitative characteristics which 
cannot be measured quantitatively but can be arranged serially. It can also be 
used where actual data are given. In case of extreme observations, Spearman’s 
formula is preferred to Pearson’s formula. 


6. Spearman’s formula has its limitations also. It is not practicable in the 
case of bivariate frequency distribution (Correlation Table). For n > 30, this 
formula should not be used-unless the ranks are-_given, since in the contrary case 
the calculations are quite time-consuming. 


EXERCISE 10(c) 
1. Prove that Spearman’s rank correlation coefficient is given by 


d;* 
1 - o , where d; denotes the difference between the ranks of ith 
individual. 


2. (a) Explain the difference between product moment correlation 
coefficient and rank correlation coefficient. 

(b) The rankings of ten students in two subjects A and B are as follows : 

A: 3 5 8 4 7 to 2 1 6 9 

Bs: 6 4 9 8 1 2 3 10 5 7 

Find the correlation coefficient. 

3. (a) Calculate the coefficient of-correlation for ranks. from the following 


(X, Y): (5, 8); (10, 3), (6, 2), (3, 9), = (19, 12), (5, 3), 
(6, 17), (12, 18), (8, 22), (2, 12), (10, 17), (19, 20). 
[Calicut Univ. B.Sc. (Subs. Stat.), Oct. 1991) 
(6) Ten recruits were subjected to a selection test to ascertain their 
Suitability for a certain course of training. At the end of training they were given 
a proficiency test. 
The marks secured by recruits in the selection test (X) and in the proficiency 
test (Y) are given below :— 


Serial:No. : 1 2 3 4 5 6 7 8 9 10 
X : 10 1§ 12 17 13 16 24 14 22 £20 
Y : 30 42 45 46 33 34 40 35 39 38 


Calculate product moment correlation coefficient and rank correlation 
coefficient. Why are two coefficients different ? 

4, (a) The I.Q.’s.of a group of 6 persons were measured, and they then sat 
for 2 certain examination. Their I.Q.’s and.examination marks were as follows : 


Person : A B: C D E F 
1.Q. : 110 100 140 120 80 90 
Exam. marks : 70 60 80 60 10 20 


Compute the coefficients of correlation and rank correlation. Why are the 
correlation figures obtained different ? 


Ans. 0-882 and: 0:9. 
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The difference arises due to the fact that when ranking is used instead of the 
full set of observations, there is always some loss of information. 
(b) The value of ordinary correlation (7) for the following data is 0-636 :— 
X: -05 +14 = -24 30 -47 +52 -57 -61 -67 + -72 
Y:; 7-08 1-15 1-27 1-33 1-41 1-46 1-54 2-72 4-01 9-63 
(i) Calculate Spearman’s rank-correlation (p) for this data. 
(ii) What advantage of p was brought out in this example ? 


4. Ten competitors in a beauty contest aré ranked by three judges as 
follows : 


Competitors 
Judges : 1 2 3 4 5 6 7 8 9 10 
A 6 5 3 10 2 4 9 7 8 1 
B 3D 8 A 7 ~=©10 2 6 9 3 
Cc 4 9 8 1 2 3 10 5 7 6 


Discuss which pair of judges has the nearest approach to common tastes of 
beauty. 


5. A sample of 12 fathers and their eldest sons gave the following data 
about their height in inches : 

Father: 65 63 67 64 68 62 70 66 68 67 69 71 

Son : 68 66 68 65 69 66 68 65 7i 67 68 70 

Calculate coefficient of rank correlation. (Ans. 0-7220) 


6. The coefficient of rank correlation between marks in Statistics and marks 
in Mathematics obtained by a certain group of students is 0-8. If the sum of the 
squares of the difference in ranks is given to be 33, find the number of student in 
the group (Ans. 10). [Madras Univ. B.Sc., 1990] 


7. The coefficient of rank correlation of the marks obtained by 10 students 
in Maths and Statistics was found to be 0-5. It was later discovered that the 
difference in ranks in two subjects obtained by one of the students was wrongly 
taken as 3 instead of 7. Find the correct coefficient of rank correlation. 


| 6a 
_ 290 _ gn, 
=> Ldt= ey 27 825 


Since one difference was wrongly taken as 3 instead of 7, the correct value 
of Yd is given by 


Corrected ¥ ad? = 82-5 — (3)? + (7)? = 122-5 


6 x 122-5 


Corrected p=1- 10 x 99 


= 0-2576 
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8. If d; be the differencé in the ranks of the ith individual in two different 

characteristics, then show that the maximum value of 2% 4d;? is > (n? —n), 
i=l 


Hence or otherwise, show that rank correlation coefficient lies between —1 and 
+1. [Delhi Univ. B.Sc. (Maths. Hons.), 1986] 


9, Let x,, X, ..., X, be the ranks of n individuals according to a character A 
and y,, Yo. .--» Y, be the ranks of the same individuals according to other 
character B. Obviously (x;, x2, ..23%,)-and (y,, Y2, «-+» Yx) are permutations of 1, 
2,..., n. It is given that x; + y,;= 1+, fori=1,2,..., 2. Show that the value 
of the rank correlation coefficient p between the characters A and B is —1. 


Hint. Weare given x+y; = n+1Vi=1,2,...,n 
Also xj-y; = di 
2x; =n+1l+di => d;=2x;-(n + 1) 


> dz = > [4x,7 + (n + 1)? -2(n + 1)2x;] 
j iol 


| 


_ , a(n + 1)(2n + 1) 2 A(n+ I)a(n + 1) 
=4 6 +n(n + 1) 7 


_ n(n? - 1) 
7 3 


6 > d? 
1 ee 2 See 
n(n? - 1) 


Remark. From Spearmans’ formula we note that p will be minimum if 
XY d;* is maximum, which will be so if the ranks X ard Y are in opposite 
directions as given below : 


p = 1 


This gives us 


X,+yene+1,t= 1,2, ...,0n. 
Hence the value of p = ~1 obtained abpve is minimum value of p. 


10. Show that in a ranked bivariate distribution in which no ties occur and 
in which the variables are independent 


(a) & d? is always even, and 
8 


(b) there are not more than : (13 ~n) + 1 possible values of r. 
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11. Show that if X, Y be identically distributed with common probability 
mass function: P(X=k)=~, fork=1,2,...,N;N >1, 


then py.y, the correlation coefficient between X and Y, is given by 


6E(X - Y)? 
ve N2-1 


[Delhi Univ. B.Sc. (Maths Hons.), 1992) 


10-7. Regression. The term “regression” literally means “stepping 
back towards the average” . It was first used by a British biometrician Sir Francis 
Galton (1822—1911), in connection with the inheritance of stature. Galton 
found that the offsprings of abnormally tall or short parents tend to “regress” or 
“step back” to the average population height. But the term “regression” as now 
used in Statistics is only a convenient term without having any reference to 
biometry. 


Definition. Regression analysis is a mathematical measure of the average 
relationship between two or more variables in terms of the original units of the 
data. 


In regression analysis there are two types of variables. The variable whose 
value is influenced or is to be predicted is called dependent variable and the 
variable which influences the values or is uséd for prediction, is called 
independent variable. In regression analysis independent variable is also known 
as regressor or predictor or explanatory variable while the dependent variable is 
also known as regressed or explained variable. 


10-7-1. Lines of Regression. If the variables in a bivariate 
distribution are related, we will find that the points in the scatter diagram will 
cluster round some curve called the “curve of regression” . If the curve is a 
straight line, it is called the line of regression and there is said to be linear 
regression: between the variables, otherwise regression is said to be curvilinear. 


The line of regression is the line which gives the best estimate to the value 
of one variable for any specific value of the other variable. Thus the line of 
regression is the line of “best fit” and is obtained by the principles of least 
squares. 

Let us suppose that in the bivariate distribution (x;, y;); i = 1,2, ...,n; Y 
is dependent variable and X is independent variable. Let the line of regression of 
Yon xX be Y=a+ bX. 


According to the principle of ‘least squares, the normal equations ‘for 
estimating a and b are {c.f. (9-2a)} 
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2 i =xnat+b > x; ..-(10-8) 
i= i=l 


i= 


and r Xi =a Bs itd a we .. (10-9) 


From (10-8) on dividing-by n, we get 

y =a+bx ...(10-10) 
Thus the line of regression of Y on X passes through the point (, y). 
Now 

A. fee av | —— 
Hi = Cov (X,Y) => 2 tii Se 2 xii SMiitxy — ...(10-11) 
= j= 
an R 


Also oy? => Da?-¥2? = 2 ¥ aPeoe+k? ...l0lla) 


iz] iz 


Dividing (10-9) by n and using (10-11) and (10-11a), we get 


bi +x y=ax + b(o,? +x?) ...(10-12) 
Multiplying (10-10) by x and then subtracting from (10-12), we get 
a — Hou Ch, 
i= boy? => G2 ...(10-13) 


Since ‘b’ is the slope of the line of regression of Y on X and sincé the line 
of regression passes through the point (x , y ), its equation is 


Y -y=b(X-%) = ot (X -X) (10-14) 
= Y-y = rot (xX -x) ,..(10-14a) 


Starting with the equation X =A + BY and proceeding similarly or by 
simply interchanging the variables X and Y in (10-14) and (10-14a), the 
equation of the line of regression of X on Y becomes 


x -x =H yy -7) -..(10-15) 

oy? 
= X -x =r“k iy -7) ...(10-15a) 

Ory 

Aliter. The straight line defined by 
Y =a+bXx .- (i) 
and satisfying the residual (least square) condition 

S =E [((Y -a - 6X?) = Minimum ass) 


for vartauons in a and b, is called the line of regression of Y on X. 


The necessary and sufficient conditions for a minima of S, subject to 
variations in a and b are : 
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() =o, SP=0 and aC) 
aS = aS 
: da * dadb 925 
Gi) A= 926 525 >0O and 54229 2g. aa) 
0bda db 
Using (*), we get 
0S ss 
oa = -2E (Y¥-a—-bX]= wo (dis) 
8 = -—2 E [X(Y -a- bX)}=0 ...(iv) 


=>  E(Y) =a+bE(X)...(v) and E(XY) =aE(X) + bE (X2) ...(vi) 

Equation (v) implies that the line (i) of regression of Y on X passes 
through the mean value [E(X), E(¥)]. 

Multiplying (v) by E(X) and substracting from (vi), we get 


E(XY) - E(X)E(Y) = b[E(X *) - {E(X))?] 


=> Cov (X,Y =bo;? => b= Cove. roy .-.(vii) 
O,* Ox 


Subtracting (v) from (i) and using (vii), we obtain the equation of line of 
regression of Y on X as: 


Y -E(Y) = a IX -E(X)] > Y-E(Y) = = [X -E(X)] 


Similarly, the straight line defined by X = A + BY 
and satisfying the residual condition 
E(X -A —BY)* = Minimum, 
is called the line of regression of X on Y. 
Remarks 1. We note that 
07S 


Fie = 2>0, and 
aS _ 02S 
5p = 2E(X*) and aad = = 2E(X) 
Substituting in (**), we have 3 ri 
as as ( 27S 
= ae * ap? \ aadb 


= 4 [E(X2) - (E(X))?] = 4. 0,2 > 0 


Hence the solution of the least square equations (iii) and (iv), in fact, 
provides a minima of S. 


2. The regression equation (10-14a) implies that the line of regression of Y 
on X passes through the point (x, y ). Similarly (10-15a) implies that the line 
of regression of X on Y also passes through the point (x, y ). Hence both the 
lines of regression pass through the point (x, y ). In other words, the.mean 
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values (x, y ) can be obtained as the point of intersection of the two regression 
lines. 

3. Why two lines of Regression ? There are always two lines of regression 
one of Y on X and the other of X on Y. The line of regression of Y on x 
(10-14a) is used to estimate or predict the value of Y for any given value of x, 
i.e., when Y is.a dependent variable and X is an independent variable. The 
estimate so obtained will be best in the sense that it Will have the minimun, 
possible error as defined by the principle of least squares. We can also obtain an 
estimate of X for any given value of Y by using equation (10-14a) but the 
estimate so obtained will not be best since (10-14a) is obtained on minimising 
the sum of the squares of errors of estimates in Y and not in X. Hence tg 
estimate or-predict X for any given value of Y, we use the regression equation of 
X on Y (10-15a) which is derived on minimising the sum of the squares of 
errors of estimates in X. Here X is a dependent variable.and Y is an independent 
variable. The two regression equations are not reversible or interchangeable 
because of the simple reason that the basis and assumptions for deriving these 
equations are quite different. The regression equation of Y on X is obtained on 
minimising the sum of the squares of the errors parallel to the Y-axis while the 
regression equation of X on Y is obtained on minimising the sum of squares of 
the errors parallel to the X-axis. 

In a particular case of perfect correlation, positive Or negative, i.e., r +1, 
the equation of line of regression of Y on X becomes : 


= re] Z 
Y-y=t oy 7) 


= 2 Gow (10-16) 


which is same as (10-16). 

Hence in case of perfect correlation, (r = + 1), both the lines of regression 
coincide. Therefore, in general, we always have two lines of regression except in 
the particular case of perfect correlation when both the lines coincide and we get 
only one line. 

10-7-2. Regression Curves. In modern terminology, the conditional 
mean E(Y | X = x) for a continuoiis distribution is called the regression function 
of Y on X and the graph of this function of x is known as the regression curve 
of Y on X or sometimes the regression curve for the mean of Y. Geometrically, 
the regression function represents the y co-ordinate of the centre of mass of the 
bivariate ‘probability mass in the infinitesimal vertical strip bounded by x and 
“x'+ dx. 


s 
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Similarly, the regression function of X on Y is E (X | Y = y) and the graph 
of this function of y is called the regression curve (of the mean) of X on Y. 


In case a regression curve is a straight line, the corresponding regression is 
‘d to be linear. If one of the regressions is linear, it docs not however follow 
that the other is also linear. For illustration, See Example 10-21. 


Theorem 10-4. Let (X, Y) be a two-dimensional random variable with 
E(X) = X, E(Y) = Y, V(X) = oy?, V(Y) = oy* and let r = r(X, Y) be the 
correlation coefficient between X and Y. If the regression of Y on X is linear 
then 


E(Y |X) = Y+rh (x - x) ...(10-16a) 
x 
Similarly, if the regression of X on Y is linear, then 
E(X\Y) =X + a (Y -¥) ...(10-16b) 
Proof. Let the regression equation of Y on X be 
E(Y (x) =a + bx (1) 
But by definition, 
x oo 
E(Y |x) =" yfQlx)dy= y LEY ay 
—0o 7 00 Fx(x) 
1 oS 
Ae) J y fx, y) dy =a+ bx ...(2) 


Multiplying both sides of (2) by f(x) and integrating w.r.t. x, we get 


| . | . y f(x, y) dydx =a " Ix(x) dx + b | . x fielx)dx 


oo — 00 


=> | y (| fx y) dx|dy = at bE(X) 


oo 
= | yfr)dy =a + bE(X) 
i.e., E(Y)=a+bE(X) = Y =a+bX ..:3) 
Multiplying both sides of (2) by x fy(x) and integrating w.r.t. x, we get 


[- \" xy f(x, y) ay dx =a | X fx(x) dx +b | x? fx(x) dx 


—oo 


= E(XY) =a E(X) + bE (X2) 
= My +X ¥ =aX + b(o,y? + XY ...(4) 
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[fy = E(XY) -EQOE(Y) = E(XY)- XY ; 
Gy? = E(X?) - (E(X)}? = E(X?) - X? 


Solving (3) and (4) simultaneously, we get 


Substituting in (1) and simplifying, we get the required equation of the line 
of regression of Yon X as 


E(Y|lx) =Y + oe eX) 

= E(YIX) =¥ +55X- X) 

= E(YIX) =¥ +r2(X-X) 
Ox 


By starting with the line E (X | y) = A + By and proceeding similarly we 
shall obtain the equation of the line of rca 1n of X on Y ar 


on 


E(X|Y) =X + soe Y)= X+rcky- ¥) 


Example 10°19. Given 

f(x, y) = xeX"" +), x 20, y 20, 
find the regression curve of Y on X. (B.H. Univ. M.Sc., 1989] 
Solution. Marginal p.d.f. of X is given by 


f, @) || fix y) dy = | xe +) dy 
0 0 


Go 
oo 


en*Y 


= xe* | ee) dy = xe* 
0 


=e*,x20 
Conditional p.d.f. of Y on X is given by 


fy 20) wy 
Soy |x) Fix) yey  e xy 20. 


The regression curve of Y on X is given by 


oo oo 


=E(YIX=x)= | yyts) dy = | xen ay 
0 0 
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=x (| +] Pal =o+ Se 


=> x=. 


2 | mne 


1.é., | 


which is the equation of a rectangular hyperbola. Hence the regression of Y on 
X is not linear. 


Example 10-20. Obtain the regression equation of Y on X for the 
following distribution : 


= _—_). > 
FEN =G PrP ( 1+ x ey 
Solution. Marginal p.d.f.‘of X is os by 


eo 


fi = i fix, y) dy = (+ cee I. ye ~Wl+z) dy 
-awot .12.(1 +x)? (Using Gamma Integral) 
a sae 
=(1 +x)?’ * = 
The conditional p.d.f. of Y (for given X) is 
_ fy) __y (- 7) 
fix) = fi  (d+xn?oP Wwe es 


Regression equation of Y on X is given by 


y=E(YIX) = -|- y fy |x) dy = (1+ Tape) ee a 
res T3. (l+x) [Using Ganima Integral] 
> y =2(1 +2) [-- T3=2!=2] 


Hence the regression of Y on X is linear. — 
Example 10-21.. Let (X, Y) have the joint p.df. given by 
eh Lifly|<xmO<x<l1 
fx Y) = 1 6. otherwise 


Show thatthe regression of Y on X is linear but regression of X on Y is 
not linear. 


Solution. lyi<x => -x<y<x and x>Ilyl. 
The marginal p.d.f’s f,(.) of X and f2(.) of Y are given by : 
x x 


fia) = | f(x, y) dy = l. dy =2x;0<x<1 
—Xx —X 


1 1 
0) = | fx, ) de = | ldx=1-lyl;-l<y<l 
lyl ly] 
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f, @ly) Ste Topi sy<ho<x<l 


see (al. Peue tiger SI 
aylhay 
ee | 
ie 
fo (y!x) ee oe O<x<lilyt<x 


‘E(Y |X =x) -| y .foly |x) dy = [z - day. lye = 


—X 


Hence the curve of regression of Y on X is y = 0, which is a straight line, 


E(xX\lY=y) = Jes (xl y)dx 


1 
ol Ve 
EX \Y=y) = Ne Geer \enaq ry O<¥<! 


1 
l | 
ae —— k=. ..-1 
and = E(XIY =) Gest. 7+)’ <y<0 
Hence the curve of regression of X on Y ts 


1 
Oe ye 
2(1 - y) 


2(1 + y)’ 
which is not a straight line. 
Example 10-22. Variables X and Y.have the joint p.df. ' 


fix y) =3 (x+y), O0<xs1,0y ¥2., 


-l<y<0O, 


Find: 
(i) r(X, Y) 
(ii) The two lines of regression 
(iii) The two regression curves for the means. 


Solution. The marginal: p.d.f.’s.of X and Y are given by : 


2 y 2 
fix)’ = fre y) dy = Jee» dy aay +5 , 


=>: fi(x) =(1+x);0Sx<1 .. (1) 
1 ; | x? ah 
FAY) = [sornde=3] crndend E+ xy 


= fh) = (3+ ty }iosys2 a 
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The conditional distributions are given by : 


af, y) _ 
fix) sft 23 (FS 


_ Ly) _ 2(%+ y) 
falx ly) ad fly) ~ 4 + 2y ..-(3) 


2 
1 
E(Yix) = [> -f3 (y Ix) dy= 21 +2) Jo + y)dy 


1 j=. el 3x +4 
“21 + x) yzo0  3(x + 1) 


Similarly, we shall get 


1 
E(X ly) = | as (xly)dx= JT a 
(iii) Hence the-regression curves for means are’: 
= __3x + 4 a __2+ 3y_ 
y = E(vix) = 3 (x +1) and x =E(Xly) = 3(1 4 2y)* 
From the marginal distributions we shall get 


1 i 
EM) = F "x fiG) de=2, = -| Pf(de= 5 
0 


=o REM) = Oy = 7g -(5) =3 i 


Similarly, we ai get 


2 + 3y 
1, (P+ xy) das 2y) 


2 
Bx) =4L B= 38 o=2- (2) -8 


2 
Also E(XY) f Pose nano} [ | c2y+299 duay: 
040 040 


{Se} re {fe Lro } 


1f1 1, 8]_2 


Cov (X, ¥) = BUKY) ~ EW) E()=3- 5x Ba zt 
1 
PP 12 
() rk, Yy =U, 8h _ (55) 
yr [43 23 
162 81 


(ii) The two lines of regression are : 


Y -E(Y) = x - E(X)) = ¥, - "G=-G(*+3) 
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ond X E(x), =<£X EY y_ ey = x -3=-a ait 
oy 9 23 9 
10-7-3. Regression Coefficients. ‘b’, the slope of the line of 
regression of Y on X is also called the coefficient of regression of Yon X. | 
represents the increment in the value of dependent variable Y corresponding to a 
unit change in the value of independent variable X. More precisely, we write 


byy = Regression coefficient of Y on X = gos 7 ...(10-17) 


Similarly, the coefficient of regression of X on : nies the change in 
the yalue of variable X corresponding to a unit change in the value of variable 
Y and is given by 


byy = Regression coefficient of X on Y = =a = i ..-(10-17a) 
107-4. Properties of Regression Coefficients. 


(a) Correlation coefficient is the geometric mean between the regression 
coefficients. 


Proof. Multiplying (10-17) and (10-17a), we get 


Oy Oy _ { 
byy x b =fr— xr— 
XY YX re} C 


Pe Fst V byy xX byyx ...(10-18) 
Remark. We have 
Bi — Hin — Mar 
Sy. Gy’ bx = G3 and bx = G2 
It may be noted that the sign of correlation coefficient is the same as that of 
regression coefficients, since the sign of each depends upon the co-variance term 
l4,- Thus if the regression coefficients are positive, ‘r’ is positive and if the 
regression coefficients are negative ‘r’ is negative. 


From (10-18), we have 
r=t Vbyy x byx 


the sign to be taken before the square root is that of the regression coefficients. 
(b) If one of the regression coefficients is greater than unity, the other must 
be less than unity. 
Proof. Let one of the regression coefficients (say) byy be greater than 
unity, then we have to show that by; < 1. 


r= 


Now SiS. ea (4) 
byx 

Also rs = byx . byy <1 

Hence byy < = <1 [From (*)] 


(c) Arithmetic mean of the regression coefficients is greater than the 
correlation coefficient r, provided'r > 0. 
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Proof. We have to prove that + (byx + byy) 2r 


1 Oy Oy Gy Sx 
or 2(" oy —+Fr ror)e? or Sy op" («> r>Q) 
=> Cy" Sus —-26ydy2 0 i.e, (oy —Oy)? 20 


which is always true, since the square of a real quantity is 2 0. 
(d) Regression coefficients are independent of the change of origin but not 


of scale. 
Procf. Let U =A S| -1* => X=a+t+hU, Y=b+kV, 


where a, b, h (> 0) and k (> 0) are constants. 
Then Cov (X, Y) =hk Cov (U, V), 6y* = h?6;7 and oy = k’o,* 


bes _ Bi _ Ak cov (U, V) 


~ Oy his? 
_& covW, Vk, 
~ hh? 67 =n? 


Similarly, we can prove that 
by y°= (h/k) byy 
10-:7:5. Angle Between Two Lines of Regression. Equations of 
the lines of regression of Y on X, and X on Y are 


i OVS oe ee Oheg 
ae is x) and X-x reg, y) 


o o 
Slopes Of these lines are r. = and = respectively. If 6 is the angle 


between the two lines of regression then 
Sy... OY 


% 1 - r2 OxOy - 
=a? ——— |-> (10: 
8 = tan (5,2 ion } (10-19) 


Case‘(i). (7 =0).Ifr=0,tan@=0 => O=7 

Thus if the two variables are uncorrelated, the lines of regression become 
perpendicular to each other. 

Case (ii). (7 = +/). Ifr=+l1,tanO0=0 => O=Oorn. 

In this case the two lines of regression either coincide or they are parallel 
to each other. But since both the lines of regression pass through the point 
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(x ,y ), they cannot be parallel. Hence in the case of perfect correlation, 
positive or negative, the two lines of regression coincide. 

Remarks 1. Whenever two lines intersect, there are two angles 
between them, one acute-angle and-the otfier obtuse angle. Further tan 0 > 0 if 
0< 6 < 7/2, 1.e., 8 is an acute angle and tan 0 < 0 if 1/2 < 8 <7, i.e., 8 is an 
obtuse angle and since 0 < r? < 1, the acute angle (6;) and obtuse angle 0, 
between the two lines of regression are given by 


0 - 
0, = Acute angle = tan” —oxer — ,r>o0 
O° + Oy* r 


> = tan-1 J-OX + OY a | 
and 8, = Obtuse angle aur! | SES, : ,r>od 

2. When r = 0,i.e., variables X and Y are uncorrelated, then the lines of 
regressions of Y on X and X on Y are given-respectively by : [From (10-14a) 
and (10-15Sa)} ys 


Y=Yand X=X, (oy 
as shown in the adjoining diagram. * ~' 
Hence, in this case (r = 0), the lines 
of regression are perpendicular to 
each other and are Nel to X- axis = 
and aitierespectively. : (x,0) Xx 

3. The fact that if r = 0 (variables uncorrelated), the two lines of regression 
are perpendicular to each and if r= +1, 6=0,4.e., the two lines coincide, leads 
us to the conclusion that for higher degree of correlation between the variables, 
the angle between the lines is smaller, i.e.,. the two lines of regression are 
nearer to each other. On the other hand, if the lines of regression make a larger 
angle, they indicate a poor degree of correlation between the variables and 
ultimately for 9 = /2,r =0,i.e., the lines become perpendicular if no 
correlation exists between the variables. Thus by plotting the lines of regression 
on a graph paper, we can: have an. approximate idea about the degree of 
correlation between the two variables under study. Consider the following 
illustrations : 


TWO LINES TWO LINES TWO LINES . TWOLINES TWO LINES 
COINCIDE COINCIDE PERPENDICULAR APART(LOW APART (HIGH 
( =-1) (7 =+1) (r=0) DEGREE OF DEGREE OF 


NEE Ke 


10-7-6. Standard Error of Estimate or Residual Variance. The 
equation of the line of regression of Y on X is 
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2 Y-Y _ (X= X 
aad Cy - Cy 
The residual variance sy* is the expected value of the squares of deviations 


of the observed values of Y from the sa aa values as given by the line of 
regression of Y on X. Thus 


sy =E[Y - (¥ + (roy X -X)oy)) ? 
y-yY X-X\P_ sn SES 
~ope|r =F a -r{ =£) = Oy E(Y* ~ rX°) 
where X° and Y* are standardised variates so that 
E(X*2): = 1= E(¥*?) and E(X* ¥°) =r. 
sy? = oy [E(¥*?) + r? E(X*) — 2r E(X*Y*)] = oy? (1 - 7?) 
=> Sy = Gy (1 — r2)iz 
Similarly, the standard error of estimate of X is given by 
Sx = 0x (1-P)i2 
Remarks 1. Since sy? or sy” > O, it follows that 
(l-r2)20 = Itrisl 
Hence -1 <r(X¥, Y)<s1 - 
2. If r=+ 1, sy = sy = 0 so that each deviation is zero, and the two lines 
of regression are coincident. 
3. Since, as r? — 1, sy and sy —> 0, it follows that departure of me value 


r? from unity indicates the departure of the relationship between the variables X 
and Y from linearity. 


4. From the definition of linear regression, the minima condition implies 
that sy or sy is the minimum variance. 


10°:7:7. Correlation Coefficient between Observed and Esti- 
mated Value. — we will find the correlation between Y and 


Y = Yr ta - X) 


where Yj is the estimated value ‘of Y as given by the line of regression of. Y on 
X, which is given by 


r(Y, Y) = Cov i, Y) 
Oy Sy 
We have 


EY) -E|F +r Lx ~X)|-Fer SY ex -K=¥ 
Ox Oy 


Var (Y) =E [? -edyP=e| r SE ox - )| =o? 
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A 
=> Oy=rdsy 


Also Cov (Y, Y) = E[{Y -E())} (¥ - E(Y))] 


=H (01x - Buoy) fr St (X - »} 


= br 5 EL(X - E(X)}?] = - =| Oy? = ro? 


ne r(Y, es oe =r=r(X,Y) 


Hence the correlation coefficient between observed and estimated value of Y 
is the same as the correlation coefficient between X and Y. 


Example 10-23. Obtain the equations of the lines of regression for the 
data in Example 10-1. Also.obtain the estimate of X for Y = 70. 


Solution. Let U = X — 68 and V = Y — 69, then 
U = 0,V=0, 0,7 =45, of = 5-5, Cov (U, V) =3 andr (U, V) = 0-6 
Since correlation coefficient is independent of change of origin, we get 
r=r(X, Y)=r(U, V) =0-6 
X-a Y-b 
h° V = 8 then 
KX =a+hU, Y =b+kV,oy=hoy and oy =k oy 
In our case h= k= 1,a= 68 and b=69, 
Thus X =68+0=68, Y =69+0=69 | 
Oy =oy=V4 4-5 =2-12 and oy= oy = V5-5 = 2-35 
Equation of line of regression of Y on-X is 


We know that if U = 


Y - Par or (x -X) 


ie., Y= = 69 +06 x 535 (X -68) => Y = 0-665 X + 23-78 


Equation of line of regression of on Y is 
X -X= rox 7c Y) 


ss Ne 68 + 06x Fae (Y - 69) ie., X = 0-54Y + 30-74 
To estimate X for given Y, we use the line of regression of X on Y. If 
Y = 70, estimated value of X is given by 
X = 0-54 x 70 + 30-74 = 68-54, 


where X is estimate of X. 


Example 10-24. /n a partially destroyed laboratory record of an analysis 
of correlation data, the following results only are legible : 


- 
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Variance of X = 9. | 
Regression equations : 8X — ]OY + 66 = 0, 40X — 18Y = 214. 
What were (i) the mean values of X and Y, 
(ii) the correlation coefficient between X and Y, and 
(itt) the standard deviation,of Y ? 
[Punjab Univ. B.Sc. (Hons.), 1993] 
Solution (i) Since both the lines of regression pass through the point 
(X, Y), we have 8X - 10Y +66 =0, and 40X — 18Y = 214. 
Solving, we get X= 13,.Y = 17, 
(ii) Let 8X — 10Y + 66 = O and 40X — 18Y = 214 be the lines of regression 
of Yon X and X on Y respectively. These equations can be put in the form : 


By , 06 _18,,, 214 
Y =19X + jo and X=70' + Go 
byy = Regression coefficient of Yon X = = =* 
and byy = Re ffi ienver xen ye = 
xy = Regression coeffic = 407 20 
9 9 
Hence r2 = byy bxy=5 29 = 25 


as ee = 
(iii) Wehave byy =r- Sy > 5 
Hence Oy =4 


Remarks. 1. It can be verified that the values of X = 13 and Y = 17 as 
obtained in part (i) satisfy both the regression equations. In numerical problems 
of this type, this check should invariably be applied to ascertain the correctness 
of the answer. 

2. If we had assumed that 8X — 10Y + 66 = 0, is the equation of the line of 
regression of X on Y and 40X — 18Y= 214 is the equation of line of regression 
of Yon X, then we get respectively : 


8X = 10Y-66 and 18Y=40X - 214 


10, 66, 40, 214 
= X=ay¥-3 nd Y=>3X 18 
18 40 
=> byy ="3. and byx = 18 
10. 40 
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But since r? always lies between 0 and 1, our supposition is wrong. 


Example 10-25. Find the most likely price in Bombay- corresponding to 
the price of Rs. 70 at Calcutta from the following : 


Calcutta Bombay 
Average price 65 67 
Standard deviation 2:5 3°5 
Correlation coefficient between the prices of commodities in the two cities 
is 08 ~ [Nagpur Univ. B.Sc., 1993; 
Sri Venkateswara Univ. B.Sc. (Oct.) 1990) 


Solution. Let the prices, (in Rupees), in Bombay and Calcutta be 
denoted by Y and X respectively. Then we are given 


X = 65, Y = 67, ox = 2-5, oy = 3-5 and r= r(X, Y) = 0-8. We want Y for 
X = 70. . 
Line of regression of Y on X is 


—_ Oy _Y¥ 
3-5 
= Y =67+08X52 (XK - 65) 


A ¥ 
WhenX=70, Y =67+08x x3 (10-- 65) = 72-6 


Example 10°26. Can Y = 5 + 2-8 X and X = 3 —0-SY be the estimated 
regression equations of Y on X and X on Y respectively ? Explain your answer 


with suitable theoretical arguments. [Delhi Univ. M.A.(Eco.), 1986] 
Solution. Line of regression of Y on X is : 
Y=5+28X¥ > byy=28 ...(*) 
Line of regression of X on Y ‘is: 
X = 3-0-5Y => byy =- 0-5 saa(**) 


This is not possible, since each of the regression coefficients byy and byy 
must have the same sign, which is same as that of Cov (X, Y). If Cov (x, y) is 
positive, then both the regression coefficients are positive and if Cév (X, Y) is 
negative, then both the regression coefficients are negative. Hence (*) and (**) 
cannot be thé estimated regression equations of Y on-X and X on Y respectively. 


EXERCISE 10 (d) 


1. (a) Explain what are regression lines. Why are there two such lines ? 
Also derive their equations. 

(b) Define (¢) Line of regression, (ii) Xegression coefficient. Find the 
equations to the lines of regression and shov: that the coefficient of correlation is 
the geometric mean of coefficients of regression. 

(c) What equation is the equivalent mathematical statement for the 
following words ? ‘ 

“If the respective deviations in each series, X and Y. from their means were 
expressed in units of standard deviations, i.e., if each were divided by the 


C 
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standard deviation of the series; to which it belongs and plotted to a scale of 
standard deviations, the slope of a straight line best describing the plotted points 
would be the correlation coefficient r.” 

2(a) Obtain the equation of the line of regression of Y on X and show that 
the angle 6, between the two lines of regression is given by 


_ 1l-p?_ 9% 
tan @= ae ET 
0} +O? 


where 6}, O2 are the standard deviations of X and Y respectively, and p is the 
correlation coefficient. (Delhi Univ. B.Sc. (Maths. Hons.), 1989) 

Interpret the cases when p = 0 andp = + 1. 
(Bangalore Univ. B.Sc. 1990) 

(b) If 8 is the acute angle between the two regression lines with correlation 
coefficient r, show that sin 6 < 1 — r?. 

3. (a) Explain the’ term “regression” by giving examples. Assuming that 
the regression of Y on X is linear, outline a method for the estimation of the 
coefficients in the regression line based on the random paired sample of X and 
Y, and show that the variance of the error of the estimate for Y for the 
regression line is dy” (1 ~ p?), where oy” is the variance of Y and p is the 
correlation coefficient between X and Y. 

(b) Prove that X and Y are linearly related if and only if pyy* = 1..Further 
show that the slope of the regression line is positive or negative according as 
p=+lorp=-l. 


(c) Let X and Y be two variates. Define X* = a , Y’= z A © for 


some constants a, b, c and d. Show that the regression line (least square) of Y 
on X can be obtained from that of Y* on X°*. 

(d) Show that the. coefficient of correlation between the observed and the 
estimated values of Y obtained from the line of regression of Y on X, is the 
same as that between X and Y. ‘ 

4. Two variables X and Y are known to be related to each ‘other by the 
relation Y = X/(aX + 6). How is the theory of linear regression to be employed 
to estimate the constants a and b from a set of n pairs of observations (x;, y;), 
bed, 26h? 


: 1 ax+b_ b ° 
Hint. Y~ xX =a+y 
l 1 - 
Put x 7Y and » =V 
V =a+bU 


5. Derive the standard error of estimate of Y obtained from the linear 
regression equation of Y on X. What does this standard error measure ? 
6. (a) Calculate the coefficient of correlation from the following data : 
> ane 2 3 4 5 6 7 8 9 
Y: 9 8 10 12 11 13 14. 16 15 
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Also obtain the equations of the lines of regression and obtain an estimate 
of Y which should correspond on the average to X = 6-2. 

Ans. r=0.95, Y-12=0-95 (X -— 5), X —5 = 0-95 (Y -— 12), 13-14 

(b) Why do we have, in general, two lines of regression ? Obtain the 
regression of Y on X, and X on Y from the following table and estimate the 
blood pressure when the age is 45 years: . 


Age in years Blood pressure Age in years Blood pressure 
(X) (Y) (X) (Y) 
56 147 55 150 
Al, 125 49 145 
72 160 38 115 
36 118 42 140° 
63 149 68/ 152 
47 128 60 155 


Ans. Y = 1-138X + 80-778, Y = 131-988 for X = 45. 

(c) Suppose the observations on X and Y are given as: _ 

X: 59 65 45 52 60 62 70 55 45 49 

Y: 75 70 55 65 60 69 80 65 59 = 61 
where N = 10 students, and Y = Marks in Maths, X = Marks in Economics. 
Compute the least square regression equations of Y on X and of X on Y. 

If a student gets 61 marks in Economics, what would you estimate his 
marks in Maths to be ? 

7. (a) In a correlation analysis on the ages of wives and husbands, the 
following data were cbtained. Find 

() the value of the correlation coefficient, and (ii) the lines of regression, 


Estimate the age of husband whose wife’s age is 31 years. Estimate the age 
of wife whose husband is 40 years old. j 


15—25 25—35 35—45 45—55 55—65 


(b) The following table gives the distribution of total cultivable area (X) 
and area under cultivation (¥) in a district of 69 villages. 


Calculate (i) the linear regression of Y on X, 


“ 
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(ii) the correlation coefficient r(X, Y), and (iii) the average area under wheat 
corresponding to total area of 1,000 Bighas, 


Lo 


Total’ area (in Bighas) 


500—1000]1000—1500} 1500—2000} 2000—2500 


” 


Area under wheat 
AN 
=) 
f 
re 
—) 
— 
; 


oO 
Ans. (i) ¥ = 0-7641X — 455-3854, (ii) r(X, Y) =0-756 
(iii) Y = 308-7146 for X = 1000 oe 

8.(a) Compare and contrast the roles of correlation and regression in 
studying thé ‘inter-dependence of two variates. 

For. 10 observations on price (X) and. supply (Y) the following data were 
obtained (in appropriate units). 

2X = 130, LY = 220, 2X7= 2288, LY? = 5506 and XY = 3467 

Obtain the line of regression of Y on X and estimate the supply when the 
price is 16 units, and find out the standard error. of the estimate: 

Ans. Y = 8-8 + 1-015X, 25-04 


(b) If a number X is chosen at random from among the integers 1, 2, 3, 4 
and a number Y is chosen from among those at least as large as X, prove that 


Cov (X, Y) =3 
Find also the regression line of X on Y. 
(c) Calculate the correlation coefficient from the following data :— 
N = 100, YX = 12500 LY =8000 
LX? = 1585000, LY? =648100 SXXY = 1007425. 
Also obtain the regression equation of You X. 
9. (a) The means of a bivariate frequency distribution are at (3, 4). and 


r = (0-4. The line of regression of Y on X is parallel to the line Y = X. Find the 
two lines of regression and estimate the mean of X when Y = 1. 

(b) For certain data, Y = 1-2 X and X = 0-6 Y, are the regression lines. 
Compute p(X, Y) and ox/oy . Also compute p (X, Z), ifZ= Y-X. 


(c) The equations of two regression lines obtained in a correlation analysis 
are as follows : 


3X + 12Y=19, 3Y + 9X = 46 
Obtain (i) the value of correlation coefficient, 
(ii) mean values of X and Y, and 


10-68 Fundamentals of Mathematical Statistic, 


(iii) the ratio of the coefficient of variability of X to that of Y. 
Ans. (i) - 5 V3, (i) X= 5, ¥ = 1. 
(d) For an army personnel of strength 25, the regression of weight of 
kidneys (Y) on weight of heart (X ), both measured in ounces is 
Y —0-399X — 6-934 = 0 
and the regression of — of heart on weight of kidney is = 
~1212¥+2461=0 ‘ , 


' Find the correlation aims between X.and Y and their mean values. Can 
you find out the standard deviation of X and Y as well ? 

vee 

Ans. r(X, Y) = 0-70, X =11- 5086, Y = 11-5261, No. - 

(e) Find the coefficient of correlation for distribution in which 
, ‘ S.D of X = 3-0 units : 

' §.D. of Y = 1-4 units ' 

Coefficient of regression of Y on X = 0-28. a 

10. (a) Given that X = 4Y + 5 and Y =kX + 4, aré the lines of regression 
of X on Y and Y on X respectively, show thatO < 4k <1. Ifk= =<, find the 


means of the two variables and coefficient of correlation between them.., 


a [Punjab Univ, B.Sc. (Hons.), 1989] 
Hint. X =4¥+5 = byy=4 “ 
‘YY =kx+4 ‘= = byy=k 
“ "pea dk . -+(*) 
ie => O<4k< 1. 
If in = is, then from (*), we get 


Psax ie ‘Sypats 5 (Since both the regression coefficient are positive] 
For k = = , the two linés of regression become 


X =4Y45 and Y=yX+4 
Solving the two equations, we get Y = 5:75 , X = 28. 
(b) For S50 students of a class the regression equation of marks in Statistics 
(X) on marks in Mathematics (Y) is 3Y —5X + 180 = 0. The méan marks in 
Mathematics is 44 and variance of marks in Statistics is 9/16th of ‘the ‘variance 
of marks in Mathematics. Find the mean marks in Statistics and the coefficient 
of correlation between marks in two subjects. , 
[Bangalore Univ. B.S¢,, 1989) 
i Hint. We are ~ n=50,Y=44 
Gx _3 * 
and 6;* = Zo > es . Pres 
The equation of - a of regression of X on Y is giver to be 


Correlation and Regression 1069 


3Y¥ -5X #180=0 => x=iys4t2 


— -Ox_3 -3_3 ee 
byy “ "oy 5 => fr. is or r=08 
Since the lines of — pass through the point (X, Y), we get 


K =2¥ +P a2x 44436-0624 


(c) Out of the two lines.of regression given by 
X + 2Y ~5 =0 and 2X + 3Y -8 =0, 

which one is the regression line of X on Y? 

Use the equations to find the mean of X and the mean of Y. If the variance 
of X is 12, calculate the variance of Y. 

Ans. X = 1Y=2,0/7=4 

(d) The lines of regression in a bivariate distribution are : 

X+9Y¥=7andY+ 4x =2 

Find (i) the coefficient of correlation, (iii) the ratios Oy? : om Cov (X,Y), 
(iii) the means of the distribution and (v) E(X | Y = 1). 

(e) Estimate.X when Y = “ if the two lines of. regression are : 


X soa Y+X and Y=-2x4+u, 
(A, 4) being unknown and the mean of the distribution is at (1, 2). Also 
compute r, A and 1. [Gujarat Univ. B.Se., Oct. 1992} 


11. (a) The following results were obtaincd in the analysis of dat, on yield 
of dry bark in ounces (Y) and age in years (X) of 200 cinchona plants : 


: X Y 
Average 9-2 . 16-5 
Standard deviation 2-1 - 4.2 


Correlation coefficient = +0-84 
Construct the two lines of regression and estimate the yield of dry bark of a 
plant of age 8 years. [Patna Univ. B.Se. » 1991), 


(b) The following data pertain to the marks in subjects A and B in a certain 
examination : 


Mean marks in A = 39-5 
Mean marks in B = 47.5 
Standard deviation of marks in A = 10-8 
Standard deviation of marks in B = 16-8 
Coefficient of correlation between marks in A and marks in B= 0-42. 


Draw the two lines of regression and explain why there are two regression 
equations. Give the estimate of marks in B for candidates who secured 50 marks’ 
nA 


Ans, Y = 0-65X + 21-825, X = 0-27Y + 26-675 and Y = 54-342 forX = 50 
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(cy You are given the following information about advertising expenditure 
and sales : 


Advertising Expenditure (X) Sales (Y) 


| (Rs, lakhs) (Rs. lakhs) 
Mean 10 90 
s.d. a 3 12 


Correlation coefficient = 0-8 


What should be the advertising budget if the company wanis to attain sales 
target of. Rs..120 lakhs ? [Delhi Univ. M.C.A., 1990] 


12, Twenty-five pairs of value of variates X and Y led to the following 
results : 


N = 25, EX = 127, SY = 100, EX? = 760, ZY? = 449 and EXY = 500 
A subsequent scrutiny showed that two pairs of values were copied down 


as : 
X Y X Y 
8 14 8 | {2 
8 6 6 8 


(7) Obtain the correct value of the correlation coefficient. 
(i) Hence or otherwise, find the correct equations of the two lines of 
regression. 
(iit) Find the angle between the regression lines. , 
Ans. (i) r(X, Y) =— (0-64 x 0-15)!?, 
(ii) X =-.0.64Y + 7-56, Y = —0-15X + 4-75. 
13. Suppose you have n observations : 
(X15 1)» (X25 Ya)s +2200 Kar Ya) 
on two variables X and Y, and you have fitted a linear regression Y = a + bX by 
the method of least squares. Denote the ‘expected’ value of Y by Y*, and the 
residual Y —Y“ by e. Find means and variances of Y° and e, and the 
correlation co-efficient between (i) X and e, (ii) Y and e and (fii) Y and Y*. Use 
these results to bring out the significance and limitations of the correlation 
coefficient. 
Ans. r(X,e) =0,r (Y, e) =O and (Y, Y*) =r(X, Y). 
14. (a) The regression lines of Y on X and of X on Y are respectively 
Y = aX +b and X = cY + d. Show that 


() Means are X = (bc + d)((1 - ac) and Y = (ad + b)/(1 —ac) 

(ii) Correlation coefficient between X and Y is Va c: 
(iii) The ratio of the standard deviations of X-dnd Y is Vela ; 

(b) For two random variables X and Y with the same ‘mean, the two 
l-a 


regression equations are ¥ = aX + b and X = aY + B. Show that 3 = ee 


Find also the common méan. 
, (Punjab Univ.B.Se. (Maths Hons.), 1992) 
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(c) If the lines of regression of Y on X and X on Y are. respectively 
aX + bY + c, = 0 and a2X + bY + c2 = 0, prove that a,b, S ab). 
(Dethi Univ. B.Sc. (Stat. Hons.), 1989) 


2 1 b2\ ab, 
Hint. r byy . byy s1=(- )*(7 rag aby <1 


n 
15. (a) By minimising > f; (x; cos a + y; sin a —p)? for variations in a 
i=1 


and p, show that there are two straight lines passing through the mean of the 
distribution for which the sum of squares of normal! deviations has an extreme 
value. Prove also that their slopes are given by 


- 


._ 2p 
tan 20 = ox? - oy? 
Hint. We have to minimize : 
S= Yf(%;cosa+ y;sina-—p)* ..(1) 
i=) 


Equating to Ze the partial derivatives of (1) w.r.t. @ and p, we have 


ts 0=2 E fia cosa+ y:sin a p) x; sina +y,cosa) — ...(2) 
WS _o-2 5 ? 
yee ,& fix Cos & + y; sin & - p) ---(3) 
Equation (3) can be written as 


n 
x f, %jcosa+y,sina-p)=0 = Xcosat+ysina-p=0...(4) 
| 


From equation (2), we get a quadratic equation which shows that there are 
two straight lines for extreme values of E. 

From equation (4), it becomes clear that both the straight lines pass 
through the point (x , y ). 

Again equation (2) can be written as : 


© fi &icos a +y; sin a —p) (y; cos @ — x; sin a) = 6 


R 


=> _&, fi loos @ (i — X )-+ sin a (y; + )] Ly; cos @ — x; sin &] = 
[Using i 


Rw 


=> cos? O, 7 fi yi @i-X ) -sin a cos a 2, f,%; @j-X ) 
+sin acosa E fii (y; -¥ )-sin? a Shin (y; - y) =0...(5) 


1 = - 
Wehave ji =" Lfi i -x*)Oi-Y) 
g 
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ee y iy 1s y ais ( 
7 y 2 LIAM: WI Hk RIOR LIK Yi -y) 
sack 1; = 
Similarly, py; - =N Xf yj -x) 


= oD fixile ~x) and oy = TSS yiOi-y) 
: | | 
Substituting these values in (5), we get thé required result. 
(6) If the-straight line defined by” ! 
Y =a+bXx 


satisfies the condition £[(Y - a — bX)?] = minimum, show that the regression 
line of the random variable Y on the random variable X is 


y -Y= rot (X —X), where X.= £(X), Y = E(Y) 
z 


16.‘(a) Define Curve of regression of Y on X. 
The joint density function of X and Y is given by : 
flix, y) =x+y,0<x<10<y<1 
= 0, otherwise 


. Find 
(i) the correlation coefficient between X and Y, 
(ii) the regression curve of Y on X, and 


(iii) the regression curve of X on Y. 


Ans. p (X, ¥) = “1 (Madras Univ. R.Sc., Stat. (Main), 1992) 


(b) Let te aii 4 0 $2 <Xo,0 <%2<4 
= 0, elsewhere 
be the joint p.d.f. of X; and X2. 
1 


Find conditional means and variances. saa show that p=;. 


17, If the joint density of X and Y is given ‘by 


_ J @+y)/3, for0<x< 1, 0<'y <2 
f y= 0, otherwise 


obtain the regressions (i) of Y on X and (ii)-of X on Y. 
Are the regressions linear ? Find the correlation coefficient between 


X ahdy. .« (Allahabad Univ. B.Sc. 1992) 
‘3x +4 2+ 3 
Ans, y‘= EV Ix) age ix= EXIY=3G 2 ay 
? 


Corr. (X,*¥) =- ( 55 y" 


18. Let the joint density function of X¥ and Y be given by > 
f(x y) = 8xy, O<x<y<il 
= 0, otherwise 


correlation and Regression oe 
Find: (i) E(Y\X =x), (ii) E[XY1X 2a (iii) Var IYIXx = x) 
[Delhi Univ. BSc. (Maths Hons.), 1988] 


21 2 
Ans. (i) E(¥lx) = 5 eae E (XY Ix) = x E(Y 12x), (iii) E (¥2 1x) = ive 


19. Give an example to show that it is possible td have the regression of 
y on X constant (does not depend on X), -but the regression of X on Y is not 
constant (does depend on Y),. caer 

Hint. See Example 10-21 . 

20. Prove or. disprove ; 

ca E(Y|X =x)=constant = r(X,Y)=0 

Ané. ‘True 7 

21. Iff(x,y).= zx? exp [-y (1 +.x)], x 2,0, y 2 0.is the joint p.d.f. of 
i Y), obtain the equation of regression of Y on X. 

Ans, y-E(Y |x)'= 1/1 + x). 

22. Variables (X,Y) have joint p:d.f: 

f(xy) = 6(1-"x - y), x >0, y >00,x +y <1. 
= 0, otherwise. 
Find fx(x), rly) and Cov (X,Y). Are X and Y independent ? Obtain the 
regression curves for the means. 
F (Calcutta Univ. B.Sc. (Maths. Hons.), 1986) 

Ans. f;(x) = ="3(1 ~x)?,0<x<1;f,(y) = 3(1 —y)*,0<y <1. 

X and Y are not independent. 

Regression. curves. for a means are:: 

y= E(Y1ix)= : 3 (1 ~x)and x= E(X ly)-=! xt *y): 
23. For the. joint p.d.f. ; 
f(x, y) = 3x? ~ Bxy + 6y?-0-S(z, yy SL, 

find the least square regression lines and the regression curves for the means. 


[Calcutta Univ. B.Sc. (Maths, Hens.), 1987] 
Ans. Regression lines: _, 


2__ 10 3 a oe) 2 
y -F2-18le - §) oe :) 


Regression curves for-means are : 
Ox? - 16x+9 | 36y2 = 32) - 9 

y= EWU) = 6352 — ax. 2) BIE poy ay + 1) 

24. Let (X, Y) be jointly ‘distributed with: p.d.f. 
fx, y) =e%,0<x<y<oo 
-- ‘“=Q , otherwise 

Prove that : 

E(YIX =x) =x+% and E(X|\Y=y)=//2. 
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Hence prove that r(X, Y) = V1/2 . 
25. Letf(x%,y) =e7 (l=e7) ,0<x<y;0<y<o 
=e*(1-e7) ,0O<y<x; O<x<0 
(a) Show that f(x, y) is a p.d.f. 
(b) Find marginal distributions of X and ‘Y. 
(c) Find E(Y1X =x) for x > 0. 
(dd) Find P X% <2, Y <2). 
(e) Find the correlation coefficient r(X, Y). 
(f. Find another joint p.d.f. having the same marginals. 
Ams. (6) f;(x) =xe* ,0<x< 0; fo(y) = ye” ,0<y.<om. 
.l-e 1 fx _ 
(c) E(Y Ix) =" Lx - 1] +i (F+ xe*+e7%—- y 
@ 1 -4-3: ©re. yee 7 R72 
() Hint. f(x, y. a) =fi(x) fx) [1 + @ (2F(x)-- 1) QF(y) -1)), 
la l</J, has the same margiuals f(x) and f(y). 
26. Obtain regression — of Y on X for the distributions : 
l+x+y reg 
(a) f(. y) =>: (1 +.x)* (1 + y)4 > x,y 20 
(6) fx y) =3 +(x + 3y)e*27 ; x, y 20 
' [Sardar Patel Univ. M.Se., 1992) 


Ans. (a) Hint. See Example 5-25, page 5-55, (b) . — x. 


27. A ball is drawn at random from an urn containing ‘three white balls 
numbered 0, 1, 2 ; two red balls numbered 0, 1 and one black ball numbered 0. 
If the colours white, red and black are again numbered 0, 1 and,2 respectively, 
find the correlation coefficient between the variates X, the colour number and Y 
the number of the ball. Write down the equation of regression line of Y on X. 


[Calcutta Univ. B.Sc. (Maths. Hons.), 1986) 


OBJECTIVE TYPE QUESTIONS 


I. State, giving reasons, whether each of the following statements is true or 
false. 


(i) Both regression lines of Y on X.and of X on Y do not intersect at 
all. 


(i) Ina bivariate regression, byy = 7 byy = 10 
(iii) The regression coefficient of Y on X is.3-2, and that of X on Y 1s 
0-8. 


(iv) Theté ‘is no relationship between correlation coefficient and regression 
coefficient. 
(v) Both the regression coefficients cannot exceed unity. 
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(vi) The greater the value of ‘r’, the better are the estimates obtained 
through regression analysis. 

(vii) If X and Y are negatively correlated variables, and (0, 0) is on the 
least squares line of Y on X, and if X = 1 is. the observed value then 
predicted value of Y must be negative. 

(viii) Let the correlation between X and Y be perfect and positive. 
Suppose the points (3, 5) and (1, 4) are on the regression lines. With 
this knowledge it is possible to determine the least squares line 


‘exactly. 
(ix) If the lines of regression are Y=iX and X =tY¥+ 1, then p = 4 and 
E(X |Y =0) = 1. 


(x) In.a bivariate distribution, byy = 2-8 and byy = 0-3. 
II. Fill in the blanks : 

(i) The regression analysis measures ... between X and Y. 

(ii) Lines of regression are ... if ryy = 0 and they are ... if ryy = +1, 
(iit) If the regression coefficients of X on Y and Y on X are — 0-4 and 


— 0-9 respectively then the correlation coefficient between X and Y 
is . 


(iv) If the two regression lines are X + 3Y —5 = 0 and 4X + 3Y -8 = 0, 
then the correlation coefficient between X and Y¥ is ... 

(v) If one of the regression coefficients is ... unity, the other must.be ... 
unity. 

(vi) The farther the two regression lines cut each other, the ... will, be the 
degree of correlation. 
(vii) When one regression coefficient is positive, the other would be ... 
(viii) The sign of regression coefficient is ... as that of correlation 
coefficient. 
(ix) Correlation coefficient is the.... between regression coefficients. 
(x) Arithmetic mean of regression.coefficients is ... correlation: coeffi- 
, cient. . 
(xt) When the correlation coefficient is zero, the two regression lines are 
;.¢ and when it is + 1, then the regression lines are ... 
IKI, Indicate'the correct answer : 

(iy The regression line of Y on X (a) minimises total of the squares of 
‘horizontal deviations, (6) total of the squares: of the vertical 
deviations, (c) both vertical and horizontal deviations, (d) none of 
these. 

(ii) Thié regression coefficients are bz and b,, Then the correlation 
coefficient r is (a) by/b2, (b) bib, (c) bib, (d) + Vb; be 
“(iii) The farther the two regression lines cut each other (a) the greater will 
be the degree of correlation, (6) the lesser will be the degree of 
correlation, (c) does not matter. 
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(iv) If one regression coefficient ig greater than unity, then the other must 
be (a) greater than the first one, (6) equal to umty, (c) less than 
unity, (d) equal to:zero. 

(v) When the corrélation coefficient r = +1, then, the two regression jing 
(a) are perpendicular to each: other; (b) ela (c) are parallel to 
each-other, (d) do not exist. 

(vi) The two lines of regression are given as X + 2Y = 5 = 0 and 
‘2X + 3Y = 8. Then ‘the mean values of X and Y respectively are (a) 
2, 1, (6) 1, 2, (c) 2,.5, @ 2, 3. 

(viiy The tangent of the angle between. two regression lines is given as 0-6 
and the s.d. of Y is known to be twice that of X. arene value of 


correlation coefficient’ between X oe is a) —-~, (b) * (c) 0-7, 
(d) 0-3. ~ 9 


IV. Gx and oy are the standard deviations of tWo correlated‘ variables X an 
Y respectively in a large sample, and r is thé sample correlation 


coefficient. 
(i) State the “Standard Error of Estimate” for linear regression of Y on 
, & ‘4 


(ii) What is the standard error in.estimating Y from X‘if r = 0? 
(iii) By how much is this error reduced if r is irictédséd to 0-30 2 


(tv) How large must r be in order fo reducé this standard error to one-half 
its value for r = 0? 


(v) Give your interpretations for the cases r = 0 aiid r = 1. 


ee 


V. Explain why we have two lines of regression. oe 


J 

10-8. ‘Correlation Ratio. As discused’ earlier, when variables are 
linearly related, we have the regression lines of one variable on another variable 
and correlation coefficient can be computed to tell- us about the extent of 
association between them. However, if the variables are not linearlyrelated but 
some sort of curvilinear relationship exists between them, the use of r which is 
a measure of the degree to which the relation approaches a straight line “law” 
will be misleading. We might come across bivariate: distributions. where + may 
be very low or even. zero but the'regression may ‘be strong, or even perfect. 
Corrélation ratio ‘Nn’ is the appropriate measure of curvilinear: ‘relationship 
between the two variables. Just as r measures the concentration of. points about 
the ‘straight line of best fit, 7) measures the concentration of points about the 
curve of best fit. If regression is linear 1. =.r, otherwise 7) > r (cf. Remark 2, 
§ 10-8-1). 


10-8-1, Measure of Correlation -Ratio. In the previous: articles we 
have assumed that there is a single observed value Y corresponding to the given 
value x; of X but sometimes there are more than one such value of Y. 

- Suppose’ corresponding to the values x;, (6 = 1, 2, ...,m) of the. variable . 
the variable Y takes ‘the values y,; with respective frequencies fj. J=1,2,. 

Though all the x’s in the ith vertical array have the same = value. the y’s are 
different. A typical pair of values in the ith array is (x;, yj), with frequericy f;. 
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Thus the first suffix i indicates the vertical array while the second suffix j 
jndicates the positions of y in that array. Let 


SZ fyen and D) Lhe = (2 fus\= S ni=V, (say). 
iol tz lje=l fel Yel i= | 


If y; and y denote the means of the. ith array and the overall mean 
respectively, then ” 

_ Ly yy Ly dy T, Lk i dni Yi 

y= ee = t+ n; and y= “3Shi ‘En 


2 fy Nn; 
In other wey y is the weighted mean of all the array means, the weights 
being the array frequencies. 


Def. The correlation ratio of Y on X, usually denoted mats Nyx is given by. 
where 6,y” and oy” are given by 
1 vor eae _1 ec; = 
Oy HV » >fi Oy -y and of = 5, 22 fi Qi~-y) 


A convenient expression for Tyyxy can be obtained in terms of standard 
deviation 6,,y of the means of the vertical arrays, each mean being weighted by 
the array frequency. 

We have 


No? =L EL Oy-YP= LE (4-9) 4 5;--F))? 
= 3 Bhi Oa “YP +E DSi Oi YP + 2% Efi OW -¥) Gi-¥) 


The term 2025 yi ~ y) fi if Oy ~ ¥})}] vanishes since Lf (yy - -¥) =0, 
being the algebraic sum of ihe deviations from mean. 


Noy = EE Sy Oy -I? + Bm Ii- y)? ° 


=f 


2 2 ee ee 
=> N.o, = Noey? +No,, => Oy 20, +O.) 
2 
Oey” © ny 
=“ I~ G2 = G2 
Y Y : 


which-on comparison with (10-21)-gives 


Say Xn yi -y?) 
nw = == ...(10-22) 
Sy z > fy (Yj = ye 


~~ 
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We have 
No, =Dni(yi-yYP= 2 ny? — Ny= =2 
& 


2 - ra T2 
‘a -|3 ta 5 | rs et ...(10-23) 


a formula, much more convenient for computational purposes. 
Remarks 1, (10-21) implies .that 
O.y = oY (1 ~ Nyx’) 
Since 6,y” and oy” are non-negative, we have. 
,1-nrx’ 20 => Nyxy?s1 => Inyy!S1 


2. Since the sum of squares of deviations in any array is minimum when 
measured from its mean, we have 


Lie lad 
n, N 


» > fi (yj - yi)? Ss 2 >i (yy -— Ji) ---(*) 
where 9 yi is the estimate of yj; for. given ce of X =x; , say, as given by the 
line of regression of Y on X i.e., Vi =a+ bx;, (j= 1,2, ..., n). 

But > 2 fy i Oy ~Y)? =No. =Noy (1 am 
and pp» fy 0; -~a-—bx;)? = Noy? (1 - r?) (cf. § 10-7-6) 
ij 
. ‘=> 1 —Nyx? <1-r 
i.é., Nyx 2r? => Inyl2lril 


Thus the absolute value of the correlation ratio can never be less thay the 
absolute of r, the correlation coefficient. 

When. the regression of Y on X is linear, straight line of means of arrays 
coincides with the line of regression and Nyx? = r?. Thus Nyx? ~ r? is the 
departure of regression from linearity. It is also clear (from Remark 1) that the 
more nearly Nyx’ approaches unity, the smaller is o,y” and, therefore, closer are 
the points to the curve of means of vertical arrays. 

When Nyx? = 1,6,;7 =0 => L Xf; Oy = yd? =0 

=>y,=y;,Vj=,2,...,a, he. all the points lie on the curve of 
means. This implies that. there is a functional relationship between X and Y. 
Tiyx is, therefore, the measure of the degree to which the association between the 
variables approaches a functional:relationship of the form Y = F(X), where 
F(X) isa single valued function of X, [F(X) = a + bX]. 

3. It is worth noting that the value of Nyy is not independent of the 
classification of the data. As the class intervals become narrower Nyx approaches 
unity, since in that case 6,,y7 gets nearer to oy. If the grouping is so fine that 
only one item appears in each row (related to each x-class), that item will 
constitute the mean of that column and thus in this case 6,,y* and oy7 become 
equal so that Nyx? = 1. On the other hand, a very coarse grouping tends to make 
the value of Nyx approach r. “Student” has given a formula for ‘the correction’ 
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to be made in the correlation ratio ‘for grouping’ in Biometrika (Vol IX page 
316-320.) | 

4, It can be easily proved that Nyx” is indepenuent of change of origin and 
scale of measurements. 

5. Nx, the second correlation ratio of X on Y depends upon the scatter of 
observations about the line of column means. 

6. ryy and ryy are same but Nyy is, in general, different from nNhyy. 

7. In terms of expectation, corrclation ratio is defined as follows : 

» _ Ex (E(Y1X)-E(Y))? _ E[E(Y1X)-E 


x" ELY-E(Y)}> — oy” 
_ Ey (E (XY) -E(X)) _ E[E(X1Y) - EX)? 
and nxr EIX-EX)P oe 


8. We give below some diagrams, exhibiting the relationship between r 
and Tlrx- 

(i) For completely random scattering of the dots with no trend, both r and n 
are Zero. 


F =O. y= Lay= 0 
(ii) If dots lie precisely on a line, r= 1 and yn = 1. 
Y 


r= lyx® Nyyt 1 
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(iii) If dots lie on a curve, such that no ordinate cuts it more than once, 
Nyx = 1 and ‘if furthérmore, the dots are symmetrically placed about Y-axis, then 
Nxy = 0,r=0. 


r=0 Nyy71 Lxy=8 


(iv) If Nyy > 7, the dots are scattered around a definitely curved trend line. 


Ay?! 


EXERCISE 10(e) 


1, (a) Define correlation coefficient and correlation ratio. When is-the latter 
a more suitable measure of correlation than ine former ? Show that the 
correlation ratio is never less than the correlaticn coefficient. What do you infer 
if the two are equal ? Further, show that none of these can excced one. 
(Delhi Univ. B.Sc. (Stat. Hons.), 1988] 


(b) Show that 12x 2rvx 20 

Interpret each of the following statements. 

(i) r=0, Gi) 2 =1, Gi) H? = 1, (iv) N? =P? and (v) nH =0 

(c) When the correlation coefficient is equal to unity, show that the two 
correlation ratios are also equal to unity. Is the converse true ? 

(d) Define correlation ratio Nyy and prove that 
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r 1 >n2yy 277, 
where is the coefficient of correlation between X and Y, Show further that 
(n’xy - _ r) is a measure of non-linearity of regression. 
2. For thé joint p.d.f. 
Fi y) =4x3 exp [-x (vy + 1)], y > 0, x >0 


\ ° 
= 0 ., otherwise, 


find : 
(i) Two lines of regression. 
(ii) The regression curves for the means. 


(iii) r(X, Y). 
(iv) Nyxand Nyy 


(Delhi Univ. B.A. (Stat. Hons. Spl. Course), 1987] 


0 
Ans. (i) ys-ix4] : xa—2y4 5 


4 
l+y 
ees = 1 : 2 _1 2 2. ] 


Wi y=E (lx) =~ ~ eRe ie 


3. cone: r(&, Y) and nyy for the following data : 
0-5 — 1-5 1.5 — 2-5 25 — 3-5 35—45 45 —5-5 


s , 20 30 35 25 35 
yi" ~ 11-3 12:7 14-7 16-5 19-1 
Var. (Y) = 9-61 


Ans. Nyx = “0-77, r ='0-85 
4, Compute’ Nyy for ‘the followin table : 


10:9. Intra-class Correlation. Intra-class correlation means within 
class correlation. It is distinguishable from product moment Correlation in as 
miich as here both the variables measure the same characteristics. Sometimes 
specially .in biological and agricultural study, it is of interest to know ‘how the 
members of a family or group are correlated among themselves with respect to 
some one of their common characteristic. For example, we may require the 
correlation between the heights of brothers of a family or between yields of 
plots of an experim-ntal block. In such cases both the variables measure the 
same characteristic, e.g., height and height or weight and weight. There is 
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nothing to distinguish one from the other so that one may be treated as X- 
variable and the other as the Y-variable. 


Suppose we have Aj, Ag, ..., A, families with k,, ko, ..., k, members, 
each of which may be represented as 


‘ X11 XQ] vores seserseaee Xj] -rereerooveseces Xai 
X12 X22 Xi2 Xn2 
ra } 

Xj Nj sce eectiss Nag: clacicstecs Xnj 
X1ky Krfey vrocerrerereneese Kies cvseoreeeeessenes Xnky 


and let x; Gi = 1,2,...,n; j= 1,2, ..., kj) denote the measurement on the jth 
member in the ith family. 


We shall have k(k; -— 1) pairs for the ith family or group like (xij, Xi), 
R 
j#l. There will be 2 k; (ki; - 1) = N pairs for all the n tamilies or groups. If 


we prepare a correlation table there will be k; (k; — 1) entries for the ith group or 
family and > k; (k; - 1) = N entries for all the n families or groups. The table is 
t 


symmetrical about the principal diagonal. Such a table is called an intra-class 
correlation table and the correlation is called intra-class correlation. 


In the bivariate table x;, occurs (&; — 1) times, x,. occurs (k; - 1) times, ..., 
Xjz, occurs (k; — 1) times, i.e., from the ith family we have «i -1) 2 xy and 


hence for all the n families we have > (k; — 1) xj; as the marginal’ fréquency, 
the table being symmetrical about principal diagonal. 


x “j2t/EGs- 1) Zs, 
Similarly, 
ox? =of= > [pets - DECy- | 
Further 
Cov (X, ¥) =* z| 5, (x, - ¥) (4a --% } j#l 


jel? 


- kj 
aE: Ey B) Gu-¥) = E,tay- 2 
j = 1 j=l 
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[f we write x; = L x,;/ k;, then 
j 


x [> x (%y-) Gu 3] 


i ed f= 


5 | % (xy - HE (1-3) 


h 


& Lk; (x; —x) k; (x; -x)] 
= 2k? @,-x) 
i i 
Therefore intra-class correlation coefficient is given by 


BOS ee an ge atone)? 
Cat Pt, te 
Vv V(X) V (Y) % 2% (ki- 1) Cry — x? 


if we putk; =k, ie., if all families have equal members then 
k2 L(x ;--x)? - 2» > (xj,-x) 
qc i vd 
(k —1) pp aC Fee ee : 
ij 


nk2 6,,2 — nko? I ko,,? 
pe nest EY or ...(10-24a) 


where 6? denotes the variance of X and G,,” the variance of means of families. 
Limits, We have from (10-24a), 


2 KO" 1 
a = > ae 
1+4+k-l)r a >O0 => r2 (k-1 
G,,2 
Also 1 + (k~1) rk, as the ratio Ty S | => rs 
] 
. , _ cre 
so that (k- jo = | 


Interpretation. Intraclass correlation cannot be less than —I/(k — 1), 
though: it may attain the value +1 on the positive side, so..that it is a skew 
coefficient and a negative value has not the same significance as a departure from 
independence as,an equivalent positive value. 


EXERCISE 10 (f) 


1. If x1, x2, ..., x, be k variates with standard deviation oO and m be any 
number, prove that 
k kok 
k2o0? = (k - 1) 2, (x,-m)?— 2% dD (x,-m) (x,—m), r¥s 
r 


= r=ls=] 


Hence deduce that the coefficient of intraclass correlation for 7 families with 
varying number of members in each family is 
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Lk, oO? 
(a 
oy k i(k in | ) 
ri 
where k;, 6 denote the number of members and the variance respectively in the 


itn family and 0? is the general variance. 

Given n=5,6,;=i, kj =i + 1 (<5), find the least possible intraclass 
correlation coefficient. 

2. What do you understand by intra-class correlation coefficient. 

Calculate its value for the following data: 


Family No. Height of brothers 
1 60 62 63 65 
2 / 59 60 61 62 
3 62 62 64 63 
4 65 66 65 66 
5 66 67 67 69 


3. In four families each containing eight persons, the chest measurements 
of persons are given below. Calculate the intraclass correlation co-efficient. 


Femily 1 2 3 4 5 6 7 g 
I 43 46 48 42 50 45 45 49 
I 33 34 37 39 82 35 37 41 
I 56 52 50 51 54 52 39 52 
IV 34 37 38 40 40 4] 44 44 


10:10. Bivariate Normal Distribution. The bivariate normal 
distribution is a generalization of a normal distribution for a single variate. Let 
X and Y be two normally correlated variables with correlation coefficient p and 
E(X) =, Var (X) = 0)? ; E(Y) = po, Var (Y) = 6. In deriving the bivariate 
normal distribution we make the following three assumptions. 


(i) The regression of Y on X is linear. Since the mean of each array is on 
the line of regression Y = p(62/0,)X, the mean or expected value of Y is 
p(G,/o,)X, for different values of X. 


(ii) The arrays are homoscecastic, i.e., variance in each array is same. The 
common variance of estimate of Y in each array is then given by o,2 (1 - p”), 
p being the correlation coefficient between variables X and Y and is independent 
of X 


(iii) The distribution of Y in different arrays in normal. Suppose that one 
of the variates, say X, is distributed normally with mean Q and standard 
deviation 6, so that the probability that a random. value6f X will fall in the 
small interval dx is 


1 
dt=—— Se —x2/26,”) dx 
g(x) 5. VGn) xp (-x*/20, 


The probability that a value of Y, taken at random in an assigned vertical 
array will fall in the interval dy is 
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: 1 i: 1 CO» i 
(3). SD = Se LY —2 
h (y |x) dy rere P| 267°(1 — p) (y px 
The joint probability differential of X and Y is given by 
dP (x, y) = g(x)h(y | x)dxdy 
2 


1. ee eee &.Y 
ee: caer —— 262" * | 20,*(1 - p*) ( J a.) 


= ~@ 
200,02 V(1 - p?) 


= { & 2Oxy , a 
_ SEO orca oe — 900 - p2) (o,2 162 5 62? 
20,62 (1 ~ p?) ae “Ge 
Shitting the origin to ({1;, Ho), we get 
fur % y) 


pe oe “ae _ 2p G=WO-w) , oe 
ope gM | G0, o ). 
2n0,02V (1 - p?) 
(-00 <x < 00,-co <y<oo) .,.(10-25) 
where Hy, [2 O; (>Q), 02 (>0) and p (1 <p < 1) are the five parameters of the 
mspunen: NORMAL CORRELATION SURFACE 


This is the density function of a bivariate normal distribution. The variables 
X and Y are said to be normally correlated and the surface z = f(x, y) is known as 
the normal correlation surface. The nature of the normal correlation surface is 
indicated in the above diagram 


Remarks 1. The vector (X, Y)’ following the joint Daf. f(x, y) as given 
in (10-25), will be abbreviated as (X, Y) ~ N (1, Hy, 01, 92, p) or BVN (Hy, Ho, 
272 , 
0), 6, P). If in particular 1, = pL, = 0 and o, = 62 = 1 then 
(X, Y)~N (0,0, 1, 1,p) or BVN (0, 0, 1, 1, p). 


_ 2. The curve z = f(x, y) which is the equation of a surface in three 
dimensions, is called the ‘Norma! Correlation Surface’. 
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10°10:1. Moment Generating punction of Bivariate Norma, 
Distribution. Let (X¥, Y) ~BVN (Wy, Ho OteOs, p). By def., 


Myy (t,t) =E [ etiX 7 | -|- J exp (4x + by). fx, y) dxdy 


xm—- 
Put aPi oy ee See 
ere 0? 
ae X =OpKk+ Ny, Y=OQV4eN, => ly l=a\G, 


EXP (t)My + ofa) 


x,y (h, t2) = 
i- 2nV1 - p? 
«JI exp [io + [40 ov —- aa H uz— 2puyv + v2} fa | eu 


_ &XP (t)py + toft2) 


2nV1 - p2 
x | [exo a { (u2—2puv + v2) -2(1- p(t 1u +20>v) } dudy 


(u2 ~ 2puv + v*) — 2(1 — p?) (410, 4 + 02v) 
= ((u—pv) - (1 - p?)t;0,}? 
+ (1 — p?){(v — pt) O) — 962)? ~ t)70j? — to? 62? - 2pt,t.0,05} --(*) 
By taking 
- (1 - p”) 0, = @(1 —- p”)!/4 
ca ad ) 491 ( ) | => dudv = V1 — p? dwdz 
and V—Pt,;01 — [252 =2 
and using (*), we get 
My, y(ty, t2) = exp[ typi + tofle + — +1205 + 2ptyt20102)] 


-i1 o-0/2 ale \~ 27/2 
x dw x e az 
Y one 00 \ 21? —co 


= exp Lert F bnplo + 5 (20,2 + to2o2 + 2pt) 78102] ] ...( 10-26) 
In particular if (X, Y) ~ BVN (0, 0, 1, 1, p), then 
My y (th, t2) = exp [3 (4? + to? + 2pttr)] ...(10-26a) 

Theorem 10-5. Let (X, Y) ~ BVN (ty, fo, 512, 62”, p). Then X and Y 
are independent if and only if p = 0. 

Proof. (a) If (¥, Y) ~ BVN (uy, Us, 61”, 62”, p) and p = 0, then X and Y 
are independent (c.f. Remark 2(a) to Theorem 10-2, page 10.5), 

Aliter. (X, Y) ~ BVN (4), Ho, 517, 52”, p) 
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My, y (theta) = exp (thr + thle + 5 (470,2 + 2ph 002 + 1,°02")} 


If p =O, then 
; 2 2 2 
Mx,y(4.t) = exp {4H + +40; } - Oxp{ ta Me + + 2.02} 
D = My(t,). My (¢9). oi) 


[-- If X, Y) ~ BVN (4), Ly, 6,2, 62”, p), then the marginal p.d.f.’s of 
X and Y are normal i.e., X ~ N (\;, 0)) and Y ~ N (1, 0%)). 
(*) = Xand Y are independent. 

(b) Conversely if X and Y are independent, then p = 0 [c,f. Theorem 10-2] 

Theorem 10-6. (X, Y) possesses a bivariate normal distribution if and 
only if every linear combination. of X and Y viz.,aX + bY,a #0,b#0,isa 
normal variate. 

Proof. (a) Let (X, Y) ~ BVN (1), U2, O12, 64”, p), then we shall prove 
that aX + bY, a #0,'b #0 is a normal variate. 

Since (X, Y) has a bivariate normal distribution, we have” 


My, y(,) =E (et + 47 ) 
= efit + tH, + 1(,70,7 + 2pt,t, 6,6, + 1,78,2) (5 
Then m.g.f. of Z = aX + bY, is given by : 
M(t) = E(e2) =E (efx + bY) )=E (e2!* + bY) 


2 
= exp {r(ap; + by) + = (@o? + 2pabo,02 + 6°5,7)}, 


(Taking ¢, = at, tf. = bt in (*)) 
which is the m.g.f. of normal distribution with parameters 
pw =a, +Di ; = a*o,* + 2pabo,07 + b’o,2. sas(**) 
Hence by uniqueness theorem of m.g.f., 
Z=aX + bY ~ Ny, 6”), 
where }! and 6? are given in (**). 
(b) Conversely, let Z = aX + bY, a #0, b #0 be a:normal variate. Then 
we have to prove that (X, Y) has a bivariate normal distribution. 


Let Z = aX + bY ~ N(1, 6), 
where ph = EZ=E (aX + bY) = ap, + by, 
and o? = Var Z = Var (aX + bY) = a’0,2 + 2abpo,0, + b?o,? 


M(t) = exp [tq + 02/2] 
= exp [1(a}1, + by) + £ (@o,2 + 2abpa,6, + b°,2)) 


= exp [t)[1r + fol, + $.(t20, + 2pt; 10,0, + t70,’)] si") 


where % =at and t = bt. 
But (***) is the m.g.f. of BVN distribution with parameters (1, Hy, 6,2. 
0,7, p). Hence by uniqueness theorem of m.g.f. 
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(X, Y) ~ BVN (uy, py, Sy?, dy’, p) 
10-10-2. Marginal Distribution of Bivariate Normal. Distribution. 
The marginal distribution of random variable X is given by 


fx@) = fuy &, y) dy 


—0o 


2 — = u, then dy = G, du. Therefore, 
2 


1 


ag ON spare 


oo ' = 2 a 
x | ex|- sats 5 (SEY - 20m (SE) + wt one 


a _1 f#=b 
2no,V(1 - 92) exp 5 ( 0; J 


x | © ex- 21 - es | ~» (Sey 


1 — Hy Van o2) 
“seem #( ) ea then du =V(1l-p?) dt 
V(1 =p?) %1 
SCSI CD 
fx) oe -exp| - 2 ( 3; | J exp dt 
sal L /z-wWy 
~ 200, exp| - 5 ( 0; YI ii 
1 1 (x - Py ] 
= exp| - ~ |—— | | ..-(10-27) 
o,V2n 2 ( 01 ) 
Similarly, we shall get 


fr) -| Fay (, y) dx 


1 1 (y — pe i 
= exp - >| (10-274) 
62 20 2 f O2 
Hence X ~ N (411, 0,7) and Y ~ N (12, 62”) ...(10-27b) 


Remark. We have proved that if (X, Y) ~ BVN (41, M2, 012, 022, p), 
then the marginal p.d.f.’s of X and Y are also normal. However, the converse 
is not true, i.e., we may have joint p.d.f. f (X, Y) of (X, Y) which is not 
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normal but the marginal p.d-f.’s may still be normal as discussed in the 
following illustration. 
Consider the joint distribution of X and Y given by : 


1 -1 
fx. ¥) =; Pera exp rere (x? - 2pxy + | 


1 
+ 2n(1 - 92)? Spin exp} - 21 —p?) (x? + 2pxy + ro} 
=5[f(z, y) + fo, 13-2 & y) <0 --(10-27¢) 


where f(x. y) is the p.d.f. of BVN (0, 0, 1, 1, p) distribution and f2(x, y) is the 
p.d.f. of BVN (0, 0, 1, 1,—) distribution. 

Jt can be easily verified that f(x, y) is the joint p.d.f. of (X, Y) and 
obviously f(x, y) is not the p.d.f. of bivariate normal distribution. 


Marginal distribution of X in (10-27c) 
fc 4] ec y) dy + J file y) dy] 
But ° fi(, y) dy is the marginal p.d.f. of . X, where 
% 1) ~ BVN (0, 0, 1, 1, 9) and is given by X ~ N(O, 1). 
Similarly | ° fo, y) dy is the marginal p.d.f. of X, where 


(X, Y) ~ BVN (0, 0, 1, 1,- p) and is given by X ~ N (0, 1). 


Sx) | Len 4 Tne 


V2n V 2x 
1 -2n 
=" e > —~ COL XC oo 0 
V2n 


=> X ~ N(O, 1) i.e., the marginal distribution of X i.. (10-27c) is normal. 
Similarly, we can show that that the marginal p.d.f. of Y im (10-27c) is 
given by : 


fro) =3[ oe%? + eva] 


21 2n 
eas YR. _wey<oo on (19) 
V2n 
=> Y ~ N(O, 1). 


Hence if the marginal distributions of X and Y are normal (Gaussian), it 
does not necessarily imply that the joint distribution of (X, Y) is bivariate 
normal. 

For another illustration, see Question Number 17, Exercise 10(f). 

We further note that for the joint p.d.f. (10-27c), on using (2) and (ii), we, have 
E(X) =0, oy? =1 and E(Y)=0, o/* = 1. 
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Cov (x, ¥) = EXD =EO EM _ evxy 


Oy Oy 


=3 f xy fi, y) Gauy + J | xy folx, y) dxdy 


=5[p +(-p)] =0, 
because, for f(x, y) , (X, Y) ~ BVN (0, 0, 1, 1,.p) and for f, (x, y), 


(X; Y) ~ BVN (0, 0, 1, 1, - p). 
oe Corr. (X,Y) = athe 


However, we have ; [From Ps is (it)} 


1 173.43 
fi) fr) =zre7 20 *) ) #flx, y) 


=> X ce Y are not independent. 

The above example illustrates that we may have a joint density (non- 
Gaussian) of rv’s (X, Y) in which the marginal p.d.f.’s of X and Y are normal! 
and p(X, Y) = 0 and yet X and Y are not independent. 

16-10-3. Conditional Distributions. Conditional distribution of X 


for a fixed Y is given by 


_Sxy (x, y) 
fyry ly) = fry) 


Pe) -exp| -1 (7 py } 
Vino, Vaz) 20-0 WS 


CRC Cyl 


~ 6, NanV — p?) 
xexp] — — bo ee -n) -p Ho -pyt 
p 201 a 92)0,2 Hy p O> 2 


a 
V2n 0,V(1 - p2) 

1 } 2 

oo - aa=ppas fo (re Go -uo| 


which is the probability function of a unvariate. normal distribytion with mean 
and variance given by . . 


Go 
EX IY =y) =m +p 5 -h2) and VIXIY =y)= 017 (1-97) 
Hence the-conditional distribution of X ‘for fixed Y is given by ; 
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(XIY=y)~N[u, + mae -p,),6,2(1-p2)] (10-27) 


Similarly the conditional distribution of random variables Y for a fixed X is 
Ix) = fxy @, y) 
f YIX yg ) fx(x) 
1 


~N2R 0,V (1 ~ p2) 


xexp| - sq-taoF {7 - a) -p Been | 
2(1 — p?) a2? | 0; ape 
—co <y <0o 
Thus the conditional distribution of Y for fixed X is given by 


(Y |X =x) ~N he +p ZG — 1), 627 (1 - 9%) | ...(10-27e) 


It is apparent from the above results that the array means are collinear, i.e., 
the regression equations-are linear (involving linear functions of the independent 
variables) and the array variances are constant (i.e., free from independent 
variable). We express this by saying that the regression equations of Y on X and 
X on Y are linear and homoscedastic. 

For p = 0, the conditional variance V (Y | X) is equal to the marginal 
variance 6,* and the conditional mean E(Y1X) is equal to the marginal mean 
\t. and the two variables become independent, which is also apparent from joint 
distribution function. In between the two extremes when p = + 1, the correlation 
coefficient p provides a médsure of degree of association or interdependence 
between the two variables. 

Example 10-27. Show that for the bivariate normal distribution : 

wa dP = const. exp [- a (x? — 2pxy + y?)| dx dy, 


(i) M.G.F. is M(t), t2) = exp.[3 (t,2.-+ 2ptyte + t?)] 


(ii) Moments obey the recurrence relation, 
Mrs = (r +S- 1) PMy-1,841 7 (r- 1) (s - 1) (1 - p?) Uy 25-2 
Hence or otherwise, show that + 
Uys =0, ifr + sis odd, Wx, = 3p, U2 = 1 + 2p? 4 
* Delhi Univ. B.Sc. (Stat. Hons.), 1989) 
‘Solution. (i) From the given probability function, we see that 
Hi=O=p, and o,7=1= 0, - : 
. From.(10-26a), we get 
M = M (ty ta) = exp [3.(4? + 2p + 27)] 


(ii) om = M(t, + pts) and* OM = M (t2 + Ph) : 
Ot; Ot, 
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02M 
Ot ~ Ot a. (a) 22 ~ Oth ar, Malte + PADI 
= Mp + (2 + ph) (4) + Pt2)M 
02M oM OM 
Ot 0l> ~ Pho Pl; ~ Pla Wb 
= [Mp + (to+ Pht + pla)M = pty (t) + P)M — pta(te + pty)M] 


= M[tto + p — p2hta] (On simplification) 
= Mp +71 - 02)M tito 

OM __, OM oM 2 

ata pt, at, + Pl Oty +Mp+M (1-p*)til see(*) 


1.2 2 fy" tt 
But M =exp[5(h? + 2ptite + t2”)] = en 


: ene 
7 . ho 
by Ye. ‘(r-1)!(s-D! 
=.) s:=.] 
aa ty" bs 
=, 2 THrs - pEsttP SMrs- ist 
r=Il1s=0 r=Qs=1 
t" to! a ith orth 
+o) 2 hes ist oe ay Tit ris! Car 
r=0s=0 r=0s=0 
r-1 (oe! 
Equating the coefficients te D!'G- az 1) G-pin both sides, we get 


oa [p(r - 1) by1, 91 + p(s — I)My-1, 5-1 + P74, 51 
- +(1— pr - 1)(s— Dpy-2, 5-2] 
=> Ws = (+ S— 1) PH +7 - IS - 1) - p)My-2, 5-2 


In particular 
31 = 3Pi29 + 0 =.3p0,7 = 3p (-,- 0,7 = 1) 
Moo = 3pitis + (1 — p?) Hoo = 3p? + (1 — p?).1 
, =(1+2p’) (-.* Hi = PO; 02 =p) 
Also {93 = M39 = 0 
Hi2 = 2p, +0=0 (++ Hor = Hio = 9) 


Mos = 4pii,2 + 1-2 (1 — p*)Mon = 0 
Similarly, we will get plo, = 0, [iz = 0 
If r+ s is odd, so is (r — 1) + (§ - 1), (7 —2).+ (s — 2), andSo on. 
And Since [39 = 0 = fo3, Hig = O= Hor. Ho3 = O = [a9..., we finally get, 
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Lt, = 0, if r+ s is odd. 

Example 10-28. Show that if X, and X2 are standard normal variates 
with correlation coefficient p between them, then the correlation coefficient 
between X,? ana X,? is given by p27 

Solution. Since X, and X2, are two standard no-mal variates, we have 

E(X;) = E(X>) = 0 and V(X;) = E(X,2) = 1 = V(X>) = E(X,”) 
Mx,, X2 (t), 3) = exp [3 (42 + 2ptyte + to?)] [c.f. (10-26)] 


E(X;? Xo?) - E(X,?) E(X2?) 
N XP) ee 
ow p(Xi*, X2 EX, — (EX \  [E(X24) — (E(X,2)}?] 


where E(X,2X,2) = Coefficient of . in M(t, to) = (2p? + 1) 


E(,‘) = Coefficient of a" M(t), t2) =3 


EX‘) = Coefficient of a in M(t), t2) = 3 
297 +1-1 
p(X;2,X,2) =P = = 
~NB3-DVGB-1 
Example 10:29. The noes X and Y with zero means and standard 
deviations 0, and GO, are normally correlated with correlation coefficient p. Show 
that U and V defined as 


Y x YY 
=—+— and V=— -— 
GO, OG oO, % 
are independent normal variates with variances 2(1 + p) and 2(1 — p) 


respectively. 
Solution. We are given that 


1 _ 1 {x 2px ye 
dF(x, y) = ————— _ “bx Ce 
a 2n0,02V ak 2(1 - p?) (a? O10, * 2] 


—0o < (x, y)< co 


x x 
Also u=~42 yot_-t 
GO, GO, 


nl 2 Lee ae _ Ot SOR. 
Saad a and 5(u WES =>. x= (uty) and y=Q tu v) 
Jacobian ‘of transformation J is given by 


1 1 
7 = 9SHY) - a 251) a6 


19 
~ O(u, v)~ +o - 50, 2 
+i} [(u + v)? + (u - v)?]=3 (U2 + v2) 
1 
dF (u, v) = 
eee oe 


x exp|- 
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a) 
gras Assen |- ar 4 - 4(1—p2) {(i- pres ct+er?) | dua 
1 


= — a7 and 7 1 LAD | Hai 
2nV2(1 — p) V2 + p) exp 2(1+p)2 2(1-p) Ar 


a Ree ee exp {- sarah] 
Van Var+p) 20+ )2 


Pareee) ces a 
. a 2(1-p) id ' 2(1 - oat . 
= [f,(u)du] ((v)dv], (say) 


a en ee 
where a eA aa) exp {- 21 + i 

a pee ee 
mh el 


Hence U and V are independently distributed, U as N [0,'2(1 + p)] and V as 
N [0, 2 (1 — p)I. 
Aliter, Find joint m.g.f. of U and V viz., 
M(t, b)=E (ef1U + t2V) ae 5 [ex + ty)/0, + Y(t, — ty)/02 ] 


and use E(e* * 2F-) = exp [(¢,20,2 + t2202? + 2pt 26102)/2] 
Example 10-30. If X and Y are standard normal variates with co-efficient 
of correlation p, show that 
(i) Regression of Y on X is linear. 
(ii) X+Y and X ~—Y are independently distributed. 
a lei 12.0 as GP 
(iii) Q ea 9 Hd 6 is distributed like a chi-square, i.e., as that of 
the sum of the squares of ae normal variates. 
(Madras Univ. B.E., 1990) 


Solution. (i) c.f. § 10-10-3. 
(ii) Let u =x + y and _ 


a 
sale) aan Vi —p2 exp 2(1 - 2(1 - p2) (x? - Dpxy + | dey 


u+ - 
Now x = 5 ae . 
ox ox 1 1 
We du dv |- | 2. (2 1 
dg a | [i 1 | 2 
ou ov 2 2 


dG(u, v) =C exp - Te ae { 2(u? + v2) — 2p(u? - v?)} dudy 
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4nV1 ~ p? 


ae dG(u, v) = cex|- a7! 4(1 — 4(1 - p2) {qi - p)u2 + (1 + p)v? fs 


a exp 41 +p) : id x c, exp} ery = p)| | 


= [g:(u)du] [22(v)dv], (say). 
Hence U and V are independently distributed. 


where C = 


(iii) Mo(t) -| ~ | e'2 dF (x, y) 


-—-—=| 7 \° exp(tQ) 
20 (1 - p? —0o %.— oo 
x a - at {2 —2pxy + 7 


ae \~ exp (10 - $a 
‘ "Oe (1 - p7) —0o % —oo 
=—_ 1 — =|" ~ -$c1-20] 
on Vil ae? © [i el > A t) |dxdy 


Pur Vl —2)x=u and V(1-2)y=v 


de = — 9 — and dy =-—™ 
(1 — 22) V1 oO 
Py eee, 2 
Also Q a ae - 2pxy + y*] = ee ee ee 


M(t) =———._ 
ot 2nV(1 — p2) (1 - 22) 


co 6 0° 
x] ] alee 20-2? (u2 ~ 2pm + v2) fc 


SGT al eo 
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which is the m.g.f. of chi-square (x?) variate* with-n (=2) degrees of freedom. 
Example 10-31. Let X and Y be independent standard normal variates, 

Obtain the m.gf. of XY. [Gauhati Univ. M.Sc., 1992) 
Solution. We have, by definition : 


Mxyt) = E(e"") -| ~ | ~ e!*Y | f(x, y) dxdy 


Since X and Y are independent standard normal variates, their joint p.d.f. 
fx, y) is given by : 
fx.y) =f@) AO) = greF? eF? ;— 00 <(, y)<o 


co ce =f 2... 2 
Mxy(0) -z| | e > (x meal ar eS 


“2nJ J OP L201 - 2) 


2t.x. 2 
«GTP “GAS a 


If (U, V) ~ BVN (0, 0, 6;?, 6*, p), then we have 


__-! pe oane : 

[- [- , 2(1 - p’) {a7-5e +35) 

—_————— .@ dxdy = 1 
—oo % —oo 200 10>, v1 - p? 


—— {5 - _ pry Z| 
oo oo 21 ~ 2 2 
= | gene 12 FY aedy=2n0.0,V1—p?__ ...(**) 


Comparing (*) and (**) with 
1 
oy a = 
GO," =O, = - 2) and p =¢, we get 
1 1 1 
Myy(t) =x—.2n oe VI - 
ano Ni-P Vi-e 

=> Mxy (t) =(1-2)'7;-1<1t<1 
Example 10-32. Let X and Y have bivariate normal distribution with 


parameters : 
by = 5, py = 10, 02 = 1, of = 25 and Corr (X, Y) = p. 


esa > 0, find p when P (4 < Y°< 16\1X = 5) = 0-954 
[Delhi Univ. B.Sc. (Math. Hons.), 1993, ’83] 


*Chi-square distribution is discussed in Chapter 13 
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(b) If p = 0, find P (X + Y $16). 
Solution. Since (X, Y) ~ BVN (Ly, py, Sx?, Sy”, p), the conditional 
distribution of ¥ given X = x is also normal. 


Anes ~N[p = Hy + Or ey), 6? = 4/7 (1 -p?)] 


(YIx=5)~N[p =10+px2 (5—5), o2=25(1-p2] * 


=N [n= 10, o? = 25(1 ~ 2] 
We want p so that 
P(4<Y < 161X = 5)=0-954 
where 7 Aa pe ae ~ N (0, 1) 


GS 5 V(1~p?) 


4-10. 6 =10 ) 
= p S <Z< ee 


= 0-954 


~ 6 6 
= (= <2 <5) = 0.954 .*) 
But we know that if Z~ N (0, 1), then 
P (-2<Z<2) =0-954 voe(**) 
Comparing (*) and (**), we get 


$22 => 6=3 => o7=9=25(1-p?) 
9 
1- p? = 25 
(b) Since (X, Y) have bivariate normal distribution, 
p=0 => X and ¥ are independent rv’s 
and X ~ N(x ,0x*) and Y~ N(py , dy’) 
X+Y~N (U=npy + py, O? = Oy? + Gy’) = N (15, 26) 


=> p?=i¢ = p=2=08 (-.- 9 > 0) 


Hence 


16 - 15 
P(X+Y $16)=P(zs ~— 
pea) 


where Z A+ D-by (0, 1). 


1 ‘ 
P(X+Y<16)=P(Z < —)=0(\1/N 26), 
( 76 | (126) 
where (z) = P (Z <2), is the distribution function of standard normal variate. 


~ fe) ae 
Remark . P(X +Y < 16) -°(z < 5555)" PZ < 0-196) 


=0:5+P 0<Z <0-196) 
= 0-5 + 0-0793 (approx.) 
= 0.5793, 
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EXERCISE 10(f) 


1. (a) Define conditional and marginal distributions. If X and Y follow 
bivariate normal distribution, find (i) the conditional distribution of X given y 
and (ii) the marginal distribution of X. Show that the conditional mean of X jis 
dependent on the given Y, but the conditional variance is independent of it. 

(6) Define Bivariate Normal distribution. If (X, Y) has a bivariate norma] 
distribution, find the marginal density function fy(x) of X. 

[Delhi Univ. B.Sc. (Maths. Hons.), 1988) 


2. (@) The marks X and Y scored by candidates in an examination in two 
subjects Mathematics and Statistics are known to follow a bivariate normal 
distribution. The mean of X is 52 and its standard deviation is 15, while Y has 
mean 48 and standard deviation 13. Also the coefficient of correlation between Y 
and Y is 0-6. 


Write down the joint distribution of X and Y. If 100 marks in the 
aggregate sre needed for a pass in the examination, show how to calculate the 
proporcoa cf candidates who pass the examination ? 

(b) A manufacturer of electric bulbs, in his desire for putting only good 
bulbs for sale, rejects all bulbs for which a certain quality characteristic X of the 
filament is !ess than 65 units. Assume that the quality characteristic X and the 
life Y, of the bulb in hours are jointly normally distributed with parameters 
given below : 


x Y 
Mean 80 1100 
Standard deviation 10 10 


Correlation coefficient p(X, Y) = 0-60 
Find (¢) the proportion of bulbs produced that will burn for less than 1000 
hours, (ii) the proportion of bulbs produced that will be put for sale, (iii) the 
average life of bulbs put for sale. 
3. (a) Determine the parameters of the bivariate normal distribution : 
8 
fx, y) =k exp| - a7 {(x - 7)? -2(x - 7) (y + 5) + 4(y + 5?) | 
Also find the value of &. 
(6) For the bivariate normal distribution : 
(X, Y) ~ BVN (. 2, 42, 52, 42 
find (i) P(X > 2), Gi) P(X >21Y = 2). 
(c) The bivariate random variable (X,, X2) have a bivariate norma! 


distribution with means 60 and 75 and standard deviations 6 and 12 witha 
correlation coefficient of 0-55. Find the following probabilities : 


(i) P(65 < X, $75), Gi) P (71<¢ X_q ¥ 801X, = 55) and (iii) PAX, —X,'2 15). 
4. For a bivariate normal distribution : 


pe aes: 2 
fury &, Mapa exp {- ae 2(1 ~ p?) (x? - 2pxy+ y i 


— oo < (x, y)<* 


2 
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Find (i) marginal distribution of X and Y, 
(ii) conditional peau a of Y given X, 


(iii) eit aeen of a 0?) [x? — 2pxy + y?], 


and (iv) show that in general X and Y are stochastically dependent and will be 
independent if and only. if p = 0. 
5. Let the joint p.d.f. of X and Y be 


1 
x, y= 
i 21010 V1 - 9?) 
2 = am = 
x exp} me [ies 1) 55 (x=) (y-b2) | 


' 6,2 0; O> 


where — 0 <x <00,—~ co <y<oo,-l <p <1. 
({) Find the marginal distribution of X. 
(ii) ivimd the conditional distribution of Y given X = x. 
(iti) Show that the regression of Y on X is linear and homoscedastic. 
(iv) Find P(3< Y<81X = 7), given that ji, = 3, p. = 1, 0,2 = 16,. 
6,7 = 25, p = 0-6, 
(v) Find the probability of the simultaneous materialization of the 
inequalities, X > E(X) and Y > E(Y) 
Hint. (v) Required probability p is given by 
p = P(X > E(X), Y>E(Y)) = PIX >u,) A (> p,)] 


-| | f(x, y) dx dy 
ae 


em -§ 03 1 1 
- a ——Tr oe oh, 2 
J, J; on v1 - p? 7 2(1 - p?) (u*— 2puv+y ) fu 


Gp ee te 
7 Oo, 7 O> " 


Now proceed as in Hint to Question Number 9(0). 
6. Let the random variables X and Y be assumed to have a joint bivariate 
normal distribution with 
Hy = pl2 = 0, 6, =4,02= 3 and r(X, Y) = 0-8. 
(i) Write down the joint density function of X and Y. 

(it) Write down the regression of Y on X. 

(iii) Obtain the joint density of X + Y and X -— Y. 
7. For the distribution of random variables X and Y given by 


d= k exp| - a X1- py (x2 - - Opry + 92) fd dy; -00 <x Soo, co Sy < oo 
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Obtain 
(i) the constant k, 
(ii) the distributions of X and Y, 

(iii) the distributions of X for given Y and of Y for given X, 

(iv) thé curves of regression of Y on X‘and of X on Y, 
and (v) the distributions of X + Y and X - Y. 

8. Let (X, Y) be a bivariate normal random variable with E(X) = E(Y) =0, 
Var (X) = Var (Y) = 1 and Cov (X, Y) = p. Show that the random variable 
Z=Y/X has a Cauc.y distribution. 

ue [Dethi Univ. B.Sc. (Maths. Hons.), 1989) 
LL (1 — p2)'/2 
Ans, f(z) = = Fe = p+ Ge >| ,— 00 <2 <0, 
9. (a) If (X, Y) ~ N(x, Hy, 5,7, 6,7, p), prove that 
. 1. sin! 
PX >weOY > py) = 74 
[Delhi Univ. M.Sc. (Stat.), 1987) 
(b) If (X, Y) ~ N(O, 0, 1, 1, p} then prove that 
_1., sin-'p 
P(X > ONY > 0) = 7+ on 
[Delhi Univ. B.Sc. (Stat. Hons.), 1990] 


Hint. p = by > ONY > 0) 


= para «|e Ic a te xT 


Putx=r,cos®, y=rsin® => lJlar:0<r<0,0S0<x/2 


n/2 3 ; 
(l-p sin'20) |r drat 
pas lole ol- aan 


Now Rt first w.r. to 7 and then w.r. to 0. 

10. (a) Let X, and X, be two indepenaent normally distributed variables 
with zero means and unit variances. Let Y, and Y, be the linear functions of X, 
and X, defined by 
Y; =m, + l; 1X1 + ly Xp, Y,= m2 + 15,X) + lon Xo 
Show that Y, and Y, are normally distributed with means m, and m., variances 
Mao = b11? + L192, Moa = lo:? + fog”, and covariance p1yy'= dy) Lay + hia loo. 

(b) Let X, and X, be independent standard‘ normal variates. Show that the 
variates Y,, Y, defined by 

Y, = a; + b6y,X, + 2X9, ¥o = ay + b2,X, + b22X2 .are ance normal 
variates and find their mean and variance. 

Hint. Y; and Y, being linear combination of S.N.V’s are also normally 


distributed. To prove that they are .dependent, it is sufficient. to prove that 
HY, Yo) # 0. (c.f. Remark 2 to Theorem 10-2) ; 


satay (x? 2ony + 92) aeay 
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11. (a) Show that, if X ag Y are independent normal variates with zero 
means and variances 6,2 and 6? respectively, the point of inflexion of the curve 
of intersection of the normal correlation surface by planes through the z-axis, lie 
on the elliptical cylinder, 


X2 sy? 
—= 1 

0 

(b) If X and Y are bivariate normal variates with standard deviations unity 
and with correlation coefficient p, show that the regression of X? (Y) on Y? 
(X?) is strictly linear. Also show that the regression of X (Y) on Y? (X?) is not 
jinear. 

12. For the bivariate normal distribution : 

dF = kexp [-$ (x? - xy + y? ~3x + 3y + 3)] dx dy, 


obtain (i) the marginal distribution of Y, and 

(ii) the conditional distribution of Y given X. 

Also obtain the characteristic function of the above bivariate normal 
ditribution and hence the covariance between X.and Y . 

13. Let f and g be the p.d.f.’s with corresponding distribution functions F 
and G. Also let 

A(x, y) = fix) gy) [1 + @& (2F(x) — 1) 2G(y) - DI, 
where !a1< 1, is a constant and h is a bivariate p.d.f. with marginal p.d.f.’s 
fand g. Further let f and g be p.d.f.’s of N (0, 1) distribution. Then prove that : 
Cov (X, Y) =a/n 


14. If (X, Y) ~ BVN (Uy, po, 617, 52”, p), Compute the correlation 
coefficient between e* and e”. 


Hint. Let U =e*, V = e¥. 
w’,, =E(UT.V) =E [e’* + 5¥] 
= exp [rpy + su, + 5 (Po? + s*o,2 + 2prs)] 
(c.f. m.g.f. of B.V.N., distribution : ¢; = 7, t2 = 5] 
Now E(U) =p1'19 ; E(U*) = 29, E(UV) = 11" and so on. 


A (U,V) e 06,9, = l 

ns. p(UV) =—_> 7) 

[(eOr" — 1) (€92” - 1]? : 
15. If (X, Y)~ BVN (0,0, 1, 1, p), find E [max (X, Y)]. 


Hint. max (X,Y) =4(% +) #51X-Y1 


and Z =X-Y~N[0,2(1-p)] [c.f.Theorem 10-6] 
1/2 
Ans. E [max (X, Y)] = ey 
16. If (X, Y) ~ BVN (0, 0, 1, 1, p) with joint p.d.f. f(x, y) then prove that 


a) P(XY >0) = a+ - sin! (9). 
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Hint. P(XY>0) =P(X>O0NY>0)+P(X <O0NY <0) 
=2P(X>0NY >) [By symmetry) 
Now proceed as in Hint to Question No. 9(5). 


0 20 
(b) an | | fix, y) dxdy = 1 + sin-' p 


17. The joint density of r.v’s (X, Y) is given by : 
] 
FY) = 5 exp [— @? + y?)/2] x [1 + xy exp [- @? +? -2)2)]; 


— 00 < (x, y)<o 
() Verify that f(x, y) is a p.d.f. 
(it) Show that the marginal distribution of each of X and Y is normal. 
(iit) Are X and Y independent ? 
Ans. (ii) X ~N (0,1), Y~ NQ, 1) 
(ii) X and Y arc not independent. 


18. Show by means of an example that the normality of conditiona] 
p.d.f.’s does not imply that the bivariate density is normal. 


Hint. Consider f(x, y) = constant. exp [— (1 + x2) (1 + y*)], 20 < (x, y) <0 


. 1 1 
- Then (Ylx)i~N (0. TES a a (Xly)~N (0, EET 
19, For a bivariate normal r.v. (X, Y), does the conditional p.d.f. of (X, Y) 
given X + Y =c, (constant) exist ? If so find it. If not, why not ? 
Ans. No, since P (X + Y=c) = 0. 
20. Let 


4 


1 é | 2 2 


1 1 2 2 
$F §- Fen (4 + 2PY +X Y 
anv E - p? 2(1 - p*) 
—-o <X< 008, —-W KL Y<co 
then show that : 


(i) f(x, y) is a joint p.d.f. such that both marginal densities are normal but 
f(x. y) is not bivariate normal. 


(ii) X and Y have zero correlation but X and Y are not independent. 
[Delhi Univ. B.Sc. (Stat. Hons.), 1989) 
21. Let X, Y be normally correlated variates with zero means and variances 


6)’, 6,¢ and if 
wad za aa {kon 
1° Vr = p) (02 


0 W, z) 1 
Show that Aw, 3 —————————_—_ 
9%) gio,V(1 - p”) 
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1 X2 2pxXY ¥? 
2 2_—_1 _|4° <4PAr I" 
sas Ge ala CES p7) E 102 ~ 5 | 
Deduce that.thé joint probability differential of W and Z is 
ak. _Ly~vv2. 52 
; =a -ex7| 7 iw + 2) | dw de 
and hence that W, Z are independent ae variates with zero means and unit 
S. SD. S (Meerut Univ. M.Sc., 1993] 


Hence or otherwise obtain the m.g.f. of the bivariate normal distribution. 
22. From. a standard bivariate normal population, a random sample of n 


observations (X;, Yj), @ = 1, 2,..., n) is drawn. Show that the distribution of 


z= 5 x2 1 
Zj=" E X? and Z=— 


i=] 


has the moment generating function : 


2 2t Ap2tsts | 
Constant (1 = a (1 _ 2) al 


; t,x? t 2 . 
Hint. Mz,,Z, (h, t2) = E exp & + || 
] il * aaa ‘|. 
tI pools (t- sates)* (Fe) 
to =| 
i - - 20-- 5) eae 
Now use the result 


eo eo ri 
: 2 ae eee 
[c \ exp[— (ax? + 2hxy + by )| dxdy a 
and simplify. 


10:11. Multiple and Partial Correlation. When the values of one 
variable are associated with or influenced by other variable, e.g., the age of 
husband and’ wife, the height of father and son, the supply and demand of.a 
commodity and so on, Karl Pearson’s Coefficient.of correlation can be used-as a 
measure of linear relationship between them. But sometimes there is 
interrelation between many variables and the value of one variable may be 
influenced by many others, e.g., the yield of crop per acre say (X,) depends 
upon quality of seed (X,), fertility of soil (X;3), fetilizer used (X4), irrigation 
facilities (X>), weather conditions (X,) and so on, Whenever we are interested in 
Studying the joint effect of a group of variables upon a variable not included in 
that group, our study is that of multiple correlation and multiple regression. 

Suppose in a trivariate or multi-variate distribution we are interested in the 
relationship between two variables only. The are two alternatives, viz., (i) we 
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consider only those two members of the observed data in which the othe; 
members have specified values or (ii) we may eliminate mathematically the 
effect of other variates on two variates. The first method has the disadvantage 
that it limits the size of the data and also it will be applicable to only the data in 
which the other variates have assigned values. In the second method it may not 
be possible to eliminate the entire influence of the variates but the linear effeg; 
can be easily eliminated. The correlation and regression between only two 
variates eliminating the linear effect of other variates in them is called the partig| 
correlation and partial regression. 

10-11-1, Yule’s Notation. Let us consider a distribution involving 
three random variables X,, X_. and X3. Then the equation of the plane of 
regression of X, on X2 and X; is 

X, = a+ by7.9Xp + 513.2% .- (10-28) 
Without loss of generality we can assume that the variables X,, X2 and X, 
have beén measured from their respective means, so that 
E(X)) = E(X2) = E(Xs) = 0 
Hence on taking expectation of both sides in (10-28), we get a = 0. 
Thus the plane of regression of X, on X, and X, becomes 
X, = by2.3.X2 + by3.3.X5 ..-(10-284) 
The coefficients 5)2.3 and 5,3.. are known as the partial regression 
coefficients of X;.on Xz and of X, on X3 respectively. 
€1.23 = b12.3X2 + bi3.2 X35 
is called the estimate of X, as given by the plane of regression (10-28a) and the 
quantity 
X 1.23 =X) — b12.3X2 — 513.2X3, 
is Called the error of estimate or residual. 

In the general case of n variables X,, X2, ..., X,, the equation of the plane of 

regression of X, on X>, X3, ..., X, becomes 
X1 = by 2.44.--nX2 + O13.2...nX3ZH--. + bi n-23-++(n-1) Xn 

The errcr of estimate or residual is given by 

X}.23-004 =X,- (012.34. -+n%2 + b13.24---nX3 +. + bin.23--(n-1) X,) 

The notations used here are due to Yule. The subscripts before the dot (.) are 
known as primary subscripts and those after the dot are called secondary 
subscripts. The order of:a regression coefficient is determined by the number of 
secondary subscripts, e.g., 

Bi 2.3) B12.34) «++» B12.34--. 
are the regression coefficients of order 1, 2, . “(t — 2) respectively. Thus i in 
géneral, a regression coefficient with p-secondary subscripts, will be called a 
regression co-efficient of order ‘p’. It may be pointed out that the order in which 
the secondary subscripts are written is immaterial but the order of the primary 
subscripts is important, e.g., in b)2.34...,, X2 is, independent while X, is 
dependent variable but in 5;.34..., ,X1 is independent while X2 is dependent 
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variable. Thus of the two primary subscripts, former refers to dependent variable 
and vie latter to independent variable. 


The order of a residual is also determined by the number of secondary 
subscripts in it, €.g., X).23, X1.234, -.-, X1.23-.-_ are the residuals of order 2, 3, 
_,., (n— 1) respectively. 

Remark. In the following sequences we shall assume that the variables 
under consideration have been measured from their respective means. 


10:12. Plane of Regression. The equation of the plane of regression 
of X, on X2 and X;3 is 
X1 = by2.3X3+ 613.2 X3 .. (10-29) 
The constants b’s in (10-29) are determined by the principle of least 
squares, t.e., by minimising the sum of the squares of the residuals, viz., 
S = X4.23? = LX — b123 X2 - 6132 X3), 
the summation being extended to the given values (N in number) of the 
variables. 
The normal equations for estimating b;2.3 and b;3, are 


§ 
d =Q=-2 2 X(X, — b12.5X2 - 613.2 X3) 
db12.3 


35 .. (10-30) 
= 0 = -2 2 X3(X — b12.3 X2 - 513.2 X3) 
0b13.2 
i.é., LX. X1.23=90 and 2X3X1..3=0 .. (10-302) 
2X 1X2 - b12.3 B Xo? = b13.2 E X2X3 = of 
DX X3 = b12.3 > X7X3 = b13.2 Dy X 37 = 0 { 
Since X;,’s are measured from their respective means, we have 


Oo; = ai X;7, Cov (X;, Xj) =< XX; 


_ Cov (X;, Xi) _ £X;X; tee?) 
and i 9,0, NG; 9G; 
Hence from (10-305), we get _ 
712 0102 — 512.3 G2” — by3.2 723 5203 = 0 
, 113 8103 — y2.3 7230203 — 613.2 63? = 0 
Solving the equations (10-30d) for b,2,3 and b,3.2, we get 


=> ...(10-305) 


...(10-30d) 


M1291 123 93 
r13 9, O03 


M12 23 
riz 1 


Od a 2 ...(10:31) 
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Similarly, we will get 


1 Si2 
roan 
0} 
bisn =: ...(10-31 
132 = & i a ( a) 
ro3, 1 
If we write 
1 ri2 ri3 
@oO= | Ta 1 rn (10-32) 
r3i r32 l 


and w,; is the cofactor of the element in the ith row and jth column of @, we 
have from (10-31) and (10-31a) 
GO, Wi2 
—,—~* and b32.=-=.—- . (10-33 
0, M11 ake O63 Wi ur?) 
Substituting these values in (10-29), we get the required. equation of the 

plane of regression of X, on X_ and X3 as 

Oo, ® Oo, @ 

X,=-2- —12.y, = nl —l3.y 
02 Mir 03° 11 


bi23 = 


X X X 
=> Gg, Out G O12 GOs = 0 .. (10-34) 
Aliter. Eliminating thé coefficient 62.3 and b,3.2 in (10-29) and (10-304), 
the required equation of the plane of regression of X, on X2 and X3 becomes 
x X, Xx; 
117092 of T7300, | =0 
; 1139;93 73003 o,? 
Dividing C,, C2 and C3 by 6, 62 and G3 respectively and also R» and R3 
by 62 and 6; respectively, we get 


X, X2 Xs 
6, OG G3 
=0 
r\2 1 'n \ 
| M3 Tn 1 
X1 X2 Xs 
=> +012 + = O13 =0 
3G; O11 » 12 G4 13 


where @,; is defined in (10-32). 
10-12-1. Generalisation. In general, the equation of the plane of 
regression of X, on X2, X3, :.. X, 1S 
Xy = 12.34... X2 + 513.24. X3 + 00. + Bins...(n-1 Xn .- (10-35) 
The sum-of the squares of residuals is given by 


§ = & X44.23...n 
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= D(X — by 2-34. X2 — 513-2400 X30 — Bins. Xa)? 
Using the principle of least squares, the normal equations for estimating the 
(n -~ 1), b’s are 


os 
0 22, XK, — By 2.54, X 2 — 813-24... X39 — «0 — Oiny:.ccn — 1% 0) 


OD 1 2:34...8 


as 
—_—_—_— = 0 = -2y X3(X, a b12.34...242 by3.24...043 — = bi n.23. {a - 1) Xn) 
0D43.24...8 
| | | | 
0s , 
= 0 = 20, XX, — Bh 2:36, X2— Opts icn X3 — 00 — Oy n23...0n 1) Xn) 
Abin 23...(a - 1) 
...( 10-36) 
L.é., LX; X1.23...n = 0, (i = 2 3, coe n) .--(10-36a) 
which on smplification after using (10-30c), give 
7170182 = by 2,34, nO2? + b13.24...n 238203 +... + b1n23..(n—1) 7 20F25n 
7130103 = by2.34 nl2308703 + 513.24. nO32 +... + bi n23...(n— 1) 73n030,, 
Fin O1On = 01 2.34...nF2nF2n + 13.24...nT3nF3n + --. + Ons... (9 —1)Fn? 
...(10-365) 
Hence the eliminant of b’s between (10-35) and (10-365) is 
X| X2 X34 ay Xp 
r190102 0", T2308 a TOL 
T3003 238205 oe vee T4f8Sn 
=0 
ry 2019, 52,02, 13,030, ss o,2 


Dividing C,, C2, +11yCn DY G1, 2s... Gn respectively and R>, R3, ... R, 
by 02, G3; ..., G, respectively, we.get 


xX Xo X3 Xn 
re 1 30 kes Fan, 
ry rn 1 tae ly, =0 .--(10-37) 
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If we write 
1 Tin Na wes Tin 
rn 1 Fy dss Toy 
ry é 9) 1 eee V2, 
@ = ; oe .(10-38) 
er 1 


and (;; is the cofactor ofthe element in the ith row and jth column of @, we get 
from (10-37) 


X2 x4 Xn 
= O11 + O10 ¢F O14 + ... + 1, =U ..-(10- 
1 t & O12 # g, O1s 5, Ol (10-39) 


as the required equation of the plane of regression of X, on X2, X3, .... Xn. 
Equation (10-39) can be re-written as 


GO} 912 0, 13 O1 Min 
Xi =—", ), nara reed 3 oom. xX ...(10-39¢ 
O72 O11 03 O11 On O11 ” ) 


Comparing (10-39a) with (10-35), we get 


_ G1 py 
b12.34...0 — O.° O11, | 
Oo, @z 
b 13.24... BiG G. 
.. (10-40) 
b1n23...(n-1) =~ Go) Oin 
On ‘O11 


Remarks 1. From the symmetry of the result obtained in (10-40), the 
equation of the ‘plane of regression of X;, (Say), on the remaining variables X; 
G#i= 1, 2, ..-, M), IS given by 
Son + on +... + ofa +>... se O,=0;i=1,2,,n 

.--(10-41) 
2. We have 


b =k 
12-34...0 = Oo, . Or 
2 


and bo. s-7— 
21-34... ro 09 


Since each of 01, O2, M1; and Wo iS non-negative and @12 = 21, [cf. 
Remarks 3 and 4 to §10-14, Page 10-113}, the sign of each regression 
coefficient Dy 234_, and bz)... depends on @)>. 


Correlation and Regression ; 10.109 


10°13. Properties of residuals 


Property 1. The sum of the product of any residual of order zero with any 
other residual of higher order is zero, provided the subscript of the former occurs 
among the secondary subscripts of the latter. 


The normal equations for estimating b’s in trivariate and n-variate 
distributions, as obtained in equations (10-30a) and (10-36a), are 

> Xo X1-23 = 0, > X3 X 1.23 =0Q 

and DX; X1.23..,n=03i1=2, 3, coon 
respectively. Here X;, (i = 1, 2, 3, ..., m) can be regarded as a residual of order 
zero. Hence the result. 

Property 2. The sum of the product of any two residuals in which all the 
secondary subscripts of the first occur among the secondary subscripts of the 
second is unaltered if we omit any or all of the secondary subscripts of the first. 
Conversely, the product sum of any residual of order ‘p’ with a residual of order 
p+4q, the ‘p’ subscripts being the same in each case is unaltered by adding to 
the secondary subscripts of the former any or all the ‘q’ additional subscripts of 
the latter. 

Let us consider 

D X1.2X1.23 = LX) ~ by2X2)X 1.23 = 1 X1X1.23 — by2 DX2 X1.23 
= > X1X1.23 (cf. Property 1) 
Also x X 1.23" = a X1.23 X 1.23 = z (X, = 512.3 X2 - b13.2 X3) X1.23 
= 2X X1.23 — 52.3 E X2 X1.23 — B3.2 2X3 X1.23 
= 2 X, X1.23 (cf. Property 1) 
LX Xy.23" = 2 X12 X1.23 = DX1X1.23 

Again »y X 1.34... X2.34...n 

= DK) — by3.4...n X3 — O14.35..2X4— +. 5 in34...(n-1) Xn) X2.34..0] 

= X)X2.34..0 (cf. Property 1) 

Hence the property ? 


Property 3. The sum of the product of two residuals is zero if all the 
subscripts (primary aswell as secondary) of the one occur among the secondary 
subscripts of the other, e.g., 


© X12 X312 = D(X) — by X2) X32 =D X1X3.12 — bi E Xo X3.12 = 0 
(c,f- Property 1) 

Dy X2.34...n X1.23...9 
= 2X2 — b23.4...0X3 — 524.35,..0 X4 — 00+ = Don 3,..net) Xn) X1.23...0] 
=D X2 X1.23,..n — 523.4...n X39 X1.23,,.0 — O20.35,..0 2 X49 X1-23...0 

woo Dan34,..(n-1) 2 Xn X1.23,..1 
=0 (c.f. Property 1) 
Hence the property 3. 
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10-13-1. Variance of the Residual, Let us consider the plane of 
regression of X, on Xo, X3, ..., Xq viz., 


X1 = by2.34...n X2 + 013.24...2 X3 +... + Oin23..(n 1) Xn 

Since all the X;’s are measured from their respective means, we have 
E(X) =0;i=1,2,...,.n = E(X1.23,..)=0 

Hence the variance of the residual is given by 


l Be 1 
071.23. .2 = N LIX a3..2 — E(X1.23.,. DI)? = N >. CP 


1 1 
= yy © X1.23...0X1.23...0 = py 2X1X 1.23. 0, 
(c.f. Property 2 § 10-13) 
l 
= Hy Ue Xs XK) = 32.34... X2 — Bi 3.24.,.n X3 — ++ — Byn-23.. (a a1) Xn) 
= Oy? — by 2.34.9 7128182 — 513 24. 7130193 — «6+ — Bin23, (n-1) 1nF1 On 
f 
ea Oy? — 071.93. = 5 12.34...0 7128182 — 513.24...n 1130193 — «.. 


— 514.23 .(a=1) inp -- (10-42) 
Eliminating the b’s in equations (10-42) and (10-365), we get 


G7 — 67) 23. 11720,02 ... 71,015, 

112010 Oy 2.2028, 
ie | =0 

r) 2010, 1,02 On oe 6,2 


Dividing Ri, Ry, ..., Ra, by G1, G2, ..., 0, respectively and also Cy, C2, 
..., C, by Gj, Oo, ...,.G, respectively, we get 


pO r . r 
0,2 12 eee la 
r)2 ] eee 2, 
. =0 
Tin Lom 1 
2 
O*1.23...n 
1 - — r eee ry. 
6,2 12 ln 
r + eee Ve 
2h 12 + 0 l m | 9 
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1 V12, wee Tq 071.03. 
2 12 Tin 
1 91 
ri2 To, 0 1 
r 
o 2n =( 
Tin Tan 1 0 Ton l 
2 
Le] 1-23... 
=> @ — = ,, =0 
6,2 11 
O?,. =6),* — ...(10-43 
1-23...7 ee ( ) 
Remark. In a tri-variate distribution, 
0) 
61.237 = 6; a .-.(10-43a) 
1 1 


where @ and @,, are defined in (10-32). 

10:14. Coefficient of Multiple Correlation. In a tri-variate 
distribution in which each of the variables X,, X>, arid X3 has N observations, 
the multiple correlation coefficient of X, on Xz and X3, usually denoted by 
R;.23, is the simple correlation coefficient between X, and the joint effect of Xz 
and X3 on X,. In other words R;.23 is the correlation coefficient between X, and 
its estimated value as given by the plane of regression of X, on X-, and X3 viz., 

€1.23 = 2.4X2 + 32X35 

We have X 1.23 =X, ~ b)2.3 X27 — 614.2 X3 =X, — €1.23 


=> €1.23 =X, —X}4.23 
Since X;’s are measured from their respective means, we have 
E(X1.23) =0 and E(e;.23) =0 (-- E(X;) = 0; i = 1, 2, 3) 
By def., 
Cov (X 
)> ©)-23 4 
1.23 = ov Ay. 1:23) 4, .(10-44) 


VV(X1) V(e1.23) | 


Cov (X), 1.23) = E[(X, — EX 1)) (1.23 — E(e1.23))] = E(X, 1.23) 
=n EX €1.23 EEX, (X; — X1-23) 
a5 EX,2- VEX Mas =F EX? ~ FEM 35 
= 0,2 -0}.23” (c.f. Property 2, § 10-13) 


] ] 
Also Vie, 23) = E(e1.23") = N >y €1.237 = N »y (X, = X}.23) 


X (X1? + X1.23? — 2 X1X 1.23) 


l 2 
LX\7 + N & X1-237 —y &X1X1.23 
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1 1 2 
=N ~X;? + N X X4.237 —“N > X4.937 
= 612 -0}.2;" (cf. Property 2, § 10-13) 
Rcd 0)? — 0}.237 
V 612(12 - 61.232) 
0)? — 4.23” 01.237 
2, we ce a en) LS 
=> R ane G2 1 G,2 
2 01.237 
=> 1 — R%).23 = 62 F 
Using (10-43a), we get , 
i eRe a ...(10-45) 
where 
1 ry ry 
@= To) 1 ry =] - rie ris? _ raat + 2rjori3%oxXOn simplification) ‘ 
ry) I's 1 
1 rn 
and 11 = r> iL = 1-123? 
Hence from (10-45), we get ; 
247137-2 
R2,5,=1- Fig’ 3 O12 113 23, .-.(10-45a) 


M11 L — r3" 
This formula expresses.the multiple correlation coefficient in terms of the 
total correlation coefficients between the pairs of variables. 
Generalisation. In case of n-variate distribution, the multiple correlation 
coefficient of X,; on X2, X3, ..., X,, usually denoted by R,.23.., , iS the 
correlation coefficient between X, and 


€1.23..n =X, —X1.23...n 


Cov (X, €1.23...n) “= 
R123... = 
VX) Ve1.23...0) 


1 1 
Cov (M1, €1.23...n) =H vx €1.23...0 = ne x1 — X1.93...n) 
et yx2-tyxx 
N 1 N 1441-23... 
1 1 
= UXP DX ZO =O a3 CD 


1 1 : 
Vein.) =H L e71.23...n= N D(X, — X1.23...n)? 
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et LX (X12 + X4.93.n — 2X1X1.23...n) 
: 1 2,1 2 1 xX 
= 77 wei + NX xX 1-23...a 7 2 N >X1 1-23...a 
2 
= X ,? +S Ex 1-23...a —N DX" 1.23...1 


=.6,7 — Sea 


Oi7- 07 1.23..n ony = a) = 


R, 23.u0 = 2 
© ooo 7 e x Oo 
V6,2(6,? — 61.23, 0) l 
O71.23...n 0) 10-45 
as ‘ eee —_ — eco Cc 


where @ and @,; are defined in (10-38). 
Remarks 1. It may be pointed out here that multiple correlation 
coefficient can never be 5a a because from (*) and (**), we get 
Cov (X41, €1.23...0) = 91? — 074.23, = Var (@1.23,..n) 2 0 
Sitice the sign of R1.23.., depends upon. the covariance. term 
Cov'(Xi, €1.23...,), we conclude that R}.53,, 20. 
2. Since R?,.3., 20, we have : 


1 _© 39 => OS 011 ..(10-45d) 
11 
3. Also,Rio3.nS10 => 1-“<1 
O11 
=> 0 <2? x ~— >0 => o20 .-(10-4Se) 
M11 M11 
From the above results, wé get 
M1; 2@ 20 .-(10-45/) 
In general, we have 


Oj; 2.0°; i= 1,2, coo N 
4. Since @ is symmetric in r;;’s, we have 
; 0); = Oi 5 Lz] = 142, ...,7 (10-452) 


10:14-1. Properties of Multiple Correlation Coefficient 
1. Multiple correlation co-efficient, measures the closeness of the 
association between the observed values and the expected values of a variable 
obtained from the multiple linear regression of that variable on other variables. 
2. Multiple correlation coefficient between observed values and expected 
values, when the expected values are calculated from a linear relation of the 
variables determined by the method of least squares, is always greater than that 


where expected values are calculated from any. other linear combination of the 
variables, 
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3. Since Rj.93 is the simple correlation between X, and é).23, it must lie 
between —1 and +1. But as seen in Remark 1 above, R}.23 1s a NOn-negative 
quantity and we conclude that 0 <R, 23 21. 

4, If Ry.23 = 1, then association is perfect and all the regression residuals 
are zero, and as such G?;.3 = 0. In ths case, since X, = €;.,, the predicted value 
of X,, the multiple linear regression equation of X; on X_ and X3 may be said to 
be a perfect prediction formula. 

§, If Ry.23 = 0, then all total and partial correlations involving X, are zero, 
[See Example 10-37). $o X1 is completely uncorrelated with all the other 
variables in this case and the multiple regression equation fails to throw any 
light on the value of X, when X, and X3 are known. 

6. Rj.23 is not less than any total correlation coefficient, i.e., 


Ry1.93 21125 713) 123 


10:15. Coefficient of Partial Correlation. Sometimes the 
correlation between two variables X, and X_ may be partly due to the Correlation 
of a third variable, X3 with both X, and X5. In such a situation, one may want 
to know what the correlation between X, and X_ would be if the effect of X, on 
each of X, and X, were eliminated. This correlation is called the partig| 
correlation and the correlation coefficient between X; and Xz after the linear , 
effect of X3; on each of them has been eliminated is called the partial correlation 
coefficient. 

The residual X,.3 =X, —6,3X3, may be regarded as that part of the 
variable X; which remains after the linear effect of X3 has been eliminated, , 
Similarly, the residual X2.3 may be interpreted as thé part of the variable x, 
obtained after eliminating the linear effect of X3. Thus the partial correlation 
coefficient between X, and X>, usually denoted_by 7,53, is given by 


r — a eS ME 13s 2* 237 _. _. 
; ( ) ...(10 6) 


We have 
1 1 
Cov(X 1.3, X2.3) = N 2 X13X2.3'= N LX1X2.3 


1 1 1 
=H UXi1X2 ~by, X3) = UXiX2 — bo; wv XiXs 
= 112002 — 13 2 - (7130103). 


= 61902 ("12-713 723) 


1 1 em 
Also V(X;.3) ayy UX = yy PX13X13 


1 1 
= Ds X1X1.3-5 N >» X(X, = b13.X3) 


~ 
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1 1 
=u Xi? — by3. nv Xiks 
fe) 
=6)’—113 5, 7130193 
3 


= 6)°(1 ~ 713”) 


Similarly, we shall get 
V(X23) = 627(1 — 723") 
Hence 


aaa O192(712 — 713 F23) _ —_12 = 713 723 ,..(10-46a) 
Vo)2(1 - 713) 627(1 — 7937) Ya- r13*) (1 — 7237) 
Aliter. We have 
O = 2 X23X1.23 
= © X25 (Xp — by23 Xo — O32 Xs) 
= 2X1 X23 - b12.3 X23X2 - 613.2 DX23X3 
= 2 X1.3X23 - 51232 X23X23 
Dias 
x X23" 
From this it follows that 5,5 is coefficient of regression of X;.3 on X>.3 
Cimilarly, b;.3 is the coefficient of regression of X23 on X.3.- 


Since correlation coefficient is the geometric mean between regression 
coefficients, we have 


b12.3 = 


7219.3 = 2.3 X b21.3 


But by def., 
@ Or W2 

biog =- ot = and by 32-4. 4 

12-3 Os’ Oi 21.3 3, On 

ro (SL 12) (_92 @21)__ O12" 

te O2 11 OG M22} M11 O22 

(++ O)2 = @21) 
@ 

- 12 


the negative sign being taken since the sign of regression coefficients is the 
same as that of (— @)2). 


Substituting the values of @ 2, @,; and @2. from (10-32), we get 
r23= T12 — 413723 
V (1 = 17132)(1 = 7232) 


Remarks 1. The expressions for 713.2 and r3.; can be similarly obtained, 
to give 
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a 5p 
ee V0 = 112) (1 = 732?) OY N= rad — P31) 

2. If ri23 = 0, we have then 7,5 = 1,3 723, it means that 7,2 will not be Zerg 
if X; is correlated with both X, and X2. Thus, although X, and X2 may ph, 
uncorrelated when effect of X;3 is eliminated, yet X; and X_ may appear to he 
correlated because they carry the effect of X3 on them. 

3. Partial correlation coefficient helps in deciding whether to incInde or no, 
an additional independent variable in regression analysis. 

4. We know that o,2(1 — r12”) and ,2(1 — r;32) are the residual variances jf 
X, is estimated from X». and X3 individually, while o? (1 — Rj.23*) is the 
residual variance if X, is estimated from X2 and X3 taken together. So from the 
above remark and R).23? > 7,27 and r,32, it follows that inclusion of an addition 
variable can only reduce the residual variance. Now inclusion of X3 when X; has 
already been taken for predicting X,, is worthwhile only when the resultan; 
reduction in the residual variance is substantial. This will be the case when r,,,, 
is sufficiently large. Thus in this respect partial correlation coefficient has its 
significance in regression analysis. 

10-15-1. Generalisation. In the case of n variables X:, X2...,X, the 
partial correlation coefficient 712.34, between X; and X> (after the linear effect 
of X3, X4, ..., X, on them has been eliminated), is given by 


7712.34... = 512-34,..n X 021.34...2 


But,-we have 
b12.34....= 7 1 O12 
and “G2 Oi tee Equation (10:40). 
O2 @21 ae 
b21.34..1 = — ai ey 


72 -( O} — 02 a oe 
1234..a7 | ~ ° 17 


12 


> "32.34..2n=—- 
V 11022 


negative sign being taken since the sign of the regression coefficient is same as 
that of (—;2). 


10-16. Multiple Correlation in Terms of Total and Partial 
‘Correlations. 


1 — Ry.937 = (1 = r2”)(1 -— 713.27) .-+(10-46¢)' 


(10-466) 


——~ 
Proof. We have 


1 _ F427 + M3? - 2r12 r13 'o3 


l= R423" 5 137 


A = 193? = rip? = 13? + 2712 713 723 
1 ~ 723? 
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Also 


> _ | - (ri3 — 112193)? _ l — 19? — 93% — 144? + Aryory4: 
(1 = ry22)(1 = 13?) (1 = ry9?)(1 - 937) 


Hence the result. 


| — 113.2 


Theorem. Any standard deviation of order ‘p’ may be expressed in terms 
of a standard deviation of order (p — 1) and a partial correlation coefficient of 
order (p — 1). 


Proof. Let us consider the sum : 
D X93 8 = LX 1.23...0 X1-23...0 

= 2X1 23 (-1)X4-23...0, . 

(c.f. Property 2, § 10-13) 
= UX r23..0—1) Xb = 542-44. X29 — «Dan y.23. Xn 

— bin23..0n = 1) Xn] 

= 2 Xy.23.0-1) Xi ~ Pinas in -) D X1-23..00- Xn 

(c.f. Property 2 § 10-13) 
=2 X71.93. 00 ~ 1) On 23...n = 1) 2X 4.23.00 ~ 1) X0-23...0 =) 


Dividing both sides by N (total number of observations), we get 
074.93, 0 = O71 23..n= 1) — O1n-23, (n— 1) COV (X1 23 n= ty X23 «(in - D) 


The regression coefficient Of X23. (n— 1) ON X1.23.,.¢n— 1) IS given by 


ee e Cov (X1.23.. n— t), X0-23...n - 1) 
23..(-1) = 
" oe 0°1.23...(0- 1) ; 
071.23. i 074.93 ..(t — 1) [1 a Bi n.23. (n- 1)-Dn1-23...(n - »] 

= 071.23. (n-1) [] — at 23... -))> ...(10-47) 


a formula which expresses the standard deviation of order (# — 1) in terms of 
standard deviation of order (nm — 2) and partial correlation coefficient of order 
(n — 2). If we take p = (#2 — 1), the theorem is established. 


Cor. 1. From. (10-47), we have 
07193. (n= 1) = 971-23. -2 CL — Pin 23.02) -«-(10-47a) 
and so on. Thus the repeated application of (10-47) gives 


074.93 9 = Oy2() = ry?) C1 - 13.97) Cl - rigng?).- A — Pinas. — 1) 
.(10-47b) 


Since partial correlation coefficients cannot exceed unity numerically, we 
get from (10-47), (10-47a), and so on, 
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071.23...n S  071.23,..n=1) 
O71.33..00-1) S 97 1.23..4n-2) 


| O1.25” S$ 01.2” 
01.27 < 0,2 
> 01; 201.2 2 0}.23 2 ... 20}.23...0 ...(10-47¢) 


Cor. 2. Also, we have 
071.23,..n = 012(1 — R713... n) 
On using (10-475), we get 
1 — R193. = (1 —7127)(1 - 7413.2”)... — Pins.) ...(10-47d) 
This is the generalisation of the result obtained in (10-46c). 
Since Irigy lS 1;s=0,1,2,...,(@a- 1), 
where 7;;/.) is a partial correlation coefficient of order s, we get from (10-47d) 
1- R123. 4S ¥-1127 
1 R193. .n $1 -7%13.2, 
and so on. 
Le, R2193. on 21127 713.05 009 P21 23,..(8 = 1) -..(10-47e) 
Since R}.33..., is Symmetric in its secondary subscripts, we have 
2 2 i a 
R 1-:23...A 2 Vi}; i ¢ = 2, ooey n) | ...(10-47f) 
R*1.03..0 201i G4 = 2, 3, ..., 2) 


and so on 


10-17. Expression for Regression Coefficients in Terms of 
Regression Coefficients of Lower Order. Consider- 


2 X1.34..0X2.34..08 = 2 X1.34..(n-1)X234...0 
= 2 X1.34...¢n -1)(X2 — b23-4,..0X3 —- + Bansa,..dn— 1) Xa) 
= 2X 1.34 .n-1)X2 7 Ban34...n-1) 2 X1-34.,(n- 1 Xn 
= X1.34...¢n- 1) X234..0-1) 
— on, (n—1) 2 X1.34..(n—-1) X04. (n-1) 
Dividing both sides by N, the total number of observations, we get 
OV (X 1.34.9, X2.34...0) = COV (X1.34.. ww -1),X2-34...0n- 1) 
— bon 34, —1) COVA. 34,,.(n— 19% 034.01) 
b12,34,..n O72.34...n = 012.34,..(n- 1) 072-34...(n-1) 


— bon 3a... n—1) O1n-34...(n—1) 970 34...(n-1) 
On using (10-47), we get 
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b12.34...1 075.34...(n -1) (1 _ Pon 34...(n- 1} 
= 07 2.34...(n -1) [b12-34...¢n- 1) - bon.34...(n -1) bin34..(n -1) 
2 
O'n-34...(a - 
; x “eae (a a] .(*) 
O"2.34...(n - 1) 


In the case of two variables, we have 
o2 


by G7 = by, 7 =Cov (X;,X)) = by= os bi 
G7 
07 4.34...(n— 1) 
DO a34.n—1) 
Hence from (*), we get 
b19.34...n972.34...(n ~1) ( 1- Pr 0n.34..(n- 1)} 


D2n34...(n = On2.34...(n —1) 


in 00.34. (n -1) (512.34...(n -1)7 Bin34...(n -1) bn2.34...(n - 1] 


...(10-48) 


| ...(10-48a) 


ea ere (n-1)> Dae. (n=1) 


10:18. Expression for Partial Correlation Coefficient in 
Terms of Correlation Coefficients of Lower Order. By definition, we 
have 


Gi. lm...t (*) 


b;: =r x 
te ee Oj... 


Bin-34...(n - 1)-On2.34...(n - 1) 


© 1.34...(n = 1) 


O'n-34...(n - 1) 
= 1 1n-34...(n- 6.54, P 1) TaD 3. (n- 
ap 


» O2.34...(n - 1) 


O1.34...(n-1 
= Tw 34...(n— 1)° Tn234,..(8 1) ° ETRE . : -»(**) 
a- 


Hence from (10-48), on using (*) and (**), we get 


91.34. 

92.34...0 

(risa —T1n.34...(n—1) 1n2-34,..(n~1)} Si Mati= | (#48) 
1 ~ Fe on.34,..(n -1) O2.34..0-1)) ¢ 


12.34.48 xX 


Also on using (10-47), we get 


F134. _ F1-34..(a—1) Hi ~ 715534, fn - at" 
O2.34...2 O2.34...(n -1) 1- 7 on34...(n -1) 


Hence from (***), we get 
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i 
: L — 7714.34..(n-1) | 2 
1234.8 TD ron 34. (nH 1) 


_ ao — Fin-34...(n-1) 02-34... | 


=> V12.34...8 
12-%., 7 (1 — 7 34.34.. (a-1y) (1 ae a2-34...(a - 1)) 


which is an expression for the correlation coefficient of order p = (n — 2) in 
;terms of the correlation coefficient of order (p ~ 1) = (n —3). 

Example 10-33. From the data relating to the yield of dry bark (X,), 
height (Xz) and girth X3 for 18 cinchona plants the following correlation 
coefficients were obtained : 

Y\2>= 0-77, 135 = 0-72 and 93 = =,0:52 

Find the the partial. correlation coefficient r\2.,and multiple correlation 
coefficient Ry.23. 

Solution. 

~——lia-ria a3, __0-77 = 0-72. X 0-52 


"123 = = 
N(1 = 713?) 17332) VE - 0-72)7111 - (0-52)?] 


24.7.2 2 r 
Pig" +7 3% — 2712713723 
Rj.23? = 


1 — ro37 
_ (0-77)? + (0-72)? - 2 x 0-77 x 0-72 x 0-52 
1 — (0-52)? = 0.7334 
R123 = + 0-8564 


(since multiple correlation coefficient is non-negative). 
Example 10-34. In a trivariate distribution : 
= Z. O2= C3 = 3, Pia = 0-7, Fo3 = 73, = 0-5. 
Find (1) F231, (i) Ry.23, (tii) b12.3, 913.25 and (iv) 61.23. 
Solution. We have 
() roy =e g 0-5 = (0-7)(0°5)___ _ 9.9495 
Vl = roi) (1-72) V1 — 0-49)(1 — 0-25) 


2 2 
: 132" + 713° — 2712713 723 
ii) Ry.24* = 
(ii) R123 1 F232 


0-49 + 0-25 ~ 2(0-7)(0-5)(0-5) _ 
1-025  ™=%52 


R1.23 = + 0-7211 


oe 01.3 

Git) By2.3 =F123 =a 

2-3 
ri2— "1423 114 —"32%a2 

= (0.2425 


r12.3= = 0-6, 713.2= 
> V(1 = r432)(1 — 7932) V(1 - 7192) — 732?) 


CG e 
and = 513.2 = 713.2 an 
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61:3 =0, V1 — 7432) = 2 V(1—0-25) = 1-7320 
62.5 = O62 V (l- 13°) =3 Va —0-25) = 2-5980 
O}.2 =O; Va - 112) =2 ‘ (1 -0-49) = 1.4282 
63.2 = 64 Va v7 1492) =3 V (1 -— 0:25) = 2-5980 


Hence 6j7.3 =0-4 and 63.2 = 0-1333 


w 
Vv) O1.23 = 0 J 
(iv) 1.23 =O; Ou 


r] 
~ 


1 Ty «=—*'713 

where © ={ fra 1 ry |= 1 —ry22—ry3?= ry? + 2r,arigre3 = 0-36 
Tz, Ta 1 z 
1 ‘rn 

and @)) = ; 1 = 1 -~#23,7 = 1-0-25 = 0-75 


O123 = 2x V0-48" =2 x 0.6928 = 1-3856 
Example 10-35. Find the regression equation of X, on Xz and X34 given 


the following results :— 
Trait Mean Standard deviation "rh 23 31 
X; 28-02 4-42 +080 — — 
X2 4-9] 1-10 — 0:56 — 
X3 594 8&5 — = - 0-40 


where X,=\Seed per acre; Xz = Rainfall in inches 
X3 = Accumulated temperature above 42°F. 
Solution. Regression equation of X, on X4 and X; is ey by 


(X, - XG t (X, -X,) 2 24 (x, -X,)—# = =0 
1 ry ry 
where O=. 7 1 fy 
V4) rp 5 | 
1 ry ; 
1) = ro 1’ =1-ry?=1~-(-0-56)? = 0-686 
r. ry, 
@12 -- | = -: =9713!%n-Tay = — 0-576 


@13 =%o3%12—-"i3 = (- 0-56) (0-80) = (- 0-40) = — 0-048 
> - Required equation of plane of regression of X, on X, and X3 is given by 


0-686 (~.0-576) (=0-048) 
447 C= 28:02) +” 1-10 (X2+4-91) * “eno (X3 — 594) =0 
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Example 10-36. Five hundred students were examined in three subjects ] 
II and IIT, each subject carrying 100 marks. A student getting 120 or more by; 
less than 150 marks was put in pass class. A student getting 150 or more by 
less than 180 marks was put in second class and a student getting 180 or more 
marks was put in the first class. The following marks were obtained : 


I IT | IT- 
Mean: 35-8 52-4 48-8 
S§.D.: 4-2 5-3 6:1 
Correlation : r12=0-6, = 3=0-7 a3 = 0-8 


(i) Find the number of students in each of the three classes. 

(ii) Find the total number of students with total marks lying between 120 
and 190. 

(iii) Find the probability that a student gets more that 240 marks. 

(iv) What should be the correlation between marks in subjects I and [I 
among students who scored equal marks in subject II ? 

(v) If r3 was not known, obtain the limits within which it may liz from the 
values of r\2 and r,3 (ignoring sampling errors). 

Solution. If Z denotes the total marks of the students in the three Subjects 
and X;, X2, X3 the total, marks of the students in subjects I, II and Il 
respectively, then 

Z = X i* X. 2+ xX 4 
E(Z) = E(X;) + E(X2) + E(X3) = 35-8 + 52-4 + 48-8 = 137 
V(Z) = V(X,) + V(X2) + V(X3) 
_ +2[Cov (X;,X_) + Cov (Xz, X3)-+ Cov (X3;X))] 
= 17-64 + 28-09 + 37-21 + 26:712 + 35-868 + 51-728 
= 197-248 [Using Cov (X;, X; j) =179;5)) 
=> Gf = 197-248 or o7= 14-045 
E -2- EZ) ~ N(O, 1) 
Oz 


Area under the 


curve in this Frequency 


class (A) 500 x (A) 


0-11314 0-70937 354-685 
0-82251 D418 0-17639 88-195 
0-99890 0-00102 0-510 
0-99992 ~ 0-88678 443-390 
- -1-00000 0-00000 0-000 
(i) The number of students in first, second and third class respectively are 
355, 88 and 0 (approx.) 
(ti) Total number of students with total marks between 120 and ‘190 is 443. 
(iii) Probability that a student gets more than 240 marks is zero: 
(iv) The correlation coefficient between marks in subjects I and. of the 
Students who secured equal marks in subject III is 7,23 and is given by 
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roa a= 42 = 89." 23: — A  __. = 0:0934 
V(1-7132)(1 — 7232) ~~ V(1 — 0-49)(1 — 0-64) 
(v) We have . | 
; (ri2 — 713 723)” 
123" ~ (1 = 7132) = 793”) 


0-6 — 0-7a)? 7 
=> 0-36 + 0-49a? —0-84a <0-51 (1 2 at 
= a? —0-84a -0-15 <0 


Thus ‘a’ lies between the roots of the equation : 

a? — 0-84a - 0-15 ='0, 

which are 0-99 and — 0-15. 
Hence r23 should lie between — 0-15 and 0-99. 
Example 10-37. Show that 
1 — Ry.23% = (1 -— 749”) U — 113.27) 

Deduce that 
() Ri23 2712, (i) R123? = 12? + 1137, if 723 = 0 


(iii) 1 —'R,.23? 2 — ( eee ) , provided all the coefficients of zero 


order are equal to p. 
(iv) If Ry.23 = 9, X, is uncorrelated with any of other variables, i.e., 
2="13= 0. [Delhi Univ. B.Sc. (Stat. Hons.), 1989) 


Solution. (2%) Since | r13..! <1, we have from (10-46c) 
1 — R.23? <1l- ri? => Rj.232712 
(ii) We have 
F143 — 712 132 


r if ro. = 
Tarai Vi-ne Ward 


*. From (10-46c), we get 


) 2 P ry | 2_ 
1 — Rj.93° = (1 — 72) | 1 = 2 = 1 ro? - 7137 


1- rv; 
Hence Ri. = 1197 + 7132, if ro3 = 0. 
(iif) Here, we are given that 7,2 =713 = 723 =P 


pies DPE og POD 2 
MENG p09) 1-07) T+ 


Hence from (10-46c), we have 
__p?__) _ (= pt + 2p) 
1 - Ry.237 = (1-5 [1 = q2el- Se ee Sa 


(iv) If R123 = 0, (10-46c) gives 
1 = (1 = 742*)(1 — 713.2”) +(*) 
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Since 0 $r,.* S$ 1 and 0 $7;3..? $ 1, (*) will hold if and only if 
32= 0 and "13.2 = 0 
113.— 112732 


V (1 = 7122)(1 — 7392) 
> —B— = 0 (712 = 0) 
V1 = 759? 


=> "13 = 0 
Thus if R,.23 = 0, then 713 = 7r,2 = 0, i.e., X, is uncorrelated with X2 and X,. 


Example 10:38. Show that the correlation coefficient between the 
residuals X\.x, and X14 is equal and opposite to that between X,., and X23, 


(Poona Univ. B.Sc., 1991) 
Solution. The correlation coefficient between X1.23 and X213 is given by 


Now 7332=0 => 


1 i 
Cov (Xj.23 .X213)  £X1.23X213 _N 2X 9.13(X1 — 512.3 X2 — 613.2 X3) 


01.239213 0 NG4.23 02.43 G1.23 92.13 
2X2.13 X2 : 
= 612.3 N 61.23 O2.13 (c.f. Property 1, § 10-13) 
>» Xo.137 | 
=—bi25 NG 25 On13 (c.f. Property 2, § 10-13) 


0213 | (G2 ¥ 0/22 ) 


01.23 


=-bi25 


12:3 - 
(6,Vio/o;; ) 
1 fe ny 
where @O =| %y 1 try 
T3) Ley) ‘1 
1 'r l "3 
= = |- = 2 = ;—4 —_ 2 
O11 ro 1 1-~— 7937 and @2 re 1 1-733 


02 l- 1237 62.3. 
igh r(x). , >, =—-b)}23—. =—b fe 
23) X2.13) 3G, lr? 123g, 


[since 623? = 627 (1 —ro3") and 61.37 = 6,2 (1 -7)37)] 


Cov (X1.3, X93) Oo. 
ve (X23, Xp43) = — oe Aa) 23 


2,3” f 01.3 
Cov (X).3, X2. 
) = Oa Ba). 10,5, Xa0) 
62.3 01.3 s 


Hence the result. 
Example 10:39. Show that if X; = aX, + bX>, the threé partial correlations 
are numerically-equal to unity, r13.2 having the ‘sign of 4, r23.., the sign of b and 
7123, the opposite sign of alb. (Kanpur Univ. M.Sc., 1992] 
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Solution. Here we may regard X3 as dependent on X, and X2 which may 
pe taken as independent variables. Since X, and X_ are independent, they are 


Thus 7(X1,X2)=0 = Cov(X;,X)=0 
V(X3) = V(aX, + bX) = a2V(X)) + b°V(X2) + 2ab Cov (X;, X2) 
= ao? + bo, 

where V(X,) = 6)’, V(X2) = G,’. 

Also X;X3 =X,(aX, + bX) =a X,? + bX,X> 

Assuming that X;’s are measured from their means, on taking expectations 
of both sides, we get 

Cov 1, X3) = ao,2 +b Cov (X;, X>) = ao,” 


4 


bc _ Cov (X;, X3) 2 ao,” _ 201 
VX) VM%) — VoiXa'a;? + Boz?) 
ese 2 sato,2+ Po?, 
Similarly, we will get 


_ Cov (X2,X3) _ boy 


"3 NVM) Va) OF 
Hence 


sen 1 9 __, ).- — pqs a | 
2 VG= 10-73%) VP - 00? “laa? p40) 
according as ‘a’ is positive or negative, Hence r,3.2 has the same sign as ‘a’ 
Again 
apne Ses |e, ae Bes 

Me Vd-mad-r)  * Ve-ao, Veo? 
according as ‘b’ is positive or negative. Hence rz, has the same sign as ‘b’. 
Now 
Nites 
a" 2 iH kok Z| 2 2) (k2 big? 
(1 —.r13°) (1 — 123°) (k? — a20,”) (k? - 670) 
= — G5}, _ _ = _aib_ a —(alb) =7 ] 
V0.2 x aa? Veep Vee +(aib) * 

according as (a/b) is positive or negative. Le T1723 has the sign opposite to 
that of (a/b). 

Example 10-40. If all the correlation coefficients of zero order in a set of 
p-variates are equal to p, show that 

() Every partial correlation of s’ th order is ears A} 

(ii) The coefft cient of multiple correlatign R of a variate with the othe 
(p- 1) variates is given by 


= (p-1)p 
1-R* = (1-9) Fe + (p - ard 
[Delhi Univ. M.Se. (Maths'), 1990] 


10-126 Fundamentals of Mathematical Statistic, 


Solution. We are given that 
T'mn~P, (m,n=1,2,...,p;m#n) 
We have 
Vii — Vik Vik — 
Vij = 9 (i, J; 
Vd r2)0 - rx?) 
———— 1) 
“Va-p)a-py. 1+? 
Thus every partial correlation coefficient of first order is p/(1 + p). 
=> (*)is true fors =i. 


The result will be established by the principle of mathematical induction, 
Let us suppose that every partial correlation coefficient of order s is given by 
o/(1 + sp). Then the partial correlation coefficient of order (s + 1) is given by 


k=1,2,...,pri¥j4#k 


Tits) — Vikds\" ik 


V ij-km.t = ira) Pan) 
(1 — Vik (3) qi — V" jk. (s)) 


where k, m, ..., t are (s + 1) secondary subscripts and 7774s), Tik-(s)»» Vjk-(s)> ate 
partial correlation coefficients of order s. Thus 


cer ey aT 9 
T ij-km...t = oli edt = est Is as od TS (s . 1) 
1+sp 1+sp 1+ sp 


Using (**) and (***), the required result follows by induction. 


(ii) We have 1 - R? =—>- 

O11 

where R is the multiple correlation coefficient of a variable with other (p - 1) 
variables and 


L Pp ff... 9p 
p 1 p..p ; 
m =| P P 1... p , adeterminant of order ‘p’ and 
p p p...l 
1 p p...p 
p 1 p...p 
p p il... p 


O11 =. ° a determinant of order (p — 1). 
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1p p Pp... 
1 1 p p..p 
g=[1+@- Dp] ue A ee (On adding C2, C3...,C, to C)). 
. a a 

rp p Pp I 
I p p p p 
0 (1-p) ‘0 0 0 
0 0 (l-p) 0 0 


=> wo =(1+@-1)p] 


0 0 0 OO. (-p) 
[On operating R; — R), (i = 2, 3,...p)]. 
@ = [1+(p~1)p](1 - py 


Similarly, we will have " 
@1, =[1+(@-2)p](1—p)” 


_o | (p- 1p 
ae = [jt e= 2 | 


Example 10-41. In a So distribution, all the total (order zero) 
correlation coefficients are equal to Po #0. Let p, denote the partial correlation 
coefficient of order k and R, be the multiple correlation co <fficient of one variate 
on k other variates. Prove that 


‘ 1 re 
(t) Po =— 1)’ (li) Pg — Pk-1 =— Pe Pri, and 


k po? ; 
i) R.2 = ——oPO2 i Univ. M.Sc. (Stat.) 198 
(iit) Ry 1+(k-1po’ (Delhi Univ (Stat.) 1987] 
Solution. (7) We have proved in Example 10-40, that 
p _—Po__ 
. 1 + kpo 


In the case of p-variate distribution, the partial correlation coefficient of the 
highest order is p,-2 and is given by 


a 
Pr2=1 + (p -2)Po 


Since Ilpp2!s1 =>. -1Sp,2S1, 
we have (on considering the lower limit) 
Po 
-1 <——_—————_-—_ -[1+ (p-2)po] S 
l<iv@-Dp. 7 [1+ (p - 2)po] $ Po 


=> Po 2- 


(p- 1) 
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° os _ 
G) EATS: = Pk- Pea 1 epg 1+ (K-10 


_ aie aT 
“£0! (1+ kpo)ll + (K— Dpol 


1+kpo}\l1+C&—1)po ePrt 
(iii) Taking » = Po and k = p— 1 in part (ii) Example 10-40, we get 


1+k 
1 —-R,2 = (1—Po) Fexcnina 


(1 — po)(1 + kpo) k Po” 
R? =1- = 
L+(k-1)pp 1+ (k-1)Po 
Example 10-42. If r;. and r,3 are given, show that ro, must lie in the 
range : rio fig t (1 — ryq? — 143? + ry22r33*)!? 
If ry. =k and r)3 =—k, show that ro3 will lie between -1 and 1 - 2k. 
[Sardar Patel Univ. B.Sc. Oct., 1992; Madras Univ. B.Sc. (Stat, Main) 1991) 


(On simplification), 


Solution. We have 
2 
' -s 12 — 113923 
712.37 = | 


V(1 = 7432) (1-143?) 
(ri2—Tisray? S$ (1 - r137)(1 = 23%) 
=> hig + 39%23" — Wy 113723 $1 — 143% - ro? + 11377237 
=> N19? + 1437 + 1o3* — 2712 113723 <1 s..(*) 
This condition holds for consistent values of rj2, 7,3 and r3..(*) may be 
rewritten as : 
Fog? — (2r,2713)F23 + (712 + 1137-1) <0. 
Hence, for given values of r,. and r;3, 723 must lie between the roots of the 
quadratic (in r2,) equation 
Fog” -- (2ry2 113) Fog + (rio? + M137 - 1) =0, 
which are given by : 
Pog = Nori t Vryz?ris? — (riz? + r37 — 1) 


Hence 


N2"13- 


1 — P19? — 743? + ry2"r13? Sra Sno 


$V (1 = ry 9? 743? + 71277132) «..(**) 


In other words, 723 must lie in the range 


renstvdl — 142? — 17437 + 427737) 


In particular, if r;2 =k and r;,=—k, we get from (**) 


-hR-VQ - k? — k2 + k4) Sry f~h+V(1 — k? ~ 2 + k*) 
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=> —~k?-(1-—k2) Sry $—k? + (1 -k?) 
. -1 $ry4$1-2k 


. EXERCISE 10(g) 
1. @) Explain partial correlation and multiple correlation. 
(b) Explain the concepts of multiple and partial correlation coefficients. 
Show that the multiple correlation coefficient R,.23 is, in the usual 
notations given by : Ry937 = 1- = 
M11 


2 (a) In the usual notations, prove that 
2 2 
Ryo? = + rs ls 2 
(b) If Ry.23 = 1, prove that r2.;3 is also equal to 1. If R,.2, = 0, .does it 
necessarily mean that R>.,3 is.also.zero ? 
3. (a) Obtain an expression: for the variance of the residual X,.23 in terms of 
the correlations Y12, 3 and r3; and deduce that R123) 2712 and r}3. 


(b) Show that the standard deviation of‘order p may be expressed in terms of 
standard deviation of order (p - 1) and-a correlation coefficient of order (p — 1). 
Hence deduce that : 


(i) D1 20}.2 291.23 2... 2 81.93. 
eee 
(ii) 1 ~Ri23..0=(1- ri2) (l-r 32)..(1 — in-23...(n-1)) 
[Delhi Univ. M.Sc. (Stat.) 1987] 


4. (a) Ina p-variate distribution all the toal (zero order) correlation 
coefficients are equal to py # 0. If p, denotes the partial,correlation coefficient of 
order k, find p,. Hence deduce that : 


(1) Pe ~ Pe -1 = — Pe Pei 
_ (6) Show that the-multiple cortelation coefficient: Rj.93.,; between X, and 
(X2,X3, ..., Xj), j = 2, 3, ..., p satisfies the inequalities : 
” Riz S$ Ry23-8 «S$ R1.23..p 
[Delhi Univ. M.Sc. (Maths.), 1989] 


5. (a) Xo, X1, .;-. X, are (n + 1) variates. Obtain a linear function of X,,. 
X2, ...,X, which will have a maximum correlation with Xo. Show that the 
correlation R of Xo with the linear function is givenby | 
1 


R= (1- an 
Wo0 
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i) Tal r, rnd odes 1 
and (pop is the determinant obtained by deleting the first row and the first column 
of @. 
(b) With thé usual notations, prove that 
® : 
071.234... = oo! = 0,7 (1 — r12?)(1 - 713.2”)...(1 — Pinzs..n-1) 
(c) For a trivariate distribution, prove that 
ae 112 — 113 723 
V (1 = 7137) (1 - 123?) 
6. (a) The simple correlation coefficients between temperature (X,), com 
yield (X2) and rainfall (X3) are, r,2 = 0-59, r,3 = 0-46 and rz, = 0-77. 
Calculate the partial correlation coefficients rj2.3, 723.; and 73)... Also 
calculate R}.23. 
(b) If rz = 0-80, 713 =— 0-40 and r., = — 0-56, find the values of 712.3, 713.2 
and r 231° Calculate further R 1(23)s R213) and R312). 
7. (a) In certain investigation, the following values were obtained :; 
"12 = 0-6, r13 =—(-4 and ro3 = 0:7 
Are the values consistent ? 
(b) Comment on the contieney of 


1 
N2=5 2 P29 = £731 =—5- 

(c) Suppose a computer has found, for a given set of values of X,, X2 and 

X43, 
n2=0-91, m35 0-33 and 7r3.=0-81 

Examine whether the computations may be said to be free from error. 

8. (a) Show that if 7,2. =7,,=0, then R23) = 0. What is the significance 
of this result in regard to the multiple regression equation of X 1; on X_ and X; ? 

(b) For what value of R,.>, will X2 and X, be uncorrelated ? 

(c) Given the data : "12 =-0-6, ri3= 0-4, find the value of To3.SO0 that Rj.2 
the multiple correlation coefficient of X,; on Xz and X; should be unity. 

9. From the heights (X,), weights (X2) and ages (X3) of a group of students 
the following standard deviations and correlation coefficients were obtained : 
G, = 2-8 inches, 62 = 12 lbs, and 63 = 1-5 years, r;2 = 0-75, ro, = 0-54, and 
r3, = 0-43. Calculate (i) partial regression coefficients and (ii) partial correlation 
coefficients. 

10. For a trivariate distribution : 

xX, = 40 X, = 70 Xe z= Of) a 
GO; = 3 0, =6 0; =7 
ro = 0-4 ‘rn = 0.5 "3 = 0-6 
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Find 
(i) Ry.23, (i) 723.1, (iii) the value of X3 when X, = 30 and X_ = 45. 
11. (a) In a study of a random sample of 120 students, the following results 
are obtained : 
Xr = 68, X> = 70, 3 
S,? = 100, S,* = 25, S;* =81, 
Tio = 0-60, ri = 0-70, rn =06 
{S? = Var (X;)], where X,, X2, X3 denote percentage of marks obtained by a 
student in I test, II test and the final examination respectively. 
(i) Obtain the least square regression equation of X3 on X, and X>. 
(ii) Compute rj23 and R3,1. 
(iii) Estimate the percentage marks of a student in the final examination if 
ne gets 60% and 67% in I and II tests respectively. 
(b) X,; is the consumption of milk per head, X2 the mean price of milk, and 
X3, the per capita income. Time series of the three variables are rendered trend 
free and the standard deviations and correlation coefficients calculated : 
§) = 7-22, 52= 5-47, 53 = 6-87 
No =— 0-83, r= 0-92, ro, = — 0-61 
Calculate the regression equation of X, on X, and X, and interpret the regression 
8s a demand equation. . 


12. (a) Five thousand candidates were examined in ‘the subjects (a), (b); (c); 
ae of these subjects carrying 100 marks. The following constants relate to 
data: ~~ 


Subjects 
(a) (5) (c) 
Mean 39-46 52°31 45-26 
Standard deviation 6-2 9-4 8-7 


lye = 0-47 oq = 0-38 Tab = 0-29 
Assuming normally correlated population, find: the number of candidates 
Who will pass if minimum pass marks are an aggregate of 150 marks for the 
Subjects together. 
(6) Establish the equation of plane of regression for variates X,, X>, X3 in 
determinant form 
X,/0; XJ, = X3/03 
12 1 'n =Q0 


13 'n 1 


[Delhi Univ. B.Sc. (Maths. Hons.), 1986) | 
13. (a) Prove the identity 
512.3 623.1 631.2 = 12.3 723.1 131-2 [Gujarat Univ. B.Sc., 1992] 


~ 
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(b) Prove that 
Ry.937 = by2.3 712 . + by3.2 113 - 
[Sardar Patel Univ. B.Sc., 1991}) 

14, (a) If X3 = aX, + bX> for all sets of values.of X,, X2, and X3, find the 
value of 733.;. 

(b) If the relation aX, + bX, + cX3 = 0 holds for all sets of values X,, X, 
and X3, what must be the partial correlation coefficients ? 

15, (a) If rig = 723 = 73, = p #1, then 


a. ra 


a V(1 +9) 


(b) Y,;, Y2, Y3 are uncorrelated standard variates. X,; = Y2 + ¥3, 
X2 = Y,+Y;, and X; = Y; + Y,. Find the multiple -correlation coéfficient 
between X, and (X, and X>). 

16. X, Y, Z are independent random variables with the same variance, If 


X= K-21, X= K+ Y +2), X3== + 2Y +2), 


show that X,, X2, X3 have equal variances. Calculate r;2.3 and Rj (23). 

17. (a) If X,, Xz and X, are three variables measured from their respective 
means as origin and if e, is the expected value of X, for given values of X, and 
X, from the linear regression of X, on Xz.and X3, prove that 

Cov (X), €;) = Var (e;) = Var (X,) — Var (X, — e)) 
(b) If r:2=k and rz, =—k, show that r,, will lie between —1 and 1 — 2’. 
18. (a) For three variables X, Y and Z, prove that 
rxy + yz + 'zx 2-3 wo(") 
Hint. Let us transform X, Y, Z to their standard variables U, V and W: 


(say), respectively, where 
y =X EO y -r-EW w -2-£@ 
Oy ae Oy a Oz 


so that 

E(U)=E(V)=E(W)=0 oe) 
and Oy* = Oy? = Cy’ = 11> E(U2) = E(V2) = E(W?) = 1 = 

_ Cov (U,V) __ E(UV) - E(U) E(V) _ 
and Tuw = E(UW); ruw = E(VW) 
Since correlation coefficient is independent of change of origin and scalé: 

proving (*) is equivalent to proving C 

ruv + Pow + Fuw 2-3/2 ce) 


To establish (****) let us consider the E(U + V + W)2, which is aiwayS 
non-negative i.e., E(U+V+W)220, and use (**) and (***). 
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(b) X,Y,Z are : three reduced (standard) variates and E(YZ) = E(ZX) = — 1/2, 
find the limits between which the coefficient of correlation r(X, Y) is necessarily 


placed. 

Hint. Consider E(X +Y+Z)?20 => r2 -5 : 

(c) If 712, ro, and r3, are correlation coefficients of any three random variables 
X,, Xz and X3 taken in pairs (X,, X2), (X2, X3) and (X3, X;) respectively, show 
that 1 + 2ri2 1373, 2 12? + 7137 + To? 

19. (a) If the relation aX, + bX, + cX; = 0, holds for all sets of values of 
X,, Xz and X3, where X,, X_ and X, are three standardised variables, find the 
three total correlation coefficients r,5, 723 and r)3 in terms of a, b and c. What 
are the values of partial correlation coefficients if a,b and c are positive ? 

(b) Suppose X,, Xz and X3 satisfy the relation a,X, + a2X> + a3X3 =k. 

(i) Determine the three total correlation coefficients in terms of standard 
deviations and the constants a), a2 and a3. 

(ii) State what the partial correlation coefficients would be. 

20. (a) Show that the multiple correlation between Y.and X,, X2, ..., Xp is 
the maximum correlation between Y and any linear function of X;, Xo, ..., Xp. 

(b) Show that for p variates there are PC. correlation coefficients of order 
zero and:?-2C,,, PC, of order s. Show further that there are °C,. 2°-? correlation 
coefficients altogether and ?C2, 2°! regression coefficients. 


ADDITIONAL EXERCISES ON CHAPTER X 


1. Find the correlation coefficient between 


(i) aX + b and Y, (ii) ix + mY and X + Y, when correlation coefficient 
between X and Y is 0. 


2. If X, and X, are independent normal variates and U and V are defined by 
U =X, cosa+X,2sina, V=X,cosa-X, sina, 
Show that the correlation coefficient p between U and V is given by 
46,202 

~ 40,2 6,2 + (6,2 — 622) sin? 2a ’ 
where 6,” and 67? are variances of X, and X2 respectively. 

3. The variables X and Y are normally correlated, and €, n are defined by 

&=Xcos0+/Y sin 6, H = Y cos 6 -X sin 8 

Obtain 6 so that the distributions of & and 1 are independent. 


4. A set of n observations of simultaneous values of X and Y are made by 
an observer and the standard deviations and product moment coefficient about the 
Mean are found to be ox, Oy and pxy. A second observer repeating the same 
Observations made a constant error e in observing each X and a constant error E 
n observing each Y. The two sets of observations are combined into a single set 
and coefficient of correlation calculated from it. Show that its value is 


(oxy + beB) +f (6x? + be2)(0y? + 1E%) 


p7=1 
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Hint. here we have two sets of observations : 


Ist Set: (x, ¥;), i= 1,2, ...,2; Mean =x ,s.d.=0,. 
Product moment coefficient Pp, = 7, 0,9, 
and Set : (x + e, y, + E), i= 1, 2.505 


»_d = 
Mean (X’}.=47 LQ +e) =x +e 


Variance = 0,” = +5 [(x; + ¢)- (+e)? =+ 5 &j-FP =o! 


Mean (y’) =y+E, 0,” =o,?. 
Product moment coefficient : 


a | Sa ots i 2 
Pry = L(x: + €) - (% +e), + BE) - (9+ B)] = Py 
‘To obtain the correlation coefficient for the combined set of 2 observations 
use Formula (10-5); Example 10-11(a) page 10-15. 
5. Each of n independent trials can materialise in exactly one of the results 


A, Az, ..-, Ag. If the probability of A; is p; in every trial ( XY pi= 1) , find 
; é = 1 


the probability of obtaining the frequencies r,, rz, ..., 7, for A, Az, ... » Ab 
respectively in these trials. Also find E(r;), Var (7;) and show that the correlation 
coefficient between r; and r; is independent of n. 

6. In a sample of size n from a multiriomial population n,, no, ..., n, are of 
type 1,2, ...,& with Xp; = 1, where p; is the probability of type i @ = 1, 2, .. 
k). Show that the expected value of n2 when n; is given is (n — 1) po(1 — p,) and 
hence or otherwise show that the coefficient of correlation between n; and n; is 

1 


2 
[2 —] 
: F - pi) (1 - p)) 

7. A ball is drawn at random from an urn containing 3 white balls 
numbered 0, 1, 2 ; 2 red balls numbered ‘0, 1 and 1 black ball numbered 0. If thé 
colours white, red and black are again numbered 0. 1 and 2 respectively, show 
that the correlation coefficient between the variables : X, the colour number a! 
Y, the number of the ball is - 7 


8. If X, and X2 are two independent normal variates with a common mea! 
zero and variances 0,2 and >? respectively, show that the variates defined by 
02 0) 
U, =X, + == — 
U) 1 X2 and U2 g, 31+ @ 42 
are independent and that each is normally distributed with mean zero and 
common variance (6,2 + 62”). 
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9. If X,,-X2 and X, are uncorrelated variables with equal mean M and 
variances V,2, V2.2 and V,? respectively, prove that.correlation coefficient p 


X 
between Z,; = = and Z,= a is given by 
3 3 
a, a 
V[(Vi2 + V32)(V2? + V3?) 
Hint. Neglecting the cubes and higher powers of = , x; being the 


deviation of X; from M and letting the means and s.d.’s of Z, and Z, to be J,, I, 
and s;, $2 respectively, we get . 


X, 1 _ 
h=y ay eeu Ns +My 


_1 ( 211) ( x3i\-! 
-t V+ 1+ 4 


_1 [ ( Xai Xa Nai Xa ] 
-1> 1~ a+ M2° °°" + uM M2 + oes 
ve 
=1+ M2 
2 
Similarly J, =1+ s 
h =f 
1 Xx 
Now st = X, —1,? 
2 y2 2 2 
or 5,27+/,2 = 14 Ue OL and so we have 2= U4 UE 
2 2 
Similarly s,? = 5+ a 
2 
N =s 3 vias! OO o2 » Vs" ° ° 2 
Ow Nps)S2 Lz I ) (x I 2) M2 (On simplification) 
iH ‘ _Ne S152 _ V37 


10. (Weldon's Dice Problem). n white dice and m rea dice are shaken 
lOgether and thrown on a table. The sum of the dots on the upper faces are noted. 
€ red dice are.then picked up and thrown again among the white dice left on 
€ table. The sum of the dice on the upper faces is again noted. What is the 
“correlation coefficient between the first and the second sums ? 
Ans, n/(n + m) 
11. Random variables X and Y have zero means. and non-zero variances 6x? 
ido’ 1¢Z7=Y-X , then find o7 and the correlation coefficient p(X, Z) of 
. and Z in terms of Ox, oy and the correlation coefficient r(X, Y) of X and Y. 
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For certain data Y = 1-2X and X = 0-6Y, are the regression lines. Compute 
r(X, Y) and oy/oy. Also compute p(X, Z), if Z= Y —X. 
(Calcutta Univ. B.Sc: (Maths. Hons.), 1984) 
12. An item (say, a pen) from a production line can be acceptable, 
repairable or useless. Suppose a production is stable and let p,q,r(p +q+t+rsz 
1), denote the probabilities for three possible conditions of an em. If the items 
are put into lots of 100: 
(i) Derive an expression for the probability function of (X, Y) where X ang 
Y are the number of items in the lots that are respectively in the firg 
two conditions. 


(ii) Derive the moment generating function of X and Y. 
(iii) Find the marginal distribution X. 
(iv) Find the conditional distribution of Y given X = 90. 
(v) Obtain the regression function of Y on X, 
(Delhi Univ. M.A, (Eco.), 1985} 
13. If the regression of X, on Xo, ..., X, is given by : 
E(X, |X2, .... Xp) = a + B2X2 + By X35 +... + BX, 
Ox 523... Sry 
O32 933--. Oxy | jj = Variances 
and > 0, ; 
fo | ; * = saitliaage) 
Op. Sp3 ++» Spp 
then the constants a, Bo, ..., B, are given by 


« 


SL iiagueie OL ge, ia OL Rip 2 
GM te a Het Ra gz Hate te: % 
Rij 
and B; = a 1 (1, y -+> P) 
11 G; 
where R;; is the cofactor of p, in the determinant (R) of the correlation matrix 
Pi = P12 --- Pip 
P2 P22-:- Pr» 
R= 


Ppt Pp2 >>> Ppp 
(Delhi Univ. M.Sc. (Stat), 1988- 
14, Let X, and Xz be random variables with means 0 and variances 1! and 
correlation coefficient p. Show that: 


Elmax (X,2, X.2]<$ 1+ V1- 2 
Using the above inequality, show that for random variables X, and X, with 


means |, and [,, variances 6, and 0,2 and correlation coefficient.p and for any 
k>0, 
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P (|X, ~-p,l2ko, or 1X, -p,12ko,] < al + Vi - p?,| 


15. Let the maximum correlation between Xp and any linear function of 
X,,X%2 coe Xn be R and if To) =7q2= --- =lo, =F 
and all other correlation coefficients are equal to s, then show that : 


R ! n 1/2 
11+ (r- vi 


16. If f = f(x, y) is the p.d.f. of BVN (0, 0, 1, 1, p) distribution, verify 
rere of of 
op oxdy 
Further, if two new random variablés U and V are defined by the relation 
U=P(Z<x) and V=P(Z< y) where Z ~ N(O, 1), 
ak the marginal distributions of both U/ and V are uniform in the interval 


1 
(-;3 3° 5) and their common variance is — S 35° 


Hence prove that R = Corr. (U, V), satisfies the relation: » = 2 sin (1R/6). 
[Delhi Univ. B.A. (Stat. Hons. Spl. Course), 1988) |: 
17. If (X, Y) ~ BVN (i, My, 9,7, 0,7, p), then prove that a + bX + cY, 
(b #0, c #0) is distributed as N(a + byt, + cHl,, b’0,? + c’a,? + 2bcpa,9,). 
{ (Delhi Univ. M.Sc. (Stat), 1989) 
18. Let X,,X2,X3 be a random sample of size n = 3 from NG: 1) 
distribution. 
(a) Show that Y, = X; + 5X3, Y. = X2+ 5X; has a bivariate nornial 
distribution. 
(b) Find the value of 5 so that p(¥, Yo) = 5. 


(c) What additional transformation involving Y, and Y, would produce a 
bivariate normal distribution with means 1, and [l2, variances 6,7 and 02”, and 
the same correlation coefficient p ? 

Ans. (6)-lorl. (c)Z,= O17; + $1, Zo = G22 + M2. 

19. If (X, Y) ~ BVN (0, 0, 1, 1, p), prove that : 

E[max (X, Y)] = [(1 - pyr" and E [min (X, Y)] = -— [(1 -p)/n]!” 

20. If (X, Y) ~ BVN (0, 0, 6,2, 62”, p), show that rth cumulant of XY is 
given by : 

K,=$(r-1)! 0,7 027 [(p + 1)" + (p - 1)']. 
Deduce that E(X? Y) = o,2.0,2 (1 + 2p”). 


21. Let f and g be the p.d.f.’s of X and Y with corresponding distribution 
functions F and G. Also let 


A(x, y) = flx) g(y) [1 + & fx) - 1) 2G) - 1]; tats 1, 
Show that h(x, y) is a joint p.d.f. with marginal p.d.f.’s fand g. Further, 
let fand g be N(O, 1) p.d.f.’s. Show that Z =X + Y, is not normally 
distributed, except in the trivial case a = 0. 
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Hint. ‘Find M(t) = E(e'2) and use Cov (X, Y) = a/n. 


22. State p.d.f. of bivariate normal distribution. Let X and Y nave. joint 
p.d.f. of the form : 


~5 [a(x — by)? + 2ayo(% — b1) (y - ba) + @22(y ~ 52)7]; 


LE IVS —o <x, y)< 


Find (i) k, (ii) the correlation coefficient between X and FY. 

23. Write down, but do not: derive, the moment generating function for a 
pair of random variables which have a bivariate normal distribution with both 
means equal to zero. 

The independent random variables X, Y, Z, are each normally distributed 
with mean O and variance 1. [f U = X+Y¥+ZandV=X-— Y + 2Z, show that 
U and V have bivariate normal distribution. Find the correlation of U with 
V and the expectation of U when V is equal to 1. 

24. Let X, and X, have a joint m.g.f. 


M(t, t,) = [ale’! t+ 2-41) 4 belt +e)’ 
in which a and b are positive constarits such that 2a + 2b = 1. 
Find EX;), E(X)), Var. (X;), Var (X), Cov (Xi, Xp). 


Ans. Means = 1, Variances = 3; ; Covariance = 2a-3. 


25. X;,.X2,X3 have joint distribution as a multinomial distribution with 
parameters N, p;, p2, p3. If r;; is the Correlation coefficient between X; and X;, 
find the expression for 712, 723 and 73; and hence deduce the expression for the 
partial correlation coefficient r,.23. 

26. (i) If all the infer-correlations between (p + 1) variates Xo, X1, X2..., 
X, are equal to 7, show that each of the partial correlation co-efficients of order 
p — 1 is equal to r/[1 + (p — 1)r) and that the multiple correlation of Xp on X,, 
X2, soey Ap is given by 

(1 —r)(1 — pr) 
1 - Rloa2..2)= Vy (p = 1)r 

(i) Ne = (r123 - 71327201 — F132) - Pigg, s)M2] 

- 27. IfR denotes the multiple correlation co-efficient of X, on X>, X3, .... 
X, in p-variate distribution, prove that 

(i) R? = Ro*, where Ro is the correlation of X, with any arbitrary linear 
function of X>, X3,... Xp. 

(ii) R? = R;?, where R, is the multiple correlation coefficient of X, with 
X2,X3, ...,Xz,k <p 


(ii) 1-R2= Il (1 = Py, 19) 


jw 


Theory of Attributes 


_ 11. Introduction. Literally, an attribute means a quality or 
characteristic. Theory of attributes deals with qualitative characteristics which are 
not. amenable to quantitative measurements and hence need slightly different 
statistical treatment from that of the variables. Examples of attributes are 
drinking, smoking, blindness, health, honesty, etc. An attribute may be marked 
by its presence (possession) or absence (dispossession) in a member of given 
population. It may be pointed out that the method of statistical analysis 
applicable to the study of variables can also be used to a great extent in the 
theory of attributes and vice-versa. For example, the presence or absence of an 
attribute may be regarded as changes in the values of a variable which can 
possess only two values viz. 0 and 1. For the sake of clarity and simplicity, the 
theory of attributes has been developed independently. 


11-2. Notations. Suppose the population is divided into two classes, 
according to the presence or absence of a single attribute. The positive class, 
which denotes the presence of the attribute is generally written in capital Roman 
letters such as A, B, C, D etc. and the negative class, denoting the absence of 
the attribute is written in corresponding smali Greek letters such as a, B, y, 5, 
etc. For example if A represents the attribute sickness and B represents 
blindness, than « and B represent the attributes non-sickness (health) and’ sight 
respectively. The two classes .viz., A (possession of the attribute) and a 
(dispossession of the attribute) are said to be complementary classes and the 
attribute « used in the sense of not-A_ is often called the complementary 
attribute of A. Similarly B, y, 5 are the complementary attributes of B, C, D 
respectively. 

The combinations of attributes are denoted by grouping together the letters 
concerned ¢.g. AB is the combination of-the attributes A and B: Thus for the 
attributes A (sickness) and B (smoking), AB would-mean the simultaneous 
possessioh of sickness and smoking. Similarly AB will represent sickness and 
non-smoking, @B non-sickness (health) and smoking, and o6 non-sickness and 
ngn-smoking. 

if a third attribute be included to represent, say male, then ABC will stand 
for sick males who are smokers, Similar interpretations can be given to ABY, 
ABC, ABy etc. 

11-3. Dichotomy. If the universe (population) is divided into two sub- 
classes or complementary classes and no more, with respect to each of the 
attributes A, B, C etc., the division or classification is said to be ‘dichotomous 
classification” . The classification is termed manifold if each class is further sub- 
divided. 

11-4. Classes and Class Frequencies. Different attributes in 
themselves are called different classes and ‘the number of observations assigned to 
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them are called class frequencies which are denoted by bracketing the class- 
symbols. Thus (A) stands for the frequency of A and (AB) for the number of 
objects possessing the attribute AB. 

Remark. Class frequencies of the type (A), (AB), (ABC) etc. are known as 
positive frequencies; (a), (aB), (aBy) etc. are known as negative frequencies 
and (0B), (ABC), (aBC) etc. are called the contrary frequencies. 

11-4-1. Order of Classes and Class frequencies. A class 
represented by n attributes is called a class of nth order and the corresponding 
frequency as the frequency of the nth order. Thus (A) is a class frequency of order 
1; (AB), (AC), (By) etc. are class frequencies of second order: (ABC), (ABy) 
(aBC) etc. are frequencies of third order and so on. N, the total number of 
members of the population, without any specification of attributes, is reckoned 
as a frequency of zero-order. 

-Thus in a dichotomous classification with respect to n attributes, the 


e | ad n e ° an 
number of class frequencies of order ‘r’ is ( ). 2’, since r attributes out of n 
r 


; n ‘ , 
can be selected in : ways and each of the r attributes contributes two 


symbols, one representing the positive part (e.g., A) and the other the negative’ 
part (e.g., a). Thus the total number of class frequencies of all orders, for n 
attributes is : 


Py rare(4 J2+(Q Bret (" a= +2y=3 


ra OQ 
(11-1) 
Remarks 1. In particular, for.n attributes, the total number of class 
frequencies of different orders are given as follows : 


Order 0 1 2 be, r oP n 


No. of frequencies 1 2n € \z ees (", )z Se. 25 


2. Since in the case of n attributes, the positive class frequency of order r 
has ( elements, their. total number is : 


DX (7 )(5 (4 ee (Sears 


yr =Q 
3. In case of 3 attributes A, B ard C, the total number of class frequencies 
is 33 = 27, as given below : 
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Order Frequencies 
0 N 
, {@ ® (C) 
(a) (@) (y) 


(AB) (AB) (0B) (of) 
2 (AC) (Ay) (ac) (ay) 
(BC) (BY) (BC) (By) 


5 ee (ABy) (ABC) — (By) 
(QBC) (oBy) (082 (ay) 


...(11-2) 

11-4-2. Relation Between Class Frequencies. All the class 

frequencies Of various orders are not independent of each other and any class 

frequency can always be expressed in terms of class frequencies of him. -der. 
Thus 


N = (A) + (a) = (B) + (B) = (C) + (9), ete. 
Also, since each of these A’s or a@’s can either be B’s or B’s, we have 
(A) =(AB)+(4B) and (a) = (0B) + (af) 
Similarly (8B) =(AB)+(@B) and (8) =(AB)+ (a8) 
(AB) = (ABC) + (ABy), (AB) = (ABC) + (py) 
(2B) = (ABC) + (By), (aB) = (aBC) + (aBy) 
and so on. Thus 
(A) = (AB) + (AB) = (ABC) + (ABy) + (ABC) + (ABy) 
(B) = (AB) + (@B) = (ABC) + (ABy) + (OBC) + (By), etc. 

The classes of highest order are called the ultimate classes and their 
frequencies, the ultimate class frequencies. Thus in case of n attributes, the 
ultimate class frequencies will be the frequencies of nth order. For example, the 
class frequencies (ABC), (ABy), (ABC), (ABy), Speed (aBy), (aBC), (aBy) 
are the ultimate frequencies for three attributes A, B and C 

Remarks 1. In case of n attributes, the ultimate — frequencies each 
contain h Symbols and since each symbol may be written in two ways, viz., 
positive part and negative part, ¢.g.,A or a, B or B, etc., the total number of 
ultimate class frequencies is 2”. 

2. By expressing any class-frequency in terms of the class frequency of 
higher order, we can express it ultimately as the sum of some of the 2” ultimate 
-Class frequencies. 

3. The total number of ultimate class frequencies specify the data 
completely. 

4. The set of ultimate class frequencies is not the only set which specify 
the data completely. In fact any set of class frequencies which are (i) 2” in 
number and (ii) which are algebraically independent of each other, will specify 
the data completely. Such a set is called the Fundamental Set. For example, the 
positive class frequencies form such a set. Thus for n = 2, the set of positive 
class frequencies 2? = 4 (c.f. 11-2), is N, (A), (8), (AB). If we are given these 
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lrequencies, then it is obvious from the table that the remaining {requencics, 
viz., (AB), (a@B) (&B),() and (B) can be obtained by subtraction, e.g., given : 


A a Total 


(a) = N—(A), (B) = N-(B) 
(AB) = (A) — (AB), (&B) = (B) - (AB) 
(4B) = () — (AB) = N — (A) - (B) + (AB) 
11.5. Class Symbols as Operators. Let us write symbolically: 
A.N = (A) b .. (113) 


which means that the operation of dichotomising N according to A gives the 
class frequency equal to (A). Similarly, we write 


a.N = (a) 
Adding, we get 
A.N + a@.N = (A) + (Q) 


= (A +a). N = (A) + (Q) 
= (A+a).N=N 
= A+a=} 


Thus in symbolic expression we can replace A by (1 — a) and @ by (1 —A). 
Similarly, B can be replaced by (1 — B) and B by (1 — B), and so on, 


Dichotomising (B) according to A, Ict us write 
A. (B) = (AB) 
Similarly, B. (A) = (BA) 
A. (B) = B. (A) = (AB) = AB.N, 
which amounts to dichotomising N according to AB. 
For example : 
(aB) =aB.N =(1 -A)(1-B). N=N-A.N-B.N+AB.N 
= N-[ (A) + (B)] + (AB) 

(aBy) =aBy.N=(1-A)(1-B)(1-C).N 
=N-A.N-B.N-C.N+AB.N + ACN + BCN-ABC.N 
= N-[(A) + (B) + (C)] + [(AB) + (AC) + (BC)] - (ABC) 

(ABy) =ABy. N=AB(1 ~C).N =AB. N-~ABC.N 
= (AB) - (ABC) 

(ABC) =(1-A) (1 -B)C.N=(C-AC- BC+ ABC).N 
= (C) — (AC) - (BC) + (ABC) 


and so on. 
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Example 11-1. An investigation of 23,713 households was made in an 
‘ urban and rural mixed locality. Of these 1,618 were farmers, 2,015 well-to-do 
and 770 families weré having at least one graduate. Of these graduate families 
335 were those of farmers and 428 were well-to-do, also 587 well-to-do families 
were those of farmers and out of them only 156 were having at least one of their 
family member as graduate. Obtain all the ultimate Class frequencies. 


Solution. Let the attribute ‘farming’ be denoted by A, the attribute ‘well- 
to-do’ by B and ‘having at least one graduate’ by C. Then in the usual notations, 
we are given 


N = 23713, (A) = 1618, (8) = 2015, (C)=770, (AB) = 587, 
(BC) = 428, (AC) =335 and (ABC) = 156. 
For three attributes A, B, C.,the number of ultimate class frequencies is 


23 = 8, one of them being (ABC) = 156. The remaining frequencies are obtained 
below : 


(ABy) = (AB) — (ABC) = 587 - 156 = 431 
(ABC). = (AC) ~ (ABC) = 335 ~156 = 179 
(ABy) = (A) — (AB) - (AC) + BC) 
= 1618 ~ 587 — 335 + 156 = 852 
(aBC) = (BC) - (ABC) = 428 — 156 = 272 
(aBy) = (B) - (AB) - (BC) + (ABC) 
= 2015 — 587 — 428 + 156 = 1156 
(ABC) =(C) —(AC) - (BC) +{ABC) = 770 - 335 — 428 + 156 = 163 
(aBy) =N —(A)- (6) -(C) + GB) + (AC) + BC) - (ABC) 
= 23713 — 1618 — 2015 — 770 + 587 + 335 + 428 ~ 156 = 20504 


Example 11-2. (a) Given the following ultimate class frequencies, find 
the frequencies of positive class, 


(ABC) =149, (ABy) = 738, (ABC) = 225, (AB y) = 1,196 
(aBC)=204, (aBy)=1,762, (aBC)=171 ad (aBy) =21,842 
(b) Find the remaining class frequencies, given the following data : 
N = 23,713, (A) = 1618, (B) = 2015, (C) = 770 
(AB) = 587, (AC) = 428, (BC) =335, (ABC) = 156 
Solution. (a) (A) = (ABC) + (ABy) + (ABC) + (ABy) = 2,308 
(B) = (ABC) + (ABy) + (ABC) + (aBy) = 2,853 
(C) = (ABC) + (ABC) + (BC) + (BC) = 749 
(AB) = (ABC) + (ABY) = 887 
(AC) = (ABC) + (ABC) = 374 
(BC) = (ABC) + (ABC) = 353 
and N =[(ABC) + (ABy) + (ABC) + (ABy) + (aBC) + (087) 
+(aBC) + (aBy)] = 26,287 
(b) For three attributes, there are 3° = 27, class frequencies in all. Thus we 
have to determine the remaining 19 class frequencies : 
Order 1 : 
(a) =N -(A) = 22,095 ; (B) = N — (B) = 21,698 
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(y) =N -(C) = 22,943 


Order 2: Order 3: 
(AB) = (A) - (AB) = 1,031 (ABy) = (AB) — (ABC) = 431 
(aB) = (B)- (AB) = 1,428 (ABC) = (AC) - (ABC) = 272 
(oB) = (a) - (0B) = 20,667 (ABy) = (AB) - (ABC) = 759 
(Ay) = (A) -(AC) = 1,190 (ABC) = (BC) - (ABC) = 179 


(aC) =(C)-(AC)=342 * (aBY) = (0B) — (ABC) = 1249 
(ay) =(a)-(0C) = 21,753 (@BC) = (BC) - (ABO) = 163 
(By) =(8)-(BC) = 1,680 (aBy) = (a) - (ABC) = 20,504 
(BC) =(C)- (BC) = 435 
(By) =(B)- (BC) = 21,263 
Example 11:3. Show that for n attributes A;, A, A3, .... An 
(A; Az A3... A,) 2 (A;) + (Az) + (A3) +... + (A,) -(2 - 1) N ..(11- 4) 
where N is the total number of observations. 
Solution. We have 
(0 O12) = 102. N = (1 — A;)(1 - Az). N = N = (Aj) - (A2) + (A A2) 
Since class frequency is always non-negative, we have 
(Q;Q) 20 = (A;A2) 2 (A) + (Az) —-N os(®) 
It follows that (11-4) is true for 2 attributes. 
Let us now suppose that (11-4) is true for 7 attributes A;, Az, ..., A, SO 
that 
(A; Az A3...A,) 2 (Ai) + (Az) + (Aa) +... + (A, -—(7 - IN 
Replacing the attribute A, by another compound attribute A,A,,,, we get 
(A; AzA3... ApApst) 2 (Ai) + (An) + (As) +...+ A-Anr) -— (7 - DN 
2 (A;) + (Aa) + (A3) +...+ ((A,) + Ani) — NJ -(r- 1)N 
(From (*)} 
, = (A)) + (Aa) +...+ (A,) + (Apa) — FN 
This implies that if (11-4) is true for n =7, it is also true form =r+ 1 
attributes. But we have seen in (*) that (11-4) is true for n = 2. Hence by 
mathematical induction, the result is true for all positive integral values of 7. 
Example 11-4. Show that if A occurs in a larger proportion of the cases 


where B is than where B is not, then B will occur in a larger proportion of cases 
where A is than where A is not. 


Solution. The problems can be restated as follows : 
Given (AB) prove that 5) 


> (AB) > 
) 7 @)’ (4)? (@ 
(AB) _ (AB) (>) _ (AB) 
ow () 7 @) ~ (©)? (Bs) 
=> 1+ (B) >1+ a 


(B) (AB): 
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N.S MAD 
aad (B) 7 (AB). 
N. , (B) 
a (A) 7 (AB) 
= (A) + (a) , (AB) + (aB) 
(A) (4B) 
(2) |, , (@B) 
=> V+ (A) 71+ (4B) 
(a) , (0B) 
= (A) 7 (AR) 
(AB)  (aB) : 
= (A) > (a) , as required. 


EXERCISE 11 (a) 


1. (a) Explain the following : (i) Order of a class, (ii) Ultimate classes and, 
(iit) Fundamental set of class frequencies, 

(6) What is meant by a class-frequency of (i) first order, (ii) third order ? 
How would you express a Class frequency of first order in terms of class 
frequencies of third order ? 

2. What is dichotomy ? Show that the continued dichotomy according to n 
attributes gives rise to 3" classes. 

3. (a) Given that (AB) = 150, (AB) = 230, (aB) = 260, (a8) = 2,340; find 
the other frequencies and the value of N. 

(5) Given the following frequencies of tie positive classes, find the 
frequencies of the rest of the classes : 

(A) = 977, (AB) = 453, (ABC) = 127, (B) = 1,185, (AC) = 284, 
N = 12,000, (C) = 596, and (BC) = 250. 

Ans, (AB) = 524, (aB) = 732, (a8) = 10,291, (By) = 935, (BC) = 346, 
(By) = 10,469, (Ay) =693, (aC) = 312, (ABy) = 326, (aBC) = 123, 

‘(@By) = 609, (ABC) = 157, (ABy) = 367, (ABC) = 189, (ay) = 10,192. 

4, Given the following data, find frequencies of (i) the remaining positive 
classes; and (ii) the ultimate classes ; 

N = 1,800, (A) = 850, (8) = 780, (C) = 326, (ABy) = 200, (ABC) = 94, 
(ABC) = 72, and (ABC) = 50. 

5. (a) Measurements are made on a thousand husbands and a thousand 
wives. If the measurements of the husbands exceed the measurements of the 
wives in 800 cases for one measurement, in 700 cases for another and in 660 
cases for both measurements, in how many cases will both measurements on the 
wife-exceed the measurements on the husband ? 

Ans. 160 

(b) An unofficial political study was made about the recent changes in 
Indian political scene and it was found that 919 Indira Gandhi Congress 
Supporters and 1,268 Organisation Congress supporters wanted socialistic 
economy, whereas 310 Indira Gandhi Congress supporters and 503 supporters of 
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the Organisation Congress wanted capitalistic economy in the country. Find out 
the total number of Indira Gandhi’s and that of the Organisation’s Supporters, 
giving the number of capitalistic economy’s and of the socialistic economy’s 
votaries, out of the individuals, who were surveyed. 

6. At a competitive examination at which 600 graduates appeared, boys 
Outnumbered girls by 96. Those qualifying for interview exceeded in number 
those failing to qualify by 310. The number of Science graduate boys 
interviewed was 300 while among the Arts graduate girls there were 25 who 
failed to qualify for intervew. Altogether there were only 135 Arts graduates and 
33 among them failed to qualify. Boys who failed to qualify numbered 18. 

Find (i) the number of boys who qualified for interview, 

(ii) the total number of Science graduate boys appearing, and 
(iii) the number of Science graduate girls who qualified. 

Ans, (7) 330, (2) 310, and (ii) 53. 

7. 100 children took three examinations A, B and C; 40 passed the first, 39- 
passed the second and 48 passed the third, 10 passed all the.three, 21 failed all 
three, 9 passed the first two and failed the third, 19 failed the first two and passed 
the third. Find how many children passed at least two examinations. Show that 
for the question asked certain of the given frequencies are not necessary. Which 
are they ? 

Ans. 38. Only frequencies required are (C),.(aBC), (ABy). 

8. In a university examination, which was indeed very tough, 50% at least 
failed in “Statistics”, 75% at least in Topology, 82% at least in “Functional 
Analysis” and 96% at least in “Applied Mathematics”. How many at least fatled 
in all the four ? (Ans. 3%) 

Hint. Use the result in Example 11-3. Page 11-6. 

9. If a collection contains N items, each of which is characterized by one 
or more of the attributes A, B, C and D, show that with the usual notations 

(i) (ABCD) 2 (A) + (8) + (C) + D) -3N, and 

(ii) (ABCD) = (ABD) + (ACD) - (AD) + (ADBy),- 
where § and y represent the characteristics of the absence of B and C 
respectively. 

10. Given (A) = (a) = (8) = (B) = +N; show that (AB) = (a8), (AB) = (aB). 

11. Given that (A) = (a) = (B) = (8) =(C)=(=5N 
and also that (ABC) = (ay), show that 2(ABC) = (AB) + (AC) + (BC) - $N. 

11-6, Consistency of Data. Any class frequencies which have been 
or might have been observed within one and the same population are said'to be 
consistent if they conform with one another and do not in any way conflict. For 
example; the figures (A) = 20, (AB) = 25 are inconsistent as (AB) cannot be 
greater than (A), if they are observed from the same population. 

‘Consistency’ of a set of class frequencies may be defined as the property 


that none of them is negative, otherwise, the data for class frequencies are said to 
be ‘inconsistent’. 
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Since any class frequency can be expressed as the sum of some of the 
ultimate class frequencies, it is necessarily non-negative if all the ultimate class 
frequencies are non-negative. This provides a criterion for testing the consistency 
of the data. In fact, we have the following theorem. 

Theorem 11:1. “The necessary and sufficient condition for the 
consistency of a set of independent class frequencies is that no ultimate class 
frequency is negative.” 

Remark. We can test the consistency of a set of 2" algebraically 
independent class frequencies by calculating the ultimate class frequencies. If any 
one of them is negative, the given data are inconsistent. 

11-6-1. Conditions for consistency of Data. Criteria for 
consistency of class frequencies are obtained by using theorem 11-1. For a single 
attribute A we have conditions of consistency as follows : 


@ (A)20 


ji) (a) 20 => (A)SN) (11-5) 
For two attributes A and B, the conditions of consistency are : 
@) (AB) 20 
(i) (AB) 2 0 = (AB) <(A) 
(i) (8B) = 0 = (AB)<@) 
(iv) (a8) = 0 = (AB)2(A) + ()-N) ...(11-6) 
Conditions of consistency for three attributes A, B and C are 
(i) (ABC) 2 0 


(z) (ABy) 2 0 = (ABC)S (AB) 

(iii) (ABC) = 0 = (ABC) < (AC) 

(iv) (aBC) > 0 = (ABC) S(BC) 

(vy) (AByy = 0 = (ABC)2 (AB) + (AC) -(A) 

(vi) (aBy) 2 0 = (ABC)2 (AB) + (BC) -@) 

(vii) (OBC) = 0 = (ABC)2 (AC) + BC) -(C) 
(viii) (aBy) 2 0 = (ABC)< (AB) +(BC) + (AC)-(A)-@6)-(+N 


(11-7) 
() and (viii) in (11-7) give : 
(AB) + (BC) + (AC) 2 (A) + (8B) + C) - WW) 
Similarly 
Gi) ad (iit) => (AC) +(C)-(B)s(C) »oe(11-8) 


(i) and (vi) =>. (AB) + (BC)- (AC) Ss @) 
(iv) ad (v) = (AB)+(AC)-(BC)S(A) 


Remark. As already pointed out (cf. Remarks (3) and (4),.§ 11-4-2)], 2" 
algebraically independent class frequencies are necessary to specify the data 
completely, one such set being the set of ultimate class frequencies and the other 
being the set of positive class frequencies. If the data supplied are incomplete so 
that it is not possible to determine all the ciass frequencies, then the conditions 
(11-5), (11-6) and (11-8) for one, two and three attributes respectively, enable us 
to assign the limits within which an unknown class frequency can lie. 
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Example 11-5. Examine the consistency of the following data : 

N = 1,000, (A) = 600, (B) = 500, (AB) = 50, the symbols ‘having their 
usual meaning. 

Solution. We have 

(aB) = N ~ (A) —(@) + (AB) = 1000 — 600 — 500 + 50 = -50. 
Since (af) < 0, the data are inconsistent. 

Example 11:6. Among the adult population of a certain town 50 per 
cent are males, 60 per cent are wage earners and 50 per cent are 45 years of age 
or over, 10 per cent of the males are not wage-earners and 40 per cent of the 
males are under 45. Make the best possible inference about the limits within 
which the percentage of persons (male or female) of 45 years or over are wage- 
earners. 

Solution. Let N = 100. Then denoting males by A, wage-earners by B and 
45 years of age or over by C, we are given : 

N = ms (A) = 50, (B) = 60, ene 


(AB) = x 50 =5, (Ay) =x 50 = 20 


(AB) = )- (AB) = 45, (AC) = (A) - (Ay) = 30 
We are required to find the limits for (BC). 
Conditions of consistency (11-8) give 


(i) (AB) + (BC) + (AC) 2 (A) + (B) +(C)-N 

= (BC) > 50 + 60 + 50 - 100 - 45 - 30 = -15 
(ii) (AB) + (AC) ~ (BC) S A 

=> (BC) > (AB) + (AC) -(A) =45 + 30 - 50 = 25 
(iii) (AB) + (BC) ~- (AC) <s-@) 

=> (BC) < (B) *(AC) ~ (AB) = 60 + 30 —45 = 45 


(iv) (AC) + (BC) ~ (AB) s (CC) 
(BC) < (C) + (AB) = (AC) = 50 + 45 - 30 = 65 
(iy to (iv) => 25 < (BC) <45 
Hence the percentage of wage-earning population of 45 years or over must 
Lie between 25 and 45. 
Example 11-7. /n a series of houses actually invaded by smallpox, 70% 
of the inhabitants are attacked and 85% have been vaccinated. What is the lowest 
percentage of the vaccinated that must have been attacked ? 
Solution. Let A and B denote the attributes of the inhabitants being 
attacked and vaccinated respectively. Then we are given : 
N= 100, (A) =70 and (8) =85 
Consistency condition gives: 
(AB) 2 (A) + (8) -N = (AB) 255 
Hence the lowest percentage of inhabitants vaccinated, who have been 
attacked is = 


(AB). 1) _ 55 
(py * 100 = g5x 100 = 64-7% 
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Example 11:8. Show that if 


(AB) _ (BC) _ (CA) _ 


then the value of neither x nor y can exceed I 14. 
Solution. Conditions of consistency give : 


(AB) < (A) => Ny S<Nx = ySx A ()) 
Also (BC) 2 (B)+(C)-N 

(BQ) , (8), © 
=> N 2 Nt N ~] 
> y 2 2x+3x-1 ve 
= 5x-1 <y .- (it) 
(i) and (i) give 

Sx-1 $x => 4xS1 > xs} ...(iii) 


Thus from () and (iii) we have y <x < - which establishes the result. 


Example 11:9. Show that (i) If all A's are B’s and all B’s are C's then 
all A’s are C'S, (ti) If all A’s are Bs and no B’s are C’s then no A’s are C’s. 


Solution. (i) AIlA’s are B’s = (AB) =(A) } (4) 
and all B’sareC’s => (BC)=(B) eng 

To.prove a (AC) = (A) 

Wehave (AB) +(BC)-(AC) <(B) 

> -(A) + (B)- (AC) <(@)’ (Using (*)] 

=> (A):s (AC) = (AC) 2 (A) 


But since (AC) + (A), we have (AC) = (A), as desired. 
(ii) We are given (AB) = (A) and (BC) = 0 and we want to prove (ACy= 0. 
‘We have’ 

(AB) + (AC) - (BC) < (A) - 
= (A) + (AC) -0 < (A) 
=> (AC) <, 0: Pa 
And since (AC) 2 0, we must have (AC) = 0. 


te EXERCISE 11(b) 

1. What do you understand by consisténcy of given data ? How do you 
check it ? 

2. (a) If a report gives the following frequencies as actually observed, show 
that there must be a misprint or mistake of some sort, and that possibly the 
misprint consists in the dropping of 1 before 85 given as the frequency (BC) : 

N = 1000, (A) =510, (B) =490, (C) =427, (AB) = 189, (AC) = 140, (BC) =85. 
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(b) A student reported the results of a survey in the following manner, in 
terms ei the usual notations : 
= 1000, (A) = oe (B) = 312, (C) = 470, (AB) = 42, (BC) = 86, 
Pe = 147, and (ABC) = 
Examine the 0 of the above data. 
(c) Examine the consistency and adequacy of the following data to determine. 
the frequencies of the remaining positive and ultimate classes. . 
N = 10,000, (A) = 1087, (B) = 286, (C) = 877, 
(CAB) = 281, (CaB) = 86, a 78, (ABC) = 57 


3. Given that (A) = (B) = (C) =5 iN and‘ 80 per cent of.A’s are B’s, 75 per 


cent of A’s are C’s, find the limits to the percentage of B’ s that are C’s. 
Ans. 55% and 95%. 
4. If (A) = 50, (B) = 60, (C) = 50, (AB) = S, (AY) = 20, N = 100, find the 
greatest and the least possible values of (BC) so that the data may be consistent. 
Ans. 25 < (BC) < 45 


. If 1,000 = N = 12(A) = 2(B) = 23 (C) = 5 (AB), and (AC) = (BC), what 


should be the minimum value of (BC) ? : 

Ans. 150 . ae 

6. Given that (A) = (B) = (C) =3N = 50 and (AB) = 30, (AC) = 25, find 
the limits within which (BC) will lie. 

7. In a university examination 65% of the candidate passed in English, 
90% passed in the second language and 60% passed in the optional subjects. 
Find how many at least should have passed the whole examination. 

Ans. 15%. Hint. Use Example 11.3. 

8. A market investigator returns the following’ data. Of 1,000 people 
consulted 811 liked chocolates, 752:liked toffees and 418 liked boiled sweets, 
570 liked both chocolates and toffees, 356 liked chocolates and boiled sweets and 
348 liked toffees and boiled sweets, 297 liked all three. Show that this 
information as it stands must be incorrect. 

9. (a) In a school, 50 per cent of the students are boys, 60 per cent are 
Hindus and 50 per cent are 10 years of age or over. Twenty per cent of the boys 
are not ‘Hindus and 40 per cent of thie boys are under 10. What conclusions can 
you draw in regard to percentage of Hindu students of 10 years or over ? 

(6) In a college. 5O per cent of the students are boys, 60 per cent of the 
student are above 18 years and 80‘per cent receive scholarships. 35 per cent of 
the students are bcys above 18 years of age, 45, per cent are’ boys receiving 
scholarships, and 42 per cent are above 18 years and receive scholarships. | 
Determine the limits to the proportion of boys above 18 years who are in receipt 
of scholarships. 

Ans. Between 30 and 32. ; . 

10. The following Summary appears in’a,feport on a survey covering 1,000 
fields. Scrutinise the numbers and point out if there is any mistake or misprint 
in them. 
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Manured fields 510 
Irrigated fields 490 
Fields growing improved varieties 427 
Fields both irrigated and manured 189 
Fields both manured:and growing improved varieties 140 
Fields both irrigated and growing improved varieties 85 
Hint. Let A: manured fields, 
B : Irrigated fields 
and C : Growing improved varieties; then (aBy) < 0. 


11. A social survey in a village revealed that there were more uneducated 
employed males than educated ones; there were more educated employed males 
than uneducated unemployed males. There were more educated unemployed under 
35 years of age than employed uneducated males over 35 years of age. Show that 
there are more uneducated employed males under’35 ware of age than educated 
unemployed males over 35 years of age. 

12. In a war between White and Red forces, there are more Red soldiers than 
White, there are more armed Whites than unarmed Reds, there are fewer armed 
Reds with ammunition than unarmed Whites without ammunition. Show that 
there are more armed Reds without ammunition than unarmed Whites with 
ammunition. 


13. Given that (A) = (B} = (C) = Nee = GO) - p, find what must be 


the greatest and least values of p in sais Pe we may infer that (BC)/N, exceeds 
any given value, say gq. 


Ans. ;(1 —2q)<p<4(1 +24). 


11-7. Independence of Attributes. Two attributes A and B are said 
to be independent if there exists no relationship of any kind between them. If A 
and B are independent, we would expect (i) the same proportion of A’s amongst 
B’s as amongst B’s, (ii) the Proportion of B’s amongst A’s is same as that 
amongst the o's. For example, if insanity and deafness are independent, the 
proportion of the insane people among deafs and non-deafs must be Same. 
11-7-1. Criterion of Independence. If A and B are independent, then 
(#) in § 11-7 gives . 


a oO .. (11-9) 
= 1 - =1- UM 
=> os ‘= sy | ...(11-9a) 
Similarly, (¢i) in § 11-7 gives 
| 4 7 oO 11-10) 
=e , 45) _,_@B) 


(4) = @) 
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(AB) _ (of) : 
=> (A) = (a) ..-(11-10a) 


In fact (11-9) = (11-10) and vice-versa. 
For example, (11-9) gives 
(AB) _ (AB) _ (AB) + (ABD _ A) 
6) @) (&)+(®) ~ 
(AB) _ (B)_ (B)- (AB) _ (os), 
7 (A) “N~ N-(A) ~ @)” is 
which is (11°10). Similarly, starting from (11-10), we would arrive at (11-9). 


It becomes easier to grasp the nature of the-above relations if the frequencies 
are supposed to be grouped. into a table-with two rows and two columns as 


follows, : 
oc 
oc 


Second criterion of independence = be obtained in terns “Of thé class 
frequencies of first order. (11-10) gives 


(apy = Ad) ; (11-11) 
as 4B) _A@ | ® .(11-1a) 


which leads to the following important fundamental rule : 

“If the attributes A and.B are independent, the proportion of AB’s in the 
population is equal to the product of the proportions of A’s and B's in the 
population.” 

We may obtain a third criterion of independence in terms of second order 
class frequencies, as follows. 


(AB). (a8) =AY@) (0 (8) _ (A) B) (ay B) (Using 11-11) 
(AB) . (aB) = (AB). - 
os (AB) 


=m Z vo(11-12) 
(af) 
Aliter. (11-12) may also be ia from(11 ‘9) and (11-9a) as explained 
velow: 


inate AB) (BD (ah 
(11-9) and (11-94) => (AB) .®) (af) 
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(AB)(aB) = (AB) . (a8) 
Se: (11-10) and (11-10a) give the same result. 
11-7-2. Symbols (AB) and 8. Let us write 


(AB)o =“) ...(11-13) 


which is the value of (AB) under the hypothesis that the attributes A and B are 
independent. 

Let 5 = (AB) -—(AB)po .» (11-14) 
denote the excess of (AB) over (AB). Then 


8 = (4B) - AS) _ 2 [w (any - (4) 
=~ [ ((AB) + (AR) + (@B) + (@B)} (4B) 
— {(AB) + (AB)} {(AB) + uid 
=~ | (AB) (a) - (AB) (0) | (On simplification) 


(11-12) => 5=0,ifAandB are independent. .- (11-15) 
Example 11-10. [£5 = (AB) —(AB)po, then with usual notations, prove 
that 
() [(A) - (@)][G) - (B)] + 2N6 = (AB)? + (af)? - (AB)? - (0B)? 


)B) { (AB) (4B) _(AY@) § (AB) (@B) 
oe ioe ON ri) I 


Solution. (i) We have 6 = (AB) — (AB) = (AB) ~ (A) (8) 


N 
L.H.S. = [(4) - (@)][@) - (B)] + 2N5 
= [(AB) + (AB) ~ (@B) - (oB))[(AB) + (WB) - (AB) - (B)] 
+ 2N [ <4) 2 Ae) } 
= [((AB)— (@B)} + ((AB) — (@B)}.J[((AB) — (aB)} - (CAB) - @B)}) 
+ 2[N(AB) — (A) (8)] 
= [(AB) - (@B)]? - [(AB) - (aB)?? 
+ 2[(AB){(AB) + (AB) + (.B) + (@B)} — {(AB) + (AB)} ((AB) + (0B))] 
= [(AB)? + (aB)? — 2(AB) (aB)] 
—-[(ABY + (@B)? - 2(AB) (aB)] + 2[(4B) (aB) ~ (AB) (aB)) 
(On simplification) 
= (AB)? + (aB)? - (AB)? ~ (0.B)? = RLS. 


ci) PHD | EO -O |=] cB cas)- eras | 
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=i (AB) (N - (B))~ 8) (4) - 48) | 


=H . (AB) - (A) (B) |- (AB) - AO) _ 5 


Since 6 is symmetric in A and B, by interchanging-A and B, we will 
obtain the second result. 

11-8. Association of “Attributes. Two attributes A and B are said 
to be associated if they are not independent but are related in some way or the 
other. They are said to be 

(A) (B) 
N 


positively associated if (AB) > 


. (11-16) 
and negativel associated if (AB) < Oe 


In other words, two attributes A and B are positively associated if 5 > 0, 
negatively associated if 5 < 0 or and are independent if 5 = 0 (c.f. § 11-7-2). 

Remarks 1. Two attributes A and B are said to be completely associated 
if A cannot occur without B, though B may occur without A and vice-versa. In 
other words, for complete association either all A’s are B’s i.e., (AB) = (A) or all 
B’s are A’si.e., (AB) = (B) according as either A’s or B’s are in a minority. 
Similarly, complete dissociation means that no A’s are B’s i.e., (AB) = 0 or no 
a’s are B’s i.e., (aB) = 0 or more generally when either of these Statements is 
true. 


2. It should be carefully noted that the word ‘association’ used in Statistics 
is technically different from the general notion of association as used in day-to- 
day life. Ordinarily, two attributes are said to be associated if they occur together 
in a number of cases. But statistically two attributes are said to be associated if 
they occur together in a large number of cases than expected if. they were 
independent, i.e., if 5 = (AB) — (A) (B)/N > O. In Statistics, the statement that 
“some A’s are B’s”, however great the proportion, does’ not necessarily imply 
association between them. Thus to find out if two attributes are associated, we 
must know (A), (8), (AB) and N. Incomplete information will not enable us to 
conclude anything about association between them. For example, consider the 
following statement : 

“90 per cent of the people who drink alcohol die before reaching the age of 
75 years. Hence drinking is bad for longevity of life.” 

The inference drawn is not correct, since the given information is not 
complete for drawing any valid conclusions about association. It might happen 
that 95% of the people who do not drink, die before reaching 75 years of age. In 
. that case drinking might be found good for longevity of life. 

3. Sampling fluctuations. If 5 # 0 and its value is fairly small, then it is 
possible that this association is just by chance (or commonly termed as ‘due to 
fluctuations of sampling) and not. really significant of any-real association 
between the attributes. We should not, therefore, draw hasty conclusions about 
association or dissociation unless 5, the difference between (AB).and its expected 
value (under the hypothesis of independence) (A) (8)/N, is significant. The 
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problem : “how much difference is to be regarded as significant’ will be discussed 
in detail in Chapters 12 (Large sample test for attributes) and 13 (Chi-square test 
of goodness of fit). This point has been raised here only as a precautionary 
measure to warn the reader against drawing hasty inferences. 

11-8-1. Yule’s Coefficient of Association. As a measure of the 
intensity of association between two attributes A. and B, G. Udny Yule gave the 
coefficient pe ay Q, defined as follows : 

Q= -—(A aB BeOS 

(a) (aB) + (AB) (0B)  (aB) (aB) + (AB) (aB) 

If A and B are independent, 5=0 => Q=0. 

If A and B are completely associated, then 
either (AB)=(A) = (AB)=0 

or (AB) = (8) => (aB)=0 
and in each case Q = +1. 

If A and B are ini complete dissociation then either (AB) = 0 or (aB) = 0 and 
we get Q =-1. 

Hence -1<Q<1 .-(11-18) 

Remark. An important property of Q .is that.it is independent of the 
relative proportion of A’s or a’s in the data. Thus if all the terms containing A 
in Q are multiplied by a constant, k'(say), its value remains unaltered. Similarly 
for B, B and a. This property renders it specially useful to situations where the 
proportions are arbitrary, e.g., experiments. 

118-2. Coefficient of Colligation. Another coefficient with the 
same an as Q, is the coefficient of colligation Y, given by 


| (AB)(aB) 
{1 -a) Adie Fae 7: tt + ( ose ..(11-19) 
1- 


Remarks -.1. Obviously Q=0 => Y= = 0, 


»-(11-17) 


4 
Q=-1 = Y=-landQ=1 =>’ Y=1 and Conversely. 
AB) (aB 
2, If we let ABL-LCB) _ p50 tha 
we et (4B) (OB) so that 
y -i-Vk = y= 1 + - 2Nk 
1+Vk 1+4k+2Nvk 


= 1+Y? = 2(1 + k) __201 + k) 
14k+2Vk (1+Vk) 


2y_ ai -Vey(i+Vi t-k 
*  T472 = 2048 “1T+k 
: c (AB) (0B) 
(AB) (a8) _ (AB) (o.B) - (AB) (0B) 
= 7, AB) (a) * (AB) (aB) + (AB) (0B) 
(AB) (aB) 
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Example 11-11. Find if A and B are independent, positively associated 
or negatively associated, in each of the following cases : 
(i) N = 1000, (A) = 470, (B) = 620, and (AB) = 320. 
(ii) (A) = 490, (AB) = 294, (a) = 570, and (aB) = 380. 
(iii) (AB) = 256, (0B) = 768, (AB) = 48, and (aB) = 144, 
Solution. (i) 5 = (AB) ae) Ae 
470 x 620 
= 320 1000 = 320 — 291-4 = 28-6 
Since 5 > 0, A and B are positively associated. 
(ii) We have N = (A) + (a) = 490 + 570 = 1060 
(B) = (AB) + (B) = 294 + 380 = 674 


8 = (48)-A®) - 294 oxen 294 — 311-6 <0 


Hence A and B are negatively associated. 
(iti) (A) = (AB) + (AB) = 256 + 48 = 304 
(B) = (AB) + (0B) = 256 + 768 = 1024 
N = (AB) + (AB) + (0B) + (a@B) = 256 + 48 + 768 + 144 = 


1216 
ee te (A) (B) _ 304x 1024 
5 = (AB) - N= 256- 1216 =0 
‘Hence A and B are independent. 


Aliter. Since all the four frequencies of order 2 are given, usjng (11-15), 
we have 


5 =~ [ (AB) (af) - (AB) (aB)] = [256 x 144 - 48 x 768] 


= 38 [144 — 48x 3] =0 
= A and B are independent. 
Example 11-12. Irivestigate the association between darkness of eye- 
colour in father and son from the following data : 


Fathers with dark eyes and sons with dark eyes : 50 
Father's with dark eyes and sons with not dark eyes : 79 
Fathers with not dark eyes and sons with dark eyes : 89 
Fathers with not dark eyes and sons with not dark eyes : 782 


Also tabulate for comparison the frequencies that would have been observed 
had there been no heredity. 


Solution. Let A: Dark eye-colour of father and 
B: Dark eye-colour of son. 
Then we are given (AB) = 50, (AB) = 79; (@B) = 89, (aB) =.782 
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50 x 782 -79 x 89 32069 
O =50 x 782+ 79x 89 46131 * 0 
Hence there is a fairly high degree of positive associatiori between the eye 
colour of fathers and sons. 
Wehave (A) =(AB)+(AB)=50+79 =129 
(B) = (AB) + (0B) =50+89 =139 
(x) = (aB) + (af) = 89 + 782 =871 
. = (AB) + (aB) = 79 + 782 = 861 
=(A)+(a) = 129 + 871 = 1000 
Under the ian of no heredity, i.e., independence of attributes A and B, 
we have 


(AB) ={4)B) _ 129139 _ 1g. (4p, A 1B) . Me 26 i 


N 1000 1000 
_ (a) (B) _ 871 x 139 le 20). 871 x 861 
(CB)p = a a oo = 121: (Blo = Sa se 


Example 11-13. Can vaccination be regarded as a see measure for 
small pox from the data give below ? 


' ‘Of 1482 persons in a locality exposed to small-pox, 368 in all were 
attacked.’ 
‘Of 1482 persons; 343 had been vaccinated and of these only 35 were 
attacked.’ 
Solution. Let A denote the attribute of vaccination and B that of attack by 
small-pox. Then the given data are : 
N = 1482, (A) = 368, (B) = 343 and (AB) = 35 
(a8) = N - (A) — (B) + (AB) = 1482 — 368 — 343 + 35 = 806 
(AB) = (A) — (AB) = 368 - 35 = 333 
(aB) = (B) — (AB) = 343 — 35 = 308 
Q = (AB) (0B) — (AB) (@B) _ 35 x 806 - 333 x 308 = —0.57 
(AB) (a6) + (aB) (0B) “35 x 806 + 333 x 308 
Thus, there is negative association between A and B i.e., between ‘attacked’ 
and ‘vaccinated’. In other words, there is positive association between not 
attacked and vaccinated. Hence vaccination can be regarded as a preventive 
measure for smallpox. 
EXERCISE 11(c) 


1. @) What do you mean by independence of attributes ? Give a ‘criterion 
of independence for attributes A and B. 

(b) What are the various methods of finding whether two attributes are 
associated, dissociated or independent ? Deduce any one such measure of 
association. 


(c) When are two attributes Said to be positively associated and negatively 
associated ? Also define comiplete association and dissociation of two attributes. 


(d) Derive an expression for a mzasure of association between two 
attributes. 
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(e) What is association of. attributes ? Write a note on the strength of 
association and how it is measured ? 

(f) Find whether the attributes o and B are positively associated, negatively 
associated or independent. Given (AB) = 500, (a) = 800, (8) = 600, N = 1500. 

2. (a) Define Yule’s coefficient of association and the coefficient of 
Colligation. Establish the following relation between coefficient of association 
Q and coefficient of colligationY: © 

__2Y 
Q= 1 + Y2 

(b) For the following table, give Yule’s coefficient of association (Q) and: 

elie of Colligation (Y). Examine the cases (i) bc = 0, (ii) ad = 0, and (iii) 


B notB 
A a b 
not A Cc d 


Ans. Q=1=Y if bc =O and Q =-1) = Y if ad =0 and Q = 0 if ad = bc. 
(c) Prove that in the usual notations Q = 2Y/(1 + Y?). What is the range of 
values for Q ? 


(d) If an attribute A is known to be completely associated with an attribute 
B, (i) what can’ you infer about the association between o. and B ? (a and B are 
equivalent to ‘not A’ and ‘not B’ respectively), (ii) a and B ? 
3. (a) The following table is reproduced from a memoir written by Karl 
Pearson : 
Eye colour in son 


Not light Light 
Eye colour { ‘Not light 230 148 
in fatherf Light 151 471 


Discuss if the colour of son’s eyes is associated with that of father. 

Ans, Yes. Positively associated, Q = 0-66. 

(6) The following table shows the result of inocculation against cholera. 
Not attacked Anata 


Inacculated 431 

Not-inocculated 291 9 

Examine the effect of inocculation in controlling susceptibility to cholera, 
Ans. Inocculation is effective in-controlling cholera. ; 


4, (a) find the association between proficiency .in English end in Hindi 
among candidates at a certain test if,245-of them passed in Hindi, 285 failed in 
Hindi, 190 failed in Hindi but passed in English and 147 passed in.both. 

(6) The male population of a state is 250 lakhs. The number of literate. 
males is 20 lakhs and total number of male criminals is 26 thousand. The 
number of literate male criminals is 2 thousand. Do you find any association 
between literacy and criminality ? 


Ans, Literacy and criminality are positively associated. 
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(c) From the following particulars find whether blindness and baldness are 
associated : 


Total population 1,62,64,000 
Number of baldheaded 24,441 
Number of blind 7,263 
Number of baldheaded blind 221 


§. In a certain investigation carried on with regard to 500 graduates and 
1500 non-graduates, it was found that the number of employed graduates was 
450 while the number of unemployed non-graduates was 300. In the second 
investigation 5000 cases were examined. The number of non-graduates was 3000 
and the number of employed non-graduates was 2500. The number of graduates 
who were found to be employed was 1600: 

Calculate the coefficient of association between graduation and employment 
in both the investigations. 

Can any definite conclusion be drawn from the coefficients ? 

Ans, Q (lst Investigation) = +0-38, Q (Second Investigation) = — 0-11 

6. (a) Three aptitude tests A, B, C were given to 200 apprentice traincées. 
From amongst them 80 passed test A,'78 passed test B and 96 passed the third 
test. While 20 passed all the three tests, 42 failed all the three, 18 passed A and 
B but failed C and 38 failed A and B but passed the third. Determine (7) how 
many trainees passed at least two of the three tests and (ii) whether the 
performances in tests A and B are associated. Ans. (i) 76, (ij) Q=0:3 

(6) In a survey of a population of 12000, information is gathered regarding 
three attributes A, B and C. In the usual notations, 

(A) = 980; (AB) = 450, (ABC) = 130 
(B) = 1190, (AC) = 280, (C) = 600 and (BC) = 250. 
Find: (@ (aBy) GidQazs = Coefficient of Association between A and £: 

Comment on your findings. 

7. A group of 1000 fathers was studied and it was found that 12-9% had 
dark eyes. Among them the ratio of those having sons with dark eyes to those 
having sons with not dark eyes was 1 : 1-58. The number of cases where fathers 
and sons both did not have dark eyes was 782. Calculate coefficient of 
association between darkness of eye colour in father and son. Give the 
frequencies that would have been observed had there been completely no heredity. 

Hint. (AB) = 50, (AB) = 79, (QB) = 89 and (a) = 782. 

8. A census revealed the following figures of the blind and the insane in 
two age-groups in a certain population : 


Age-Group Agé-Group 
I5S—25 years over 25 years 
Total population 2,70,000 1,60,200 
Number of-blind 71,000 2,000 
Number of insane 6,000 1,000 
Number of insane among the blind 19 9 


(¢) Obtain a measure of association between blindness and insanity for each 
age-group. 
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(ii) Which group shows more association or dis-association (if any) ? 

9. Show that if (AB),, (0B), (AB);, (aB); and (AB)», (@B)2, (AB). and 
(aB)2 be two aggregates rene to the same values of (A), (B), (a) and 
(B), then 

(AB), ~ (AB), = (0B)2 - (8); = (AB), — (AB); = (aB); — (@B)2 


10. Show that if § = (AB) ~ “e) then 
bey aia (AB)\(oB)] 


OBJECTIVE TYPE QUESTIONS 
‘ 1. State, giving reasons, whether each of the following statements is true 
or false : 
(i) There is no difference between correlation and association. : 
(ii) All the class frequencies of various orders are independent of each other. 
(i) If the attributes: A and B are positively associated, then,a and B are also 
positively associated. 
(iv) Square.of Yule’s coefficient of association carinot exceed 1. 
(v) Yule’s coefficient of association cannot be negative. 
(vi) For. two attributes A and B, the coefficient of association Q is 0-36. If 
each ultimate class frequency is doubled then @ is 0-72. 
(vii) If (AB) = 10, (aB) = 15, (AB) = 20 and (a8) = 30, then A and B are 
associated. ; 
(viii) If every item which possesses an attribute A possesses the attribute B 
as well, then the coefficient of association between A and Bis 1. 
Hi. Indicate the correct answer : 
(i) Incase of two attributes A and B, the ultimate class frequencies-are - 
(a) : (A), (6) : (AB), (c) : (a), (d@): (B). 
(ii) The condition for ‘the consistency of a set of independent class 


frequencies is that no ultimate class’ frequency is (a) zero, (b) positive, 
(c) negative. 


(iii) Attributes A and B are said to. be independent if 
(a) (AB) ar a , (6) (AB) = xB) . (c) (AB) < Se) 


(iv) siete A and B are Said to.be eof vely associated if 
(qy AB) < (AB) (yy (ABD _ (AB) (, (AB), (AB) (AB) _ (AB) 
(B)  ©@®)’ 6 @®’" ®° ®’’@- @& 
(v) If N = 50, (A) = 35, (B) = 25,. (AB) = 15, then the aibuics A and B 
are said to be: 
(a) correlated, (6) independent, (c) negatively associated, , Posey 
associated 
(vi) When there is a perfect positive association between two seine: Q 
would be (a) zero, (b)- 0-9, (c)-1, (d) +1. - 


CHAPTER TWELVE 


Sampling and Large Sample 
Tests 


12-1. Sampling—Introduction. Before giving the notion of 
sampling we will first define population. In a statistical investigation the 
interest usually lies in the assessment of the general magnitude and the study of 
variation with respect to one or more characteristics relating to individuals 
belonging to a group. This group of individuals under study is called population 
or universe. Thus in statistics, population is an aggregate of objects, animate or 
inanimate, under study. The population may be finite or infinite. 

It is obvious that for any statistical investigation complete enumeration of 
the population is rather impracticable. For example, if we want to have an idea 
of the average per capita (monthly) income of tue people in India, we will have 
to enumerate all the earning individuals in the country, which is rather a very. 
difficult task. 


If the population is infinite, complete enumeration is not possible. Also if 
the units are destroyed in the course of inspection.(e.g., inspection of crackers, 
explosive materials, etc.), 100% inspection, though possible, is not at all 
desirable. But even if the population is finite or the inspection is not destructive, 
100% inspection is not taken recourse to because of multiplicity of causes, viz., 
administrative and financial implications, time factor, etc., and we take the help 
of sampling. 

A finite sibset of statistical individuals in a population is called a sample 
and the number of individuals in a sample ts called the sample size.. 

For the purpose of determining population characteristics, instead of 
enumerating the entire population, the individuals in the sample only are 
Observed. Then the sample characteristics are utilised to approximately determine 
or estimate the population. For example, on examining. the sample of a 
particular stuff we arrive at a decision of purchasing or rejecting that stuff. The 
error involved in such approximation is known as sampling error and is inherent 
and unavoidable in any and every sampling scheme. But sampling results in 
considerable gains, especially in time and cost not only in respect of making 
Observations of characteristics but also in the subsequent handling of the data. 

Sampling is quite often used in our day-to-day practical life. For example, in 
a shop we assess the quality of sugar, wheat or any other commodity by taking a 
handful of it from the bag and then decide to purchase it or not. A housewife 
normally tests the cooked products to find if they are properly cooked and contain 
the proper quantity of salt. 

12:2. Types of Sampling. Some of the commonly known and 
frequently used types of sampling are : 
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(i) Purposive sampling, (ii) Random sampling, (iii) Stratified sampling, 
(iv) Systematic Sampling. 

Below we will precisely explain these terms, without entering into detailed 
discussion. 

12-2-1. Purposive Sampling. Purposive sampling is one in which 
the sample units are selected with definite purpose in view. For example, if we 
want to give the picture that the standard of living has increased in the City of 
New Delhi, we may take individuals in the sample from rich and posh localities 
like Defence Colony, South Extension, Golf Links, Jor Bagh, Chanakyapuri, 
Greater Kailash etc. and ignore the localities where low income group and the 
middle class families live. This sampling suffers from the drawback of 
favouritism and nepotism and does not give a representative sample of the 
population. 


12-2:2 Random Sampling. In this case the sample units are selected 
at random and the drawback of purposive sampling, viz., favouritism or 
subjective element, is completely overcome. A random sample is one in which 
each unit of population has an equal chance of being included in it. 

Suppose we take a sample of size n from a finite population of size VN. Then 
there are YC, possible samples. A sampling technique in which each of the NC, 
samples has an equal chance of being selected is known as random sampling and 
the sample obtained by this technique is termed as a random sample. 


Proper care has to be taken to ensure that the selected samplé is random. 
Human bias, which varies from individual to individual, is inherent in any 
sampling scheme administered by human beings. Fairly good random samples 
can be obtained by the use of Tippet’s random number tables or by throwing of 
a dice, draw of a lottery, etc. 


The simplest method, which is normally used, is the lottery system which 
is illustrated below by means of an example. 

Suppose we want to select ‘r’ candidates out of n. We assign the numbers 
one to n, one number to each candidate and write these numbers (1 to 2) onn 
slips which are made as homogeneous as possible in shape, size, etc. These slips 
are then put in a bag and thoroughly shuffled and then ‘r’ slips are drawn one by 
one. The ’r’ candidates corresponding to the numbers on the slips drawn, will 
constitute thé random sample. 


Remark. Tippet's Random Numbers. L.H.C. Tippet’s random numbers 
tables consist of 10400 four-digited numbers, giving in all 10400 x 4, i.e., 
41600 digits, taken from the British census reports. These tables have proved to 
be fairly random in character. Any page of th¢ table is selected at random and the 
number in any row or column or diagona! selected at random may be taken to 
constitute the sample. 

12-2-3. Simple Sampling. Simple sampling is random sampling in 
which each unit of the population has an equal chance, say p, of being included 
in the sample and that this probability is independent of the previous drawings. 
Thus a simple sample of size n from a population may. be identified with a series 
of n independent trials with constant probability ‘p’ of success for each trial. 

Remark. It may be pointed out that random sampling does not necessarily 
imply simple sampling though, obviously, the converse is true. For example, if 
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an urn contains ‘a’ white balls and ‘b’ black balls, the probability of drawing a 
white ball at the first draw is (a/(a + b)) = p,, (say) and if this ball is not replaced 
the probability of getting a white ball in the second draw is [(a- 1)(a@ +5-1)] 
= p2 # p;, the sampling is not simple. But since in the first draw each white ball 
has the same chance, viz., a/(a + b), of being drawn and in the second draw again 
each white ball has the same chance, wiz., (a — 1)/(a + b — 1), of being drawn, 
the sampling is random. Hence in this case, the sampling, though random, is 
not simple. To ensure that sampling is simple, it must be done with 
replacement, if population is finite. However, in case of infinite population no 
replacement is necessary. 


12:2°4. Stratified Sampling. Here the entire heterogeneous 
population is divided into a number of homogeneous groups, usually termed as 
strata, which differ from one another but each of these groups is homogeous 
within itself. Then units are sampled at random from each of these stratum, the 
sample size in each stratum varies according to the relative importance of the 
stratum in the population. The sample, which is the aggregate of the sampled 
units of each of the stratum, is termed as stratified sample and the technique of 
drawing this sample is known as stratified sampling. Such a sample is by far the 
best and can safely be considered as representative of the population from which 
it has been drawn. 

' 12-3. Parameter and Statistic. In order to avoid verbal confusion 
with the statistical constants of the population, viz., mean (1), variance 02, etc., 
which are usually referred to as parameters, statistical measures computed from 
the sample observations alone, e.g., mean (x ), variance (s2), etc., have been. 
termed by Proféssor R.A. Fisher as statistics. 


In practice, parameter values are not known and the estimates based on the 
sample values are generally used. Thus statistic which may be regarded as an 
estimate of parameter, obtained from the sample, is a function of the sample 
values only. It may be pointed out that a statistic, as it is based on sample 
values and as there are multiple choices of the samples that can be drawn from a 
population, varies from sample to sample. The determination or the 
characterisaton of the variation (in the values of the statistic obtained from 
different samples) that may be attributed to chance or fluctuations of sampling is 
one of the fundamental problems of the sampling theory. 

Remarks 1. Now onwards, 1 and o? will refer to the population mean and 
variance respectively while the sample mean and variance will be denoted by 
x and s? respectively. 

2. Unbiased Estimate. A statistic t = t(x,, X2, ..., X,), a function of the 
sample values x1, X2, ..., X, iS an unbiased estimate of population parameter 6, 
if E(t) = 8. In other words, if 

E(Statistic) = Parameter, ..( 12-1) 
then statistic is said to be an unbiased estimate of the parameter. 

12:3:1. Sampling Distribution of a Statistic. If we draw a sample 
of size n from a given finite population of size N, then the total number of 
possible samples is : 

“N N} 
Ca=F ! (N _ n) pak, (say). 


= 
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For each of these k samples we can compiite some statistic ¢ = t(x1; X2, ..., 
x,), in particular the mean x, the variance s*, etc., as given below : 


Statistics 


The set of the values of the statistic so obtained, one for each sample, 
constitutes what is called the sampling. distribution of the statistic. For 
example, the values ¢,, t2, t3, ..., t determine the. sampling distribution of the 
statistic ¢. In other words, statistic ¢ may be regarded as a random variable 
which can take the values ¢), t2, 3, ..., t, and we can compute the various 
statistical constants like mean, variance, skewness, kurtosis etc., for its 
distribution. For example, the mean and variance of the sampling distribution of 
the statistic ¢ are given by : 


a | ji 
t epithet tay Dt 


Var (¢) =i lt -T P+ (y- T+ wet (et P| 
k 
1 2 
=% %, 6-7 


12-3-2. Standard Error. The standard deviation of the sampling 
distribution of a statistic is known as its Standard Error, abbreviated as S.E. The 
standard errors of some of the well known statistics, for large samples, are given 
below, where n is the sample size, o” the population variance, and P the 
‘population proportion, and Q = 1 —P, 7, and nz, represent, the sizes of two 
independent random samples respectively drawn from the given population(s). 


Statistic Standard Error 


Sample mean : x on 
VPQin 
Vot2n 
o2V2in 

1-36263 on 


1.25331 o/Vn 


x 


Observed sample proportion ‘p’ 


Sample s.d. : s 


Sample variance : 


Sample quartiles 


Oo vA & WY WD 


Sample median 
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Te: Sample correlation coefficient (r) (1 ~ o2yVn ; 


‘p ‘being the population 
correlation coefficient 


8. | Sample moment p3 o V 96/ n 
9. | Sample moment p14 ot V96/n 
10 Sample coefficient of variation (v) = \ 1+ La 
° S 4- 
V 2n 10 Y 2n 
: as = oi 52? 
11. | Difference of two sample means : ( xy — x2) oa + > 
Diff f le s.d.’s : (§ \ | oP Or 
12. ifference of two sample s.d.’s : (S| — 52) On + 2n, 
13. | Difference of two sample proportions PQ, : P2Q2 
(P1 — p2) ny n2 


Remark on the Utility of Standard Error. S.E. plays a very 
important role in the large sample theory and forms the basis of the testing of 
hypothesis. If ¢ is any statistic, then for large samples 


Z 7s 1) (cf. § 12:9) 
= Z = _t-E() 


= SE. (1) ~ NO, 1), for large samples. 


Thus, if the discrepancy between the observed and the expected 
(hypothetical): value of a statistic is greater than zy (c.f. § 12-7-2) times its S.E., 
the-null hypothesis is rejected at a level of significance. Similarly, if 

lt -EQ) 152, SE. (0), . 
the deviation is not regarded significant at 5% level of significance. In other 
words, the deviation, t — Z(t), could have arisen due to fluctuations of sampling 
and the data do not provide us any evidence against the null hypothesis which 
may; therefore, be accepted at o level of significance. [For details see § 12-7-3] 

(i) The magnitude of the standard error gives an index of the precision of the 
estimate of the parameter. The reciprocal of the standard error is taken as the 
measure of reliability or precision of the statistic. 


SE. (p) = VPQin lef. (4b) § 12-9-1] 
and S.E. (%) =o0Nn [ef. § 12-2] 


In other words, the standard errors of p and X vary inversely as the square root of 
the sample size. Thus in order to double the precision, which amounts to 
reducing the standard error to one half, the sample size has to be increased four 
times. 

(ii) S.E. enables us to determine the probable limits within which the 
population parameter may be expected to lie. For example, the probable limits 
for population proportion P are given by 
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pt 3V pain (cf. Remark § 12-9-1) 

Remark. S.E. of a statistic may be reduced by increasing the sample size 
but this results in corresponding increase in cost, labour and time, etc. 

12-4, Tests of Significance. A very important aspect of the sampling 
theory is the study of the tests of significance, which enable us to decide on the 
basis of the sample results, if 

(Z) the deviation between the observed sample statistic and the hypothetical 
‘parameter value, or 

(z) the deviation between two independent sample statistics; 

is significant or might be attributed to chance or the fluctuations of sampling. 
_ Since, for large n, almost all the distributions, e.g., Binomial, Poisson, 
Negative binomial, Hypergeometric (c.f. Chapter 7), ¢, F (Chapter 14), Chi- 
square (Chapter 13), can be approximated very closely by a normal probability 
curve, we use the Normal Test of Significance (cf. § 12-9) for large samples. 
Some of the well known tests of significance for studying such differences for 
small samples are t-test, F-test and Fisher’s z-transformation. 

12-5. Null Hypothesis. The technique of randomisation used for the 
selection of sample units makes the test of significance valid for us. For 
applying the test of significance we first set up a hypothesis—a definite 
statement about the population parameter. Such a hypothesis, which is usually a 
hypothesis of no difference, is called null hypothesis and is usually denoted by 
Ho. According to Prof. R.A. Fisher, null hypothesis is the hypothesis which is 
tested for possible rejection under the assumption that it is true. 

For example, in case of a single statistic, Ho will be that the sample 
Statistic does not differ significantly from the hypothetical parameter value and in 
the case of two statistics, Ho will be that the sample statistics do not differ 
significantly. 

Having. set up the null hypothesis we ‘compute the probability P that the 
deviation between the observed sample statistic and the hypothetical parameter 
value might have occurred due to fluctuations of sampling (cf. § 12-7). If the 
deviation comes out to be significant (as measured by a test of significance), null 
hypothesis is refuted or rejected at the particular level of significance adopted (c/f. 
§ 12-7) and if the deviation ts not significant, null hypothesis may be retained at 
that level. 

12-5-1. Alternative Hypothesis. Any hypothesis which is comple- 
mentary to the null hypothesis is called an alternative hypothesis, usually 
denoted by H,. For example, if we want to test the null hypothesis that the 
population has a specified mean [o, (say), f.e., Ho: [= Ho, then the alternative 
hypothesis could be 

() Hy, > Ho (Z.e., HW > Mo Or Ht < Mo) 
(2) Hy: > Uo 
(iit) Hy: b< po 
The alternative hypothesis in (7) is known.as.a.two tailed alternative and the 


alternatives in (ii) and (iii) are known as right tailed and left-tailed alternatives 
respectively. The setting of alternative hypothesis is very important since it 
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enables us to decide whether we have to use a single-tailed (right or left) or two- 
tailed test [c.f. § 12-7-1]. 

12:6. Errors in Sampling. The main objective in sampling theory is 
to draw valid inferences about the population parameters on the basis.of the 
sample results. In practice we decide to accept or reject the lot after examining a 
sample from it. As such we are liable to commit the following two types of 
errors : 

Type I Error : Reject Ho when itis true. 
Type II Error : Accept Ho when it is wrong, i.e., accept Ho when H, is true. 

If we write. 

P(Reject Ho when it is true) = P (Reject H)p |Ho} =a 

and P {Accept Ho when it is wrong) = P{ Accept Ho |H,) = 8 } »+(12-2) 
then o and B are called the sizes of type I error and type Il error, respectively. 

In practice, type.I error amounts to rejecting a lot when it is good and type 
Il error may be regarded as accepting the lot when. it is bad. 

Thus P {Reject a lot when it is good) = “| 12:24 

and P{Accept a lot when it is bad} = B vf ) 
where o and B are referred to as Producer's risk and Consumer’s risk, 
respectively. 

12-7. Critical Region and Level of Significance. A region 
(corresponding to a statistic ¢) in the sample space S which amounts to rejection 
of Ho is termed as critical region or region of rejection. If w is the critical 
region and if ¢ = t(X, Xq, ..., X,) iS the value of the statistic based on ‘a random 
sample of size n, then 


P(tewlHy)=a, P(te wlH,)=B8 (12-26) 
where @, the complementary set of @, is called the acceptance region. 


We have @©UG®=S and ONO=6 

The probability ‘a’ that a random value of the statistic t belongs to the 
critical region is known as the level of significance. In other words, level of 
Significance is the size of the type I error (or the maximum producer’s risk). The 
levels of significance usually employed in testing of hypothesis are 5% and 1%. 
The level of significance is always fixed in advance before collecting the sample 
information. 

12°7:1. One tailed and Two:Tailed Tests. In any test, the critical 
region is represented by a portion of the area under the probability curve of the 
sampling distribution of the test statistic. 

A test of any statistical hypothesis where the alternative hypothesis is one 
tailed (right tailed or left tailed) is called a one tailed test. For example, a test for 
testing the mean of a population 


Ho: 1 = Uo 
against the alternative hypothesis : 
Al, : [ft > [to (Right tailed) or A, : [1 < Uo (Left tailed), 
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is a single tailed test: In the right tailed test (H, : 11, > Uo), the critical region 
lies entirely in the right tail of the sampling distribution of x, while for the left 
tail test (H, : [2 <[o), the critical region is entirely in the- left tail of the 
distribution. . 

* A test of statistical hypothesis where the alternative hypothesis is two tailed 
such as : 


Ho : Lb = Uo, against the alternative hypothesis H, : 1. # po, (UL > Lo and pt < plo), 


is known as two tailed test and in such a case the critical region is given by the ° 
portion of the area lying in both the tails of the probability curve of the test . 


Statistic. 

In a particular problem, whether one tailed or two tailed test is‘to be applied 
depends entirely on the nature of the alternative hypothesis. If the alternative 
hypothesis is two-tailed we apply two-tailed test and if alternative hypothesis is 
one-tailed, we apply one tailed test. 

For example, suppose that there are two population brands of bulbs, one 
manufactured by standard process (with mean life ,1,) and the other manufactured 
by some new technique (with mean lifé 12). If we want to test if the bulbs differ 
significantly, then our null hypothesis is Ho: \1,; = U1, and alternative will be 
H, : \y # Uy, thus giving us a two-tailed test. However, if we want to test if the 
bulbs produced by new process have higher average life than those produced by 
standard process, then we have 

Hoi Py =H. and H, 2p, < Uy, 


thus giving us a left-tail test. Similarly, for testing if the product of new ;process 
is inferior to that of standard process, then we have : 

Ho: Wy =H. and Hy: > bh, 
thus giving us a right-tail test.-Thus, the decision about applying 4 two-tail test 
or a single-tail (right or left) test will depend on the problem under study. 

12-7-2. Critical Values or Significant Values. The vatue of test 
statistic which separates the critical (or rejection) region and the acceptance 
region is called the critical value or significant value. It depends upon : 

(i) The level of significance used, and 

(it) The alternative hypothesis, whether it is two-tailed or single-tailed. 

As has been pointed out earlier, for large samples, the standardised variable 
corresponding to the statistic ¢ viz. : 

_ ft — Eft) | * 

Z= SE (t) NO, 1), .(*) 

asymptotically as n — oo, The value of Z given by (*) under the null hypothesis 

is known as test statistic. The critical value of the test Statistic at level of 

Significance a for a two-tailed test is given by zq¢ where z,, is.determined by the 
equaticn 


P(IZ\|>z,) =a .2.(12-2c) 
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i.€., Zq is the value so that the total area of the critical region on both tails is a. 
Since normal probability curve is a symmétrical curve, from (12-2c), we get 


P (Z>2g)+ P(Z<-zq) =a [By symmetry] 
= P(Z >2q)+ P(Z>zq) =a 
= 2P(Z > 2g) =a 
=> P(Z> 2g) = 


i.e., the area of each tail is a/2, Thus zg is the value such jthat area to the right 
of zy is a2 and to the left of ~ zq is a/2, as shown in the following diagram. 


TWO-TAILED TEST 
(Level of Significance ‘a) 


Lower Upper 

critical critical 

value value a 
Rejection Rejection 


region («/z region («j2) 


In case of single-tail alternative, the critical value zg is determined so that 
total area to the right of it (for right-tailed test) is « and for left-tailed test the 
total area.to the left of — zq is a (See diagrams below), i.e., ° 


For Right-tail Test : P(Z>zq)=a »-(12-2d) 
For Left-tail Test : P(Z<-zg)=a ..(12-2e) 
RIGHT-TAILED TEST LEFT-TAILED TEST 
(Level of Signifiance ‘o) (Level of Significance ‘@) 


Rejection 
region 


Rejection 


Z=0 an Le Z=0 


Thus the significant ot critical value of Z for a single-tailed test (left.or 
right) at level of significance ‘at .is same as the critical value of Z for a two- 
tailed test at level of significance ‘20 . 

We give on page 12-10, the critical values of Z at commonly used levels of 
significance for both two-tailed and single-tailed tests. These values have been 
obtained from equations (12-2c), (12-2d) and (12-2e), on using the Normal 
Probability Tables as explained in § 12-8. 
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‘Critical Values Level of significarice (a) 

(2a) ld 5% 10% 
Two-tailed test IZ |= 2-58 \Zn1=196 Ziq |= 1-645 
Right-tailed test Ze = 2:33 Ze = 1-645 Ze = 1-28 
Left-tailed test Zq = -2-33 Zq = -1-645 Zq = -1-28 


Remark. If x is small, then the sampling distribution of the test statistic 
Z will not be normal and in that case we can’t use the above significant values, 
which have been obtained from normal probability curves. In this case, viz., n 
small, (usually less than 30), we use the significant values based on the exact 
sampling distribution of the statistic Z, (defined in (*), § 12-7-2], which tums 
out to be t, F, or x? [see Chapters 13, 14]. These significant values have been 
tabulated for different values of n and & and are given in the Appendix at the end 
of the book. 

12:7-3. Procedure for Testing of Hypothesis. We now summarise 
below the various steps in testing of a statistical hypothesis in a systematic 
manner. 

1. Null Hypothesis. Set up the Null Hypothesis Ho (see § 12-5, page 12-6). 


2. Alternative Hypothesis. Set up the Alternative Hypothesis 7. Ths will 
enable us to decide whether we have to use a single-tailed (right or left) test or 
two-tailed test. 


3. Level of Significance. Choose the appropriate level of significance (a) 
depending on the reliability of the estimates and permissible risk. This is to be 
decided before sample is drawn, i.e., & 1s fixed in advance. 


4. Test Statistic (or Test Criterion). Compute the test statistic 
Fet= E(t) 
S.E.(t) 
under the null hypothesis. 
5. Conclusion. We compare z the computed value of Z in step 4 with the 
significant value (tabulated value) zg, at the given level of significance, ‘a’. 
If | ZI < 2g, ie., if the calculated value of Z (in modulus value) is less than 
Zq We Say it is not significant. By this we mean that the difference ¢ — E(#) is 


just due to fluctuations of sampling and the sample data do not provide us 
sufficient evidence against the null hypothesis which may therefore, be accepted. 


If! Z1> 2zq, 2.é., if the computed value of test statistic is greater than the 


critical or significant value, then we say that it is significant and the null 
hypothesis is rejected at level of significance a i.é., with confidence coefficient 
(1°--q@). 


12-8. Test of Significance for Large Samples. In this section we 
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will discuss the tests of significance when samples are large. We have seen that 
for large valués of n, the number of trials, almost all the distributions, e.g., 
binomial, Poisson, negative binomial, etc., are very closely approximated by 
normal distribution. Thus in this case we apply the normal test, which is based 
upon the following fundamental property (area properiy) of the normal 
probability curve. 

If X ~N (u, 0%), then Z=AS# . A=2O) 1) 

Vvcx) 
Thus from the normal probability tables, we have 
P(-3 < Z <3) = 0-9973, i.e, P UZ <3) = 09973 : 

=> Pd Zi>3)=1-P(IZ1< 3) = 0-0027 ...(12-3) 
oe in all probability we should expect a standard normal variate to lie between 
+ 3. 

Also from the normal probability tables, we get 

P(-1-96 $ Z $ 1-96) = 0-95 i.e., P (IZ 1 < 1-96) = 0-95 


=> P(\Z1> 1-96) = 1 ~ 0-95 = 0-05 ..(12-3a) 
and P (Z| < 2-58) = 0-99 
=> P (\Z1> 2-58) = 0-01 (12-36) 


Thus the significant values of Z at 5% and 1% level of significance for a 
two tailed test are 1-96 and 2-58 respectively. 

Thus the steps to be used in the normal test are as follows : 

({) Compute the test statistic Z under Ho. 
(ii) If1Z1>3, Ho is always rejected. 

(ii) If! Z I< 3, we test its signficance at certain level of significance, 
usually at 5% and sometimes at 1% level of: significance. Thus, for a two-tailed 
test if |Z 1> 1-96, Ho is rejected at 5% level of significance. 

Similarly if | Z| > 2-58, Ho is contradicted at 1% level of significance and if 
IZ1< 2-58, Ho may be accepted at 1% level of significance. 

From the normial probability tables, we have : 

P(Z> 1-645) =05~P QSZ < 1-645) 

= 0-5 -— 0-45 
= 0-05 
P (Z> 2-33) =05-P 0 <Z < 2:33) 
= 0-5 — 0-49 
=0-01 

Hence for a single-tail test (Right-tail or Left-tail)’ we compare the computed 
value of | Zi with 1-645 (at 5% level) and 2-33 (at 1% level) and accept or reject 
Hy accordingly. 

Important Remark. In ‘the theoretical discussion that follows in the 
next sections, the samples under consideration are supposed to be large. For 
practical purposes, sample may be regarded as large if n > 30. 

12:9. Sampling of Attributes. Here we shall consider sampling from 
a population which is divided into two mutually exclusive and collectively 
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exhaustive classes-one class possessing a particular attribute, say A, and the 
other class not possessing that attribute, and then note down the number of 
persons in the sample of size n, possessing that attribute. The presence of an 
attribute in sampled unit may be termed as success and its absence as failure. In 
this case a sample of n observations is identified with that of a series of n 
independent Bernoulli trials with constant probability P of success for each trial. 
Then the probability of x successes in n trials, as given by the binomial 
probability distribution is 
p(x) = °C, P*¥ Q**;x=0,1,2,...,0 
12-9-1. Test for Single Proportion. If X is the number of successes 
in rn independent trials with constant probability P of success for each trial (cf. § 
7-2-1) 
E(X) =nP and V(X)=nPQ, 
where Q = 1 — P, is the probability of failure. 
It has been proved that for large n, the binomial distribution tends to normal 
distribution. Hence for large n, X ~N (nP, nPQ) i.e., 


X —E(X) X—-—nP 
Z= = ~ N(QO, 1 (12-4 
ve) VaPQ = 


and we can apply the normal test. 


Remarks 1. In a sample of size n, let X be the number of persons: 
possessing the given attribute. Then 


Observed proportion of successes = i =p, ae 
Ep) =E (=) = ” B(X) = nP = P 


=> E(p) = P ...(12-4a) 
‘Thus the sample proportion ‘p’ gives an unbiased estimate of the population 
proportion P. 


Also Vip) =v(=)=4 EV) =5npg=*2 


S.E.(p) = — ,..(12-4b) 


Since X and consequently X/n is asymptotically normal for iarae n, the 
normal test for the proportion of successes becomes 


-~D-E(p) _p-P _ 
L= SE. (p) VPon N(O, 1) ..(12-4c) 


2. If we have sampling from a finite population of size N, then 


S.E.(p) = Cae Fe ..(12-4d) 


3. Since the probable limits for a normal variate X are E(X) + 3 . V(X), 
the probable limits for the observed proportion of successes are : 
E(p) +3 SE. (p),ie., P+3 VPQIin : 


If P is not known then taking p (the sample proportion) as an estimate of P, 
the probable limits for the proportion in the population are : 
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p+t3 V pqin ...(12-4e) 
However, the limits for P at level of significance @ are given by : 
pti V pgin : .-.(12-4f) 


where z, is the significant value of Z at level of significance a. 
In particular 95% confidence limits for P are given by : 


p+ 1-96 V pain ; ...(12-4g) 
and 99% confidence limits for P are given -by 
p +2-58 V pqin .(12-4h) 


Example 12:1. A dice is thrown 9,000 times and a throw of 3 or 4 is 
observed 3,240 times. Show that the dice cannot be regarded as an unbiased one 
and find the limts between which the probability of a throw of 3 or 4 lies. 

Solution. If the coming of 3 or 4 is called a, success, then in usual 
notations we are given 

n= 9,000; X = Number of successes = 3,240 
Under the null hypothesis (Ho) that the dice is an unbiased sas we get 


P = Probability of success = — of gettinga30r4=i+ t= + 


Alternative hypothesis, H, : p #5 ,(i.e., dice is biased). 


We have Z=-=2£ ~ Noo, 1), since n is large. 
n 
a 3240 -9000x1/3___240 240 _ 5 4 


“= 19000 x (1/3) x (2/3) 2000 44°73 
Since | Z|! > 3, Ho is rejected and we conclude that the dice is almost 
certainly biased. 
Since dice is not unbiased, P # ; . The nadie limits for “P’ are given 
by: 


P +3 Qin = =p+3Vp qin , 


A 
where P= 12 os6ant5u¢- 1-p= 0-64. 
Hence the probable limits for the population proportion of successes may -be 
taken as 


A “AA : 
P+3V Pom = 0.36 + 3-\/ 236% 0-64 | 0364 3x 


= 0-360 + 0-015 = 0-345 and 0-375. 
Hence the probability of getting 3 or 4 almost certainly lies between 0-345 
and 0-375. 
Example 12:2. A random sample of 500 ‘pineapples was taken from a 
large consignment and 65 were found to be bad. Show that the S.E. of the 
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proportion of bad ones in a sample of this size is 0-015 and deduce that the 
percentage of bad pineapples in the consignment almost certainly lies between 
8&5 and 17-5, 


Solution. Here we are given n = 500 
X =Number of bad pineapples in the sample = 65 


p = Proportion of bad pineapples in the sample = S = 0-13 


q =1-p=0-87 
Since P, the proportion of bad pineapples in the consignment is not 
known, we may take (as in the last example) 


A A 
P =p=0-13, Q=q=0-87 


AA 
SE. of proportion = V POMn = V0-13 x 0-87/500 = 0-015 


Thus, the limits for the proportion of bad pineapples in the consignment are : 


A . AA 

P+3 V PQ/n =0-130 t 3 x 0-015 = 0-130 + 0-045 =.(0-085, 0-175) 

Hence the percentage of bad pineapples in the consignment lies almost 
certainly between 8-5 and 17-5. 


Example 12-3. A random sample of 500 apples was taken from.a large 
consignment and 60 were found to be bad. Obtain the 98% confidence limits for 
4e percentage number of bad apples in the consignment. 


2-33 
[ f $ (t) dt = 0-49 nearly | 


Solution. We have : 
p = Proportion of bad apples in the sample = & = 0-12 
Since the significant value of Z at 98% confidence coefficient (level of 
significance 2%) is given to be 2-33, 98% confidence limits for population 
proportion are : 


p +2-33 Vpqin = 0-12 + 2-33 V0-12 x 0-88/500 


= 0-12 + 2-33 x V0-0002112 = 0-12 + 2:33 x 0-01453 
= 0.12000 + 0-03385 = (0-08615, 0-15385) 
Hencé 98% confidence limits for percentage of bad apples in the 
consignment are (8-61, 15-38). 
Example 12-4. Jn a sample of 1,000 people in Maharashtra, 540 are rice 
eaters and the.rest are wheat eaters. Can we assume that both rice and wheat are 
equally popular in this State at 1% level of significance ? 


Solution. In the usual notations we are given n = 1,000 
X = Number of rice eaters = 540 


p =Sample proportion of rice eaters = 
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Null Hypothesis, Ho : Both rice and wheat are equally popular in the State 
so that 
P = Population proportion of rice eaters in Maharashtra = 0-5 
=> QO =1-P=0-5 
Alternative Hypothesis, H, : P # 0-5 (two-tailed alternative). 
Test Statistic. Under Hp, the test statistic is 


_ Pp 
Z =2=£-N(, 1), (since n is large). 
\ PQin 


Now Z — —0:54 = 0:50 _ _0:04 _ 4535 


V0-5 x 0-5/1000 90138 
Conclusion. The significant or critical value of Z at 1% level of significance 
for two-tailed test is 2-58. Since computed Z = 2-532 is less than 2-58, it is not 
significant at 1% level of significance. Hence the null hypothesis i is accepted and 
we may conclude that rice and wheat are equally popular in Maharashtra State. 
Example 12:5. Twenty people were attacked by a disease and only 18 
survived, Will you reject the hypothesis that the survival rate, if attacked by this 
disease, is 85% in favour of the hypothesis that it is more, at 5% level. (Use 
Large Sample Test.) 
[Paina Univ. B.Sc. (Hons.), 1992; Bombay Univ. B.Sc. 1987) 
Solution. In the usual notations, we are given n = 20. 
X = Number of persons who survived after attack by a disease = 18 
p = Proportion of persons survived in the sample = te = 0-90 
Null Hypothesis, Hg: P = 0-85, i.e., the proportion of persons survived 
after attack by a disease in the lot is 85%. 
Alternative Hypothesis, H, : P > 0-85 (Right-tail alternative). 
Test Statistic. Under Ho, the test statistic is : 


y ae N (0, 1), (since sample is large). 
1 Qin 
Now  Z = 0220. 0'85__ 005 _ A633 


~ 40-85 x 0-15/20 9:079 


Conclusion. Since the alternative hypothesis is one-sided (right-tailed), we 
shall apply right-tailed test for testing significance of Z. The significant value of 
Z at 5% level of significance for right-tail test is + 1-645. Since computed value 
of Z = 0-633 is less than 1-645, it is not significant and we may accept the null 
hypothesis at 5% level of significance. 

12:9-2. Test of Significance for Difference of Proportions. 
Suppose we want to compare two distinct populations with respect to the 
prevalence of a certain attribute, say A, among their members. Let X,, X_ be the 
number of persons possessing the given attribute A in random samples of sizes 
n, and n> from the two populations respectively. Then sample proportions are 
given by 
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Pi =X,/nm, and pz =X>/n. 
If P, and P, are the population proportions, then 
E(p;) =P}, E(p2) = P2 [c.f Equation (12-4a)) 
P P 
and Vin) =H and vipn) = 222 


Since for large samples, p, and p2 are aati normally distributed, 
(Pi — P2) is also normally dstributed. Then the standard variable corresponding to 
the difference (p, — p2) is given by 


(p1 — P2) ~ Ei — P2) 
Z= ~N(O, 1 
VV 1 — P2) ie 
Under the null hypothesis Ho: P, = P2, i.e., there is no significant 
difference between the sample proportions, we have 
E(p, — p2) = E(p;) — E(p2) = P, - P2=0 (Under Ao) 
Also VP, — p2) = V(P;) + V2), 


the covariance term Cov(p;, p2) vanishes, since sample proportions are 


independent. 
P,Q, _ P2Q2 (; 1 ) 
V(p, - =——+——" =P Ser Bs 
(P: ~ 2), ny ty Q i 
since under Ho : P, = P2 = P, (say), and Q; = Q,=@Q. 
Hence under Ho: P; = P2, the test statistic for the difference of proportions 
becomes 


Z =——P1 2 __ ~ no, 1) (12-5) 


1 1 
\ | P (+ + ~) 
2 nh m& 

In general, we do not have any information as to the proportion of A’s in 
the populations from which the samples have been taken. Under Ho : P, = P2 = 
P, (say), an unbiased estimate of the population proportion P, based on both the 
samples is given by 

A + X, +X 
p =P MP2 Ait 2 ...(12-5a) 
Ny + No ny + No 
The Saad iS rales since 


EP) = a El + mpr| ee mn, —— LEQ) + mE@2) | 


Ay : No ay Lei + mPa] =P [--P, =P, =P, under Ho] 


‘Thus (12-5) along with (12-5a) gives the required test statistic. 
Remarks 1. Suppose we want to test the significance of the difference 
between p, and p, where 
_ “ps + Nope) . 
(nm) + no) 


Sampling and Large Sample Tests: 12.17 


gives a pooled estimate of the population proportion on the basis of both the 
samples. We have 
VQ, - p) = Vp,)'+ VP) — 2 Cov (p1, p) -..(*) 
Since p; and p are not independent, Cov (py, p) #0. 
Cov.(p1, p) = El (p: - a {p-—E(p))] 
=E — E(p;)) { — {n,p, + nop2 - E(n,p, + n2p2)) \ | 


ny + no 


Serres: [ (p: — E(p,)) { ny(p; — E(p;)) + m(P2 — E(p2)) \ ] 


ae + No 
ot [ne {n- Eo) J +mE{ (1 -E@)\e2- E02) } | 
oa Ons, [niviy +n Cov (1.72 | 
“ht Vip), [-.- Cov (p1, p2) = 9) 
_—m_ pap 


“ny tng’ nh 7 


Also Var (p) = ————5, E ce + N2P2) - — E(n,p, + np) | 


(n, — 


*Greape Le? Var on +2 Var on |, 


covariance term vanishes = P, and p2 are independent. 
__! [2 2 22 | 
ae) © (ny +n)? [» im "2 


__ 4 
Ny + Ny 


Substituting in (*) and simplifying, we shall get 
Vip, -p)= = fh es |! ee 


, ee! | Ls ae 
ny) +n mere Pq n(n, + no) 


Thus, the test statistic in this case becomes 


Z=o——PuP N11) .(12-5b) 
I. Pg 
(nj + M2) 21 
2. Suppose the population proportions P, and P2 are given to be distinctly 
different, i.e., P, # P2 and we want to test if the difference (P, — P2) in popula- 
tion proportions is likely to be hidden in simple samples of sizes n, and nz 
from the two populations respectively. 
We have seen that in the usual notations, 
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_ 2i- Pp - Ei - pr) _ @i-py-(Pi-P2) _ 
ze SEQ = PD = N(O, 1) 


P,Q, * P2Q2 
ny nN 


Here sample proportions are not given. If we set up the null hypothesis 
Ho: p, = p2, i.e, the samples will not reveal the difference in the population 
proportions or in other words the difference in population proportions is likely 
to be hidden in sampling, the test statistic becomes 


iZi2— Pa! _ Lo, yy (12-50) 
Pri , PoO2 
ny no 
Example 12-6. Random samples of 400 men and 600 women were asked 
whether they would like to have a flyover near their residence. 200 men and 325 
women were in favour of the proposal. Test the hypothesis that proportions of 


men and women in favour of the proposal, are same against that they are not, at 
59% level. [Agra Univ. M.A., 1992] 


Solution. Null Hypothesis Hy: P; =P2=P, (say), i.e., there is no 


significant difference between the opinion of men and women as far as proposal 
of flyover is concerned. 


Alternative Hypothesis, H, : P, # Pz (two-tailed). 

We are given : 
n, = 400, X,; = Number of men favouring the proposal = 200 
nz = 600, X2 = Number of women favouring the proposal = 325 
Pi = Proportion of men favouring the proposal in the sample 


Test Statistic. Since samples are large, the test statistic under the 
Null.Hypothesis, Ho is : 


Zz Pi - P2 ae N (0, 1) 
Sy fae 
2 ny sd = 
A 
en — Pit Mep2 _X1 + Xz _ 200 + 325 | 525 _ 9.695 


nytny ~= m+n,  400+600 1000 


A A 
= Q =1-P =1-0-525 = 0-475 
ae 0-500 ~ 0-541 


1 1 
0-525 x 0-475 x (x0 + =0 ) 
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: ~ 0-041 
0-525 x 0-475 x (10/2,400) 


Yooo1039 ” 0.0823 ~ 


Conclusion. Since} Z |= 1-269 which is less than 1-96, it is not 
significant at 5% level of significance. Hence Hy may be accepte.. at 5% level of 
significance and we may conclude that men and women do not differ significantly 
as regards proposal of flyover is concerned. 

Example 12-7. A company has the head office at Calcutta and a branch at 
Bombay. The personnel director wanted to know if the workers at the two places 
would like the introduction of a new plan of work and a survey was conducted for 
this purpose. Out of a sample of 500 workers at Calcutta, 62% favoured the new 
plan. At Bombay out of a sample of 400 workers, 41% were against the new 
plan. Is there any significant difference between the two groups in their attitude 
towards the new plan at 5% level ?. 

Solution. In the usual notations, we are given : 

n, = 500, p, = 0-62 and 1, = 400, p. = 1 - 0-41 = 0-59 

Null hypothesis, Hy: P; = P2,i.e., there is no significant difference 
between the two groups in their attitude towards the new-plan. 

Alternative hypothesis, H, : P; # Pz (Two-tailed). 

Test Statistic. Under Hp, the test statistic for large samples is : 


— Pi t+ Mp2 _ 500 x 0-62 + 400 x_0-59 _ 
—_ ern 500 + 400 =e) 
A A . 
and Q =1-P =0.-393 
_ 0-62-0-55 
Aj 0-607 x 0-393 x a0 + = rr 
_003 008 po), 


Critical region. At 5% level of significance, the critical value of Z for a. 
two-tailed test is 1-96. Thus the critical region consists of all values ‘of Z ='1;96 
or Z < -1-96. 


Conclusion. Since the calculated value of | Z | = 0-917 is less than the 
critical value of Z (1-96), it is not significant at 5% level of significance. Hence 
the.data do not provide us any evidence against the null hypothesis which may. 
be accepted, and we conclude that there is no significant difference-between the 
two groups in their attitude towards the new plan. 


Example 12-8. Before an increase in excise duty on tea, 80Q persans out 
of a sample of 1,000 persons were found to be tea drinkers. After an‘increase {+ 
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duty, 800 people were tea drinkers in a sample of 1,200 people. Using standard 
error of proportion, state whether there is a significant decrease in the 
consumption of tea after the increase in excise duty ? 
Solution. In the usual notations, we havé n, = 1,000 ; n2 = 1,200 
p, = Sample proportion of tea drinkers before increase in excise duty 


800_ 
= 7000 0-80 
Pp, = Sample proportion of tea drinkers after increase in excise duty 
800__ 
= 1200 = 07 


Null Hypothesis, Ho: P, = P2, i.e., there is no significant difference in the 
consumption of tea before and after the increase in excise duty. 

Alternative Hypothesis, H, : P, > P2 (Right-tailed alternative). 

Test Statistic. Under the null hypothesis, the test statistic is 


Z =—A 2 ~ ng, 1) (Since samples are large) 
AA 1 de 
PQ ny . >) 
where 
A A A 
_ MP + Mp2 800+ 800 16 7 4 6 
Pen, tn, 1000 + 12007 22° 4 P= 1-P =, 
- 0-80 — 0-67 
16. 6 phi, cs 
92 * ax™ € - a) 
= OE = OS = 0842 
22 22 6000 


Conclusion. Since Z s much greater than 1-645 as well as 2-33 (since test 

is one-tailed), it is highly significant at both 5% and 1% levels of significance. 
Hence, 
we reject the null hypothesis Hy and conclude that there is a significant decease 
in the consumption of tea after increase in the excise duty. 
Example 12-9. A cigarette manufacturing firm claims that its brand A of the 
cigarettes outsells its brand B by 8%. if it is found that 42 out of a sample of 
200 smokers prefer brand A and 18 out of another random sample of 100 
smokers prefer brand B, test whether the 8% difference is a valid claim. (Use 
5% level of significance.) 

Solution. We are given.: 


X, 42 
X2_ 18 
nz = 100, X> = 18 => Pr= = 799 = O18 


_ We set up the Null Hypothesis that 8% difference in the sale of two brands 
of cigarettes is a valid claim, i.e., Ho : P, — Pz = 0-08. 
Alternative Hypothesis : H, : P, - Pz # 0-08 (Two-tailed). 
U'nder Hp, the test statistic is (Since samples are large) 
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Z os ~ N(O, 1) 


Beg gk 
PO ars 
5 X+X, 42018 2 : 
where P n+ A ~ 200 + 100 ~ 300 0:20 => Q=1-P =0-80 
ee _ -005 

jo wok ~ 40-16 x0.015 

_ ms (00 + io) 
_ = 0-05_ = - 0-05 1.02 


0.0024 0-04899 ~ 


Since | Z!1= 1-02 < 1-96, it is not significant at 5% level of significance. 
Hence null hypothesis may be retained at 5% level of significance and we may 
conclude that a difference of 8% in the sale of two brands of cigarettes is a valid 
claim by the firm. 


Example 12-10. On the basis of their total scores, 200 candidates of a 
civil service examination are divided into two groups, the upper 30 per cent and 
the remaining 70 per cent. Consider the first question of this examination. 
Among the first group, 40 had the correct answer, whereas among the second 
group, 80 had the correct answer. On the basis of these results, can one conclude 
that the first question ts no good at discriminating ability cj the type being 
examined here ? 


Solution. Here, we have 
n = Total number of candidates = 200 
= number of candidates in the upper 30% group 


=F Tan X 200 = 60 
nz = — number of candidates in the remaining 70% group 
=7 sna * 200 = 140 


X, = The number of candidates, with correct answer in the first group = 40 
X2 = The number of candidates, with correct answer in the second group = 80 
Xx _ 40 _ _* 2 80 
= 60 ~ 


ull Hypothesis, Ho : There is no rill difference in the sample 
ime 1.é., P; = P2,1.e., the first question is no good at dicriminating-the 
ability of the type being examined here. 
Alternative Hypothesis, H, : P, # P2. 
Test Statistic. Under Ho the test statistic is : 


4 . ea ° . 
Z2— rr N(O, 1) (since samples arc large). 


12-22 Fundamentals of Mathematical Statistics 


where 
A xX,+X 40 + 80 x A 
ria! SLY Seep ee ey, =-]-P=(0. 
= oi. 608 140° 
0-6666 — 0-5714 _ 0-0953 
~ 0.0756 


1 } 
0-6 x 0-4 (5+ 40) 


Conclusion. Since | Z | < 1-96, the data are consistent with the null 
hypotliiesis at 5% level of significance. Hence we conclude that the first question 
is not good enough to distinguish between the ability of the two groups of 
candidates. 

Example 12-11. In a year there are 956 births in a town A, of which 
52-5% were males, while in towns A and B combined, this proportion in a total 
of 1,406 births was 0-496. Is there any significant difference in the proportion of 
male births in the two towns ? 

Solution. We are given 

n= 956, ny+no= 1,406 or Nn = 1,406 - 956 = 450 
P, = Proportion of males in the sample of town A = 0-525. 

‘Let pz be the proportion of males in the sample (of size nz) of town B. 

Then 


Z= = 1-258 


A 
P = Proportion of males in both the samples combined. 


— = 0-496 (Given) 
956 x 0-525 + 450 x p2 
1406. = 0-496 
=> Pz = 0-434 (On simplification) 
Null Hypothesis, Hy: P; = P2, t.e., there is no significant difference in the 
proportion of male births in the two towns A and B. 
Alternative Hypothesis, H, : P, # Pz (two-tailed). 
Test Statistic. Under Ho, the test statistic is : 


Z= —S—- N(O, 1) (Since samples are large) 


P Q a+: + i 
A 
where 5 _ MPI MP2 _o96 6=1-P =0-504 
- ny + No 
0-525 — 0-434 0-091 


2 = _ = = 3.368 
] 
0-496 x 0-504 (555 + 755) 
Conclusion. Since | Z | > 3, the null hypothesis is rejected, i.e., the data 
are inconsistent with the hypothesis P,; = P2 and we conclude that there is 
significant difference in the proportion of male births in the towns A and 5. 
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Example 12-12. In two large populations, there are 30 and 25 per cent 
respectively of blue-eyed people. Is this difference likely to be hidden in samples 
of 1,200 and 900 respectively from the two populations ? 

[Delhi Univ. B.Sc., 1992) 

Solution. Here, we are given n, = 1200, nz = 900. 

P, = Proportion of blue-eyed people in the first population 

= 30% = 0:30. 
P, = Proportion of blue-eyed people in the second population 
= 25% = 0-25. 
; Q,=1-P, =0-70 and Q.=1-P,=0-75 

We set up the null hypothesis Ho that p, = p2,i.e., the sample proportions 
are equal, i.e., the difference in population proportions is likely to be hidden in 
sampling. 

Test Statistic. Under Ho: p, = p2, the test statistic is : 


1P,-P 
IZ l= se ~ N(O, 1) (Since samples are large.) 
P,Q, A P2Q2 
ny No 
IZ\= 0:30 — 0-25 = 005 = 2:56 


Conclusion. Since | Z | > 1-96, the null hypothesis (p, = p2), is refuted at 
5% level of significance and we conclude ‘that the difference in population 
proportions is unlikely to be hidden in sampling. In other words, these samples 
will reveal the difference in the population proportions. 

Example 12:13. In a random sample of 400 students of the university 
teaching departments, it was found that 300 students failed in the examination. 
In another random sample of 500 students of the affiliated colleges, the number 
of failures in the same examination was found to be 300. Find out whéther the 
proportion of failures in the university teaching departments is significantly 
greater than the proportion of failures in the university teaching departments and 
affiliated colleges taken together. 

Solution. Here we are given : n, = 400, Ne = = 500 


300 300 
Pi = 499 = 9°75, P2= 500 = 060 
q,=1 -p\ = 1— 0-75 = 0-25 and q2 =1 -p2= 0-40 
Here we set up the null hypothesis Ho that p, and p, where p is the pooled 
estate, i.e., proportion of failures in the university teaching departments and 
affiliated colleges taken together, do not differ significantly. 


AA 
SE. of (p—p,) = \) ae x - [cf. (1256) page 12-18] 


fe +n2p2 _ 400 x 0-75 + 500 x 0-60 


ntn, 400 + 500 vo 


12.24 Fundamentals of Mathematical Statistics 


q= 1-067 = 0-33 


. | 0:67 x 0-33 500° 
te SE. of ( —p;) = "400 +500 ~ 400 = 0-018 


Test Statistic. Under the null hypothesis Hp, the test statistic is : 


A 


-—#F1 sd neg 1 Since samples are large 
S.E. of (p - p1) ( 9 ) ( Pp i ) 


0-67 - 0-33 0-15 
Z=""0018 -00187 °° 
_, Conclusion. Since the calculated value of Z is much greater than 3, it is 
highly signifcant. Hence null hypothesis Ho is rejected and we conclude that 
there is significant difference between p, and D. 


Example 12-14. If for one-half of n events, the chance of success is p 
and the chance of failure is q, while for the other half the chance of success is q 
and the chance of failure is p, show that the standard deviation of the number of 
successes is the same as if the chance of successes were p in all the cases, i.e., 
‘ npq but.that the mean of the number of successes is n[2 and not np. 

Solution. Let X, and X2 denote the number of successes in the first half 
and the second half of n events respectively. Then according to the given 
conditions, we have 


E(X\)=5p | EX) =54 
d 


n sa n 
V(X1) = 5 PF V(X2) = 5 Pa 
The mean and variance of the number of ee ‘ all the n events are 
given by E(X, + X2) = E(X)) + E(X2) “2 + - 5 


and V(X) +X2)=VKy) + (Kr) =F pq +5 qp= npg. 
since the first and second half of events are aes 

Hence the variance is the same as if the probability of success in all the n 
events is p. 

EXERCISE 12(a) 

1, (a) There are 2 populations and P, and P, are thé proportion of members 
in the two populations belonging to ‘low-income’ group. It is disired to test the 
hypothesis Ho : P, = Pz. Explain clearly, the procedure that you would follow to 
carry out the above test at 5% level of significance. 

State the theorem on which the above test is based. 

In.respect of the above 2 populations, if it is claimed that P,, the proportion 
of ‘low-incomé’ group in the first population.is greater than.P2, how will you 
modify the procedure to test this claim (at 5% level) ? 


(b) Take a concrete illustration and in relation to this illustration, explain 
the following terms :— 
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(i) Null hypothesis and alternative hypothesis. 

(ii) Type I and Type I errors. 

(iii) Critical Region. 

(c) Suggest a possible source of bias in the following : 

(i) The mean income per family in a certain town is sought to be estimated 
by sampling from motor owners. 

(ti) Readers of newspapers are sampled by printing in it an invitation to 
them to send up their observations on some typical event. 

(iii) A barrel of apples is sampled by taking a handful from the top. 

(iv) A set of digits is taken by opening a telephone directory at random and 
choosing the telephone numbers in the order in which they appear on the page. 

2. (a) Explain clearly the terms “Standard Error” and “Sampling 
Distribution.” Show that in a series of nm independent, trials with constant 
probability p of success, the standard error of the proportion of successes is 


V pqin, where q=1-p. 

(b) n individuals fall into one or the other.two categories with probabilities 
p and g (=1 ~ p), the number in the two’categories are x, and x, (x; + x2 = 7). 
Show that covariance between x, and x2 is — npg. Hence obtain the variance of 


the difference (2 - 2), between the proportions. 


(c) Explain clearly the procedure generally followed in testing of a 
hypothesis, Point out the difference between one-tail and two-tail tests. 

(2) What do you mean by interval estimation and how would you set up the 
confidence limits for a parameter from a sample ? Give the formula for 95% 
confidence limits for mean and proportion. What modifcations do you have to 
make if the sampling is done from finite population, (i) without replacement, 
(ii) with replacement ? {Calcutta Univ. B.A. (Maths Hons.), 1988] 


3. P; and P2 are the (unknown) proportions of students wearing glasses in 
two universities A and B. To compare P, and P2, samples of sizes nm, and nz are 
taken from the two populations and the number of students wearing glasses is 
found to be x, and x, respectively. Suggest an unbiased estimate of (P, — Pz) and 
obtain its sampling distribution when n, and nz are large. Hence explain how to 
test the hypothesis that P; = Po. 

4. (a) A coin is tossed 10,000 times and it turns up head 5,195 times. 
Discuss whether the coin may be regarded as unbiased one, explaining briefly the 
theoretical principles you would use for this purpose. (Ans. No.) 

(6) A biased coin was thrown 400 times:and head resulted 240 times. Find 
the standard error of the: observed proportion of heads and deduce that the 
probability of, getting a head in a single throw of the coin lies almost certainly 
between 0-53 and 0-67. (Ans. 0-02445). 

(c) Experience has shown that 20% of a manufactured product is of the top 
quality. In one day’s production of 400 articles only 50 are of top quality. Show 
that either the production of the day taken was not a representative sample or the 
hypothesis of 20% was wrong. (Ans. Z = 3- 75) 
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5. (a2) In ailarge consignment of oranges a random sample of 64 oranges 
revealed that 14 oranges were bad. Is it reasonable to assume that 20% of the 
Oranges were bad ? 

(b) By a mobile court checking in certain buses it was found that out of 
1000 people checked on a certain day at Red Fort, 10 persons were found to be 
tickétless travellers. If daily 1 lakh passengers travel by the buses, find out the 
estimated limits to the ticketless travellers. (Ans. 997 to 1003) 

(c) In a random sample of 81 items taken from a large consignment some 
were found to be defective. If the standard error of the proportion of defective 
items in the sample is 1/18, find 95% confidence limits of the percentage of 
defective items in the consignment. 

[Madras Univ. B.Sc. (Stat. Main), 1991] 

6. (2) In some dice throwing experiments Weldon threw dice 75,145 times 


and of these 49,152 yielded a 4, 5 or 6. Is this consistent with the hypothesis 
that the dice was unbiased ? 


Hint. Ho : Dice is unbiased, i.e., P = 2=4 = 0-5; Hy: P zi 
_p=P____0-654-0-5____ 0-154 


NPQin N05 x 0-5775145 90018 
Ans. No. 


(b) 1,000 apples are taken from a large consignment and 100 are found to be 
bad. Estimate the percentage of bad apples in the consignment and assign the 
limits within which the percentage lies. 

7. (a) A personnel manager claims that 80 per cent of: all single women 
hired for secretarial job get married and quit work within two years after they are 
hired. Test this hypothesis at 5% level of significance if among 200 such 
elias 112 got married within two years after they were hited and quit their 
jobs. 

(6) A manufacturer claimed that at least 98% of the steel pipes which he 
supplied to a factory conformed to specifications. An exarhination of a sample of 
500 pieces of pipes revealed that 30 were .defective. Test this claim at a 
significance level of (i) 0-05, (ii) 0-01. 

Hint. X =No. of pipes conforming to specifications in the sample. 

= 500 - 30 = 470 
p =Sample proportion of pipes conforming to specifications 
= 2 = 0.94 

Hy: P = 0-98, i.e., the proportion of pipes conforming to specifications in 
the lot is 98%... 

H, : P < 0-98 (Left-tail alternative) 

Test Statistic. oe Jet eee a 


reo 410-98 x 0-62/500 0-98 x 0-02/500 


(c) A social worker believes that fewer than 25% of the couples in a certain 
area ever uSed any form of birth control. A random sample of 120 couples was 


Test Statistic. Under Ho, Z 
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contacted. Twenty of them said they had used some method of birth control. 
Comment on the social worker’s belief. 


Hy: P = 0-25, H, : P < 0-25 (left-Tailed) 


8. In a random sample of 800 adults from the population of a certain large 
city, 600 are found to have dark hair. In a random sample of 1,000 adults from 
the habitants of another large city, 700 are dark haired. Show that the difference 
of the proportion of dark haired people is nearly 2-4 times.the standard error of 
the difference for samples of above sizes, 

9. (a) In a random sample of 100 men taken from village A, 60 were found 
to be consuming alcohol. In another sample of 200 men taken form village B, 
100 were found to be consuming alcohol. Do the two villages differ 
significantly in respect of the proportion of men who consume alcohol ? 

[Delhi Univ. M.A. (Business Eco.), 1987] 

(b) In a random sample of 500 men from a particular district of U.P., 300 
are found to be smokers. In one of 1,000 men from another district, 550 are 
smokers. Do the data indicate that the two districts are significantly different 
with respect to the prevalence of smoking among men ? 

Ans. Z = 1-85, (not significant). (Delhi Univ. B.Sc., 1991) 


10. A company is considering two different television advertisements for 
promotion of a new product. Management believed that the advertisement A is 
more effective than advertisement B. Two test market areas with virtually 
identical consumer characteristics are selected; A is used in one area and B in 
other area. In a random sample of 60 customers who saw A, 18 tried the 
product. In another random sample of 100 customers who saw 8, 22 tried the 
product. Does this indicate that advertisement A is more effective than 
advertisement B, if a 5% level of significance is used ? Given critical value at 
5% level is 1-96 and at 10% level of significance is 1-645. 

[Dethi Univ. M.C.A., 1990] 


11. (a) 1,000 apples kept under one type of storage were found to show 
rotting to the extent of 4%. 1,500. apples kept under another kin1 of storage 
showed 3% rotting. Can it be reasonably concluded that the second type of 
storage is superior to the first ? 


(b) In a referendum submitted to the students body at a university, 850 men 
and 566 women voted. 530 of the men and 304 of the women voted yes. Does 
this indicate a\significant difference of opinion on the matter at 1% level, 
between men and women students. [Ans, Z = 3-2, (significant).] 


(c) In a simple ‘sample of 600 high school students from a State, 400 are 
found to use dot pens. In one of 900 from a neighbouring State, 450 are found 
to use dot pens. Do the data indicate that the States-are significantly different 
with respect to the habit of using dot pens among the students ? (Ans. Yes.) 

12. (a) A firm, manufacturing dresses for-children, sént out advertisement 
through mail. Two groups of 1,000 each were contacted; the first group having 
been contacted in white covers while the second in blue covers. 20% from the 
first while 28% from the second replied. 


Do you think that blue envelopes help the sales ? 
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(b) A machine puts out 16 imperfect articles in a sample of 500. After 
machine is overhauled, it puts out 3 imperfect articles in a batch of 100. Has the 
machine improved ? 

Hint. We are given : n, = 500, and n2 = 100 

Pi -70 = 0-032; p2 =700 

Null Hypothesis, Ho: P; = P, i.e., there is no-significant difference in the 
machine before overhauling and after overhauling, In other words, the machine 
has not improved after overhauling. 

Alternative Hypothesis, H,:P,<P, or P,>P»2 

P — Pit nepe  _16+3 19 _ 9935 
" Ay tng 500 +100 600 


SE. (p1 — ps) =o 032 x 0-968 (<i + zip) = 00193 


0-032 - 0-030 _ 0-002 _ 
Z ="~"0.0193. 0.01937 1 


Since Z < 1-645 (Right-tailed test), it is not significant at 5% level of 
significance. 


= 0-030 


(c) In a large city A, 25% of a random sample of 900 school boys had: 


defective eye-sight. In another large city B, 15-5% of a random sample of 1,600 
school boys had the same defect. Is.this difference between the two proportions 
significant ? (Ans. Not significant.) 


13. (a) A candidate for election made a speech in city A but not in B.A 
sample of 500 voters from city A showed that 59-6% of the voters were in 
favour of him, whereas a sample of 300: voters from city B showed that 50% of 
the voters favoured him. Discuss whether his speech could produce any effect on 
voters in city A. Use 5% level. 

Ans. | Z| = 2-67. Yes. 


(b) In a large city, 16 out of a random sample of 500 men were found to be 
drinkers. After the heavy increase in tax on intoxicants another. random sample 
of 100 men in the same city included 3 drinkers. Was the observed decrease in 
the proportion of drinkers significant after tax increase ? 

Ans. Ho: P,; = Pz, H,: P,>P2;Z = 1-04. Not sigificant. 

14, The sex ratio at birth is sometimes given by the ratio of male to 
female births instead of the proportion of male to total births. If z is the ratio, 


i.e., 2 = piq, show that the standard error of z is approximately " , zi (2) 


n being large, so that ceviations are small compared with mean. 

12:10. Sampling of Variables. In the present section we will 
discuss in detail the sampling of variables such as height, weight, age, income, 
etc. In the.case of sampling of variables‘cach member of the population provides 
the value of the variable and the aggregate of.these values forms the frequency 
distribution of the population. From the population, a random sample of size n 


e 
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can be drawn by any of the sampling methods discussed before which is same as 
choosing n values of the given variable from the distribution. 


12-11. Unbiased Estimate for population Mean (1) and 
Variance (G2). Let x), X2, .... X, be a random sample of size n from a large 
population X,, Xz, ... Xy (of size N) with mean 1 and variance o?. Then the 


sample mean ( x ) and variance (s) are given by 
z= x x; and gat x (x;-x )? 
ah : 


Nijgez 


Now ez)=e(; = oe E(x) 
Since x; is a sample observation from the population X;, (i = 1, 2, ..., N) it 
can take any one of the — X},X2, ..., Xy each with equal probability 1/N. 


Ej) = iXi+ 5X, + .. + oxy 


=a Ki +Xo tu. XW EM wl) 
E(x) = (a) = ny => E(x)=u ...(12-6) 


Thus the sample mean (x ) is an unbiased estimate of the population mean 


(j1). 
Now E(s*) zeft 5 (x; -x]=E (2 : xj? -x2] 


=) z E(x,;7) —E(x)* ..0(2) 


We have V(x) = Elx;-E@)]* = EG; - 1)’, [From (1)] 


1 
=a (1-1)? + (X2-H)? + ... + Xv—-M)?] = 0? ...G) 
Also we know that 


V(x) =EG2) - [EQ = E(x) = V(x) + (E(x)? (4) 
In part.cular 
Etx?) = V(x) + {E(x)}? = 07 + 2 .»(5) 


Also from (4), E(x 2) = V(x) + (E(x)}? 
o? 
But V(x) ==, where 07 is the population variance. <[cf. § 12-13] 


E(x#) = e + 2 [Using (12-6)] ...(Sa) 
Substituting from (5) and (5a) in (2) we get 
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o? ..(12-7) 


Since E(s?) #02, sample variance is not an unbiased estimate of 
population variance. 


From (12-7), we get 
f .E(s2)=02 > E ( ns’ ) =o? 


n-1 n-1 


a 


isl 


> e| 1 Y «@, -2)|=0 i.e., E(S?)= 0? -.-(12-8) 


where $2 = 


; 1; z (x; -x )? ...(12-8a) 


. S? is an unbiased estimate of the population variance o?. 
Aliter for E(s*). 


eat 5 (x;- x 2 ]=4 2 { (x;-H) -@-w}| 

=} 2, (x; - HW)? + n(x -p)? -2@ - p) %, a0) | 
But % (x; - pW) = DX; — np = nx — np = n(x - pp) 
Re “11s (ai wn wet & (x; ~ HW)? - (x ~B)? 
iz 1 Ninl 
6%) =) 5 Bx, -w) - BG - py 
- ot ES Bly; BG))?~ EF -E(3)) 
I. ie weeee 1 

te V(x) = VE) = ( 1- ry )o 


Remarks 1. Here we see that although sample mean is an unbiased 
estimate Of population mean, sample variance is not an’ unbiased estimate of 
population variance. However, an unbiased estimate of of o? is given by S?, 
given in equation (12-82). 
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S? plays a very important role in sampling theory, particularly in small 
sampling theory. Whenever o? is not known, its estimate S? given by (12-82) is 
used for practical purposes. 


We haveste= S Gas) and 2 — > G@azy 
Nijwl n-1;5, 
= ns? = (n — 1)S? » gr= (1 - ) S? 


Hence for large samples i.e., forn — ©, we have s? — §?. In other 
words, for large samples (!.e., n —> o), we may take 


A 
o* = 57 ...(12-8b) 
12-12. Standard Error of Sample Mean. The variance of the 
sample mean is o?/n, where ois the population standard deviation and n is the 
size of the random sample. 


The S.E. of mean of a random sample of size n from a population with 
variance 0? is oIV n. 
Proof, Let x;, (i = 1, 2, ..., 2) be a random sample of size n from a 


population with variance o?, then the sample mean x is given by 


x ad ny txt ty 
we 1 1 
V(x) =v Pac +Xo+ 2... + x) |= Avie +224 we x) 


= [Ven + V(ix2) + ... + vee) | 


the covariance terms vanish since the sample observations are independent, [c.f 
Remark (ii) § 6-6] 
But V(x) = 02, (i= 1, 2, ..., 2) [From (3) of § 12-11] 


(2) =4(n09 == 


= S.E.( x) -VE-% (12:9) 
n 


12-13. Test of Significance for Single Mean. We have proved 
that if x;,(@@ = 1, 2, ..., 2) is a random sample of size n from a normal 
population with mean j1 and variance 67, then the sample mean is distributed 
normally with mean yt and variance o2/n, i.e., x ~ N(1, o2/n). However, this 
result holds, i.e., x ~ N(t, 07/n), even in random sampling from non-normal 


population provided the sample size n is large [c.f. Central Limit Theorem, 
§ 8-10). 


Thus for large samples, the standard normal variate corresponding to x is : 
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gata 
oN 


Under the null hypothesis, Hp that the sample has been drawn from a 
population with mean wt and variance 2, i.¢., there is no significant difference 
between the sample mean (x ) and population mean ({1), the test statistic (for 
large samples), ts : 


_x —U 
.. (12-9a) 
~ oNn 


Remarks 1. If the population s.d. 6 1s unknown then we usc its estimate 
provided by the sample variance given by [Sec (12-84)]: 
A A 
o*=s? = O=s (for large samples). 
2. Confidence limits for 1. 95% confidence interval for Ut is given by : 
x —-Up 


IZ1< 1-96, ie., < 1-96 


an ¥-1-960/Vn <p < ¥41-960/Vn 12-10) 


and x + 1 96G/V n are known as 95% confidence limits for tt. Similarly, 99% 
confidence limits for Ware x + 2:580/Vn and 98% confidence limits for Ll are 
¥ +2330/Vn. 


However, in sampling from a finite population of size N, the 
corresponding 95% and 99% confidence limits for LL are respectively 


F196 Se es p and F258 V4 ...(12-10a) 


3. The confidence limits for any parameter (P, H, cic.) are also known as 
its fiducial limits. 


Example 12°15. A sample of 900 members has a mean 3-4 cms,. and s.d. 
2-61 cms. Is the sample from a large population of mean 3-25 cms. and s.d. 2-61 
cms, ? 


[f the population ts normal and its mean is unknown, find the 95% and 98% 
fiducial limits of true mean. 


Solution. Null ltypothesis, (Hy): The sample has been drawn from the 
population with mean p = 3-25 cms., and §.D. 6 = 2-61 cms. 


Alternative Hypothesis, H, : is # 3-25 (Two-tailed). 
Test Statistic. Under Hp, the test statistic ts : 


Be eee ae N(O, 1), (since n is large) 
oN n 
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Here, we are given 


x = 3-4 cms., n = 900 cms., ut = 3-25 cms. and o = 2-61 cms. 
_ 3-40 - 3.25 0-15 x 30 


Since | Z 1 < 1-96, we conclude that the data don’t provide us any evidence 
against the null hypothesis (Ho) which may, therefore, be accepted at 5% level of 
significance. 

99% fiducial limits for the population mean are : 


X£1.96 0/Vn => 3-40 + 1-96 x 2-61/¥900 
=> 3-40 + 0-1705, i.e., 3-5705 and 3-2295 
98% fiducial limits for o are given by : 


¥ +233 -% = ie, 340 + 2:33 x =Ot 

=> 3-40 + 0-2027 i.e., 3-6027 and 3-1973 

Remark. 2-33 is the value z,; of Z from standard normal probability 
integrals, such that P(IZt>z)=0-98 = P(Z>2z;)=0-49. 

Example 12°16. An insurance agent has claimed that the average age of 
policyholders who insure through him is less than the average for all agents, 
which is 30-5 years. 

A random sample of 100 policyholders -who had insured through him gave 
the following age distribution : 


Age last birthday No. of persons 
16—20 12 
21—25 22 
26—30 20 
31—35 30 
36—40 16 


Calculate the arithmetic mean and standard deviation of this distribution and 
use these values to test his claim at the 5% level of significance. You are given 
that Z (1-645) = 0-95. 

Solution. Null Hypothesis, Ho : 1 = 30-5 years, i.e., the sample mean 
( x ) and population mean (j1) do not differ significantly. 

Alternative Hypothesis, H, :1< 30-5 years (Left-tailed alternative). 


CALCULATIONS FOR aes AND S.D. 


Age last 
birthday 
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5 x 16 164 16 \2_ 
21S 28-8 years s=5x\] Pe oD) = 6.35 years 


Since the sample is large, G ~ 5 = 6-35 years. 
Test Statistic. Under Ho, the test statistic is 


x = 28 + 


Z= = ~ N(O, 1), (since sample is large). 
N 


S“*in 
_ 28-8 - 30-5 _ =1-7. 
Now = -2:-681 
635/100 _ 0°635 


Conclusion. Since computed value of Z = -—2-681 < ~1-645_ or 
|Z | = 2-681 > 1-645, it is significant at 5% level of significance. Hence we 
reject the null hypothesis Hy (Accept 47;) at 5% level of significance and 
conclude that the insurance agent’s claim that the average age of policyholders 
who insure through him is less than the average for all agents, is valid. 

Example 12-17. As an application of Central Limit Theorem, show that 


if E is such that P (\X - Wl<E)> 0-95, then the minimum sample size n is 


.96)2 G2 
given by n= ae ae where ttand o* are the mean and variance respectively 
of the population and X is the mean of the random sample. __ 
Solution. By Central Limit Theorem, we know that X ~ N(u, o2/n) 
asymptotically i.e., for large n. 
X- 
z =-—+t 
o/\n 
From normal probability tables, we have 
P(1Z1\¢ 1-96) = 0-95 


~ N(O, 1), asymptotically i.e., for large n. 


= | A=} < 1.96|=095 
on 
— ve] 
=> Pp 1X - wt s 1-96 -F-|=095 ..(*) 
Ya 
We are given that 
P [IX -pl<E] > 0-95 us) 
From (*) and (**), we have 
066 .96)2. G2 3.84¢2 
E> L965 m n > 2:28} o - 28s 


n 
Hence minimum sample size 7 for estimating u with 95% confidence 
coefficient is given by n = 3-84 o7/E?, where E is the permissible error. 
» Remark. The minimum sample size for estimating 1. with confidence 
coefficient (1 — a) is given by 02z,2/E”, where z, is the significant value of Z 
at level of significance o and E is the permissible error in the estimate. 
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Arguing similarly, the minimum sample size for estimating population 
proportion P with confidence coefficient (1 - a) is given by n = PQ 2,2/E?, 
where 2, is the significant value of Z at ‘a’ level of significance and E is the 

A 


permissible error in the estimate. If P is unknown, we may use P = p. 
Example 12-18. The mean muscular endurance score of a random sample 
of 60 subjects was found to be 145 with a s.d. of 40. Construct a 95% 
confidence interval for the true mean. Assume the sample size to be large 
enough for normal approximation. What size of sample is required to estimate 
the mean within 5 of the true mean with a 95% confidence ? 
[Calicut Univ. B.Sc. (Main Stat.) 1989] 


Solution. We are given : n = 60, x = 145 and s = 40. 
95% confidence limits for true mean (jt) are : 


X + 1-96 siVin (o? = s2, since sample is large) 
1-96 x 40 78-4 ; 
‘ae pa A aE Ben tes. See + 10.19 = f - 


Hence 95% confidence interval for 1 is (134-88, 155-12). In the notations 
of Example 12-17, we have 


/ - (22 2)". (38 x 40)? 
eee lr ae ae 5 
[-» 295 = 1:96, 6=5=40 andIX—-p1<5=E] 


= (15-68)? = 245-86 = 246. 


. Example 12-19. The standard deviation of a population is 2-70 inches. 
Find the probability that in a random sample of size 66 (i) the sample mean will 
differ from the population mean by 0-75 inch or more and (ii) the sample mean 
will exceed the population mean-by 0-75 inch or more (given that the value of 
the standard normal probability integral from 0 to 2-25 is 0-4877). 


Solution. Here we are given n =66, o = 2-70 inches. Since n is large, 
the sample mean x ~ N(1, o2/n). 
Z =2=—H~ NO, 1) .(*) 
oN n 
We want 


(i) P{ Ix —p| 20-75] =1-P[ 1x —p | < 0-75] 
=1-P| | 2 < 0-75] {From (*)] 


-1-P[1z1<0.75 1 | 


-1-2?| 0<z<0.75 | 
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166 | 
=1-2P| 0<z< 0-75 x 5 
0-75 <B-134 | 
2:70 
= 1-2 P[0<Z <2.25] = 1-2 x 0-4877 = 0-0246 
(ii) P(X -p > 0-75) = = P(Z > 0:75 Vn/o) = P(Z > 2:25) 
= 0-5 ~P(0 < Z < 2-25) = 0-5 — 0-4877 = 0-0123 


Example 12-20. A normal population has a mean of 0-1 and standard 
deviation of 2-1. Find the probability that mean of a sampie of size 900 will be 
negative. [Delhi Univ. B.Sc. (Stat. Hons.), 1986] 


Solution. Here we are given that X ~ N(t, 0%), where pp = 0-1 and 
6 = 2-1 and n = 900. 
Since X ~ N(1, 6”), the sample mean x ~ N(1, 62/n). The standard normal 
variate corresponding to x is given by: 
7 at att _x-0-1 x-0-1 
pr 2:1/30 ~ 0-07 
x =0-1+0-07Z, where Z ~ N(O, 1) 
The required probability p, that the sample mean is negative is given by : 
p =P(x <0)=P(0-1 + 0-07 Z:< 0) 
=P ( oe oe = P (Z <-1-43) = P(Z > 1-43) 
= 0-5 -PO0<Z < 143) = 0-5 —0-4236 = 0:0764 
(From Normal Probability Tables). 
Example 12-21. The guaranteed average life of a certain type of electric 
light bulbs is 1000 hours with a standard deviation of 125 hours. It is decided to 
sample the output so as to ensure that 90 per cent of the bulbs do not fall short 


of the guaranteed average by more than 2-5 per cent. What must be the 
minimum size of the sample ? (Madras Univ. B.Sce., Oct. 1991] 


Solution. Here 2 = 1000 hours, o = 125 hours. 
Since we do not want the sample mean to be less than the guaranteed 
average mean (jt = 1000) by more than 2-5%, we should have 
x > 1000 — 2-5% of 1000 => x> 1000 - 25 =975 
Let n be the given sample size. Then 


~1-2P|0<z< 


Z =*—E~ NO, 1), since sample is large. 
ONn 


ewan 7 ata, 925-1000, a 


oNn 125Nn 5 


According to the given condition, we have 


(-.- x > 975) 
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P(Z > —Nn/5 ) = 0-90 => PO<Z<Vn/5 5) = 0-40 


V n/S = 1-28 (From Normal Probability cae 
=> n =25 x (1-28)* = 41 (approx) 


Example 12-22. A survey is proposed to be conducted to know the 
annual earnings of the old Statistics graduates of Delhi University. How large 
should the sample be taken in order to estimate the mean annual earnings within 
plus and minus Rs. 1,000 at 95% confidence level ? The standard deviation of 
the annual earnings of the entire population is known to be Rs. 3,000. 

Solution. We are given: o = Rs. 3,000. 

We want: P[1x-p1< 1,000] = 0-95 (4S 

We know that, in sampling from normal population or for large samples 


from any population X ~ N(, o2/n). Hence from normal probability tables, we 
have : 


> ?| 


P [I ZI < 1-96} = 0-95 


X= < 1.96 |=09s 


on 


=> P[Ix-pl<196x (Nn) =0-95 CF) 
From (*) and (**), we get 
196 X 5 _ ony — 1:96X 3000 _ sop 


Vn Vn 
n= (1-96 x 3)? = (5-88)? = 34:56 = 35 
Aliter. Using Remark to Example 12-17, 


na (zec2). 1-96 x 3,000 )? w~ 35 
“\ E ~ 1,000 ° 


12-14, Test of Significance for Difference of Means. Let x, be 
the mean of a random sample of size n, from a population with mean 1, and 
variance 6, and let x2 be the mean of an independent random sample of size nz 
from another population with mean 1. and variance 62”. Then, since sample 
sizes are large, 

x x, ~ Nu, O71 2/n,) and Xo ~NUy, ey) 2/n>) 

Also X — X2, being the difference of two independent normal variates is also 

a normal variate. The Z (S.N.V.) corresponding to x, — x2 is given by 
; S.E. ( Xi- X2) 

Under the null hypothesis Hp : t1; = p12, i.e., there is no significant 

difference between the sample means, we get 


E( X, —X,) = E(X,) - E(%2) = by - U2 = 0; 
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ee as _. 0,2 6,2 
V (F, -%) = V(x) + VX = +, 
ny 47) 
the covariance term vanishes, since the sample means x, and x, are independent. 
Thus under Ho : [4; = [2, the test statistic becomes (for large samples), 
Z = a 2 0, 1) (1211) 
V (G,2/ny) + (822/n2) 
Remarks 1. If 6,7 = 62? = 0?, i.e., if the samples have been drawn from 
the populations with common S.D. o, then under Ho : [4; = 2, 


Fai Xi %2 
©) Y (1/n, + 1/n) 
2. If in (12-11a), o is not known, then its estimate based on the sample 
variances is used. If the sample sizes are not sufficiently large, then an unbiased 
estimate of 0? is given by 
fy _ (my = 18)? + (ng = 152? 
(n, + nz - 2) 


ea ln - DEG, + m- NES2)] 


~ N(O, 1) ..-L12-11(a)} 


9 
° A. 
since Eo) = 
n, + 


e l 2 2)=02 
ae 5 lim-1e) +(m-1) 02] =6 


But since sample sizes are large, S,? ~ 5,7, S27? = 527, nn; -lo=m, 
n2-1-~ ny. Therefore in practice, for large samples, the following estimate of 
o* without any serious error is used : 

Ay N1S12 + No Sq” 
Ny, +N 

However, if sample sizes are small, then a small sample test, t-test for 
difference of means (c.f. Chapter 14) is to be used. 

3. If 0,7 # 62? and 6, and 02 are not known, then they are estimated from 


sample values. This results in some error, which is practically immaterial, if 
samples are large. These estimates for large samples are given by 


.-.[12-11(5)} 


A 
0: = S,? ~ $7 ‘ . 
since samples are large). 
| | Ae a So? x52 ( Pp gc) 
In this case, (12-11) gives 
*1= *2 ~ N(0, 1) ...[12-11(0)} 


SSS 
VN (5,241) +( 522/n2) 


Example 12-23. The means of two single large samples of 1000 and 
2000 members are 67:5 inches and 68-0 inches respectively. Can the samples be 
regarded as drawn from the same population of standard deviation 2-5 inches ? 
(Test at 5% level of significance). 
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Solution. We are given : 


n, = 1000, n2 = 2000; x, = 67-5 inches, x2 = 68-0 inches. 
Null hypothesis, Ho : 4, = [12 and o = 2-5 inches, i.c., the samples have 
been drawn from the same population of standard deviation 2-5 inches. 
Alternative Hypothesis, 1H, : 1, # 12 (Two tailed.) 
Test Statistic. Under Ho, the test statistic is (since samples are large) 


ic ~N (0, 1) 
afl *) 
re} + 
ny no 
a 67-5 — 68-0 ee ee 


—T.-). «2-5 x 0-0387 
2:5 X / RAR AAR 
1000 2000 

Conclusion. Since | Z | > 3, the value is highly significant and: we reject 
the null hypothesis and conclude that samples are certainty not from the same 
population with standard deviation 2-5. 

Example 12-24. In a survey of buying habits, 400 women shoppers are 
chosen at random in super market ‘A’ located in a certain section of the city. 
Their average weekly food expenditure is Rs. 250 with a standard deviation of 
Rs. 40. For 400 women shoppers chosen at random in super market ‘B’ in 
arother section of the city, the average weekly food expenditure is Rs. 220 with 
a standard deviation of Rs. 55. Test at 1% level of significance whether the 
average weekly food expenditure of the two populations of shoppers are equal. 


Solution. In the usual notations, we are given that 
= 400, : = Rs. 250, 5; = Rs. 40 
n2 = 400, X2 = Rs. 220 Sq =Rs. 55 
Null hypothesis, Ho : Ly = bb, 2.e., ae average weekly food expenditures of 
the two populations of shoppers are equal. 
Alternative Hypothesis, H, : 4, # 12. (Two-tailed) 
Test Statistic. Since samples:are large, under Ho, the test statistic is 


Z=—__-—*—__ ~ N00, 1) 


Since 0; and 62, the population sfandard deviations are not-known, we can 
take for large samples (c.f. § 12-15, Remark Nae 
O)2-= 5,7 and G = 5,7 
and then Z is given, by ‘ 


- = - 
ak —_— 250-220 = §8-82 (approx.) 


= 
an 


(E42) Af Sao + Soo } 


ny N2 
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Conclusion. Since |Z \ is much greater than 2-58, the null hypothesis 
(j1, = Ly) is rejected at 1% level of significance and we conclude that the average 
weekly expenditures of two populations of shoppers in markets A and B differ 
significantly. 

Example 12-25. The average hourly wage of a sample of 150 workers 
in a plant ‘A’ was Rs. 2-56 with a standard deviation of Rs. 1-08. The average 
wage of a sample of 200 workers in plant ‘B’ was Rs. 2-87 with a standard 
deviation of Rs. 1-28. Can an-applicant safely assume that the hourly wages — 
paid by plant ‘B’ are higher than those paid by plant ‘A’ ? 

~ Solution. Let X, and X, denote the hourly wages (in Rs.) of workers in 
plant A and plant B respectively. Then we are given : 


ny, = 150, %, = 2-56, 51-108 = 0; 


n> = 200, x2 = 2-87, s.=1-28= S> 
Null hypothesis, Ho: ; = U2,i.e., there is no significant difference 
between the mean level of wages of workers in plant A and plant B. 
Alternative hypothesis, H, 12> Wy i.e., by < U2 (Left-tailed test) 
Test Statistic. Under Hp, the test statistic (for large samples) is : 


Joie so A N(O, 1) 
0,7 ‘= ‘ sy sz? 
3) & m) 
a 2-87 ~ 0-31 — 0-31 
Z= = = = — = = 2-46 
Vo016 0126 
{0.08% , 28} uve 
150 200 


Critical region. For a one-tailed test, the critical value of Z at 5% level of 
significance is 1-645. The critical region for left-tailed test thus consists of all 
values of Z < -1-645. 

Conclusion. Since calculated value of Z (—2-46) is less than critical value 
(—1-645), it is significant at 5% level of significance. Hence the null hypothesis 
is rejected at 5% level of significance and we conclude that the average hourly 
wages paid by plant ‘B’ are certainly higher than those paid by plant ‘A’. 

Example 12:26. In a certain factory there dre two independent processes 
manufacturing the same item. The average weight in a sample of 250 items 
produced from one process is found to be 120 ozs. with a standard deviation of 
12 ozs. while the corresponding figures in a sample of 400 items from the other 
process are 124 and 14. Obtain the standard error of difference between the twa 
sample means. Is this difference significant ? Also find the 99% confidence 
limits for the difference in the average weights of items produced by the two 
processes respectively. 

Solution. We have 


n, = 250, x, = 120 o2., 5, = 12 oz. =6 
: (since samples are large). 


nz = 400, Xx2= = 124 0z., s, = 14 0z. = S, 
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S.E. (X1 -%) = V(G2/m) + Gein) = V(5,2/m) + (S2/ny) 


oie a) = V(0-576 + 0-490) = 1-034 


Null Hypothesis, Ho: 41 =(42,i.e.,. the sample means do not differ 
significantly. 
Alternative Hypothesis, H, : 4, # U2 (Two-tailed). 
Test Statistic. Under Ho, the test statistic is : 
__ Xx - X2 _ 120 - 124 
S. . (X, — X) 1-034 
IZi = =a = 3-87 
Conclusion. Since | Z| > 3, the null hypothesis is rejected and we conclude 
that there is significant difference between the sample means. 
99% confidence limits for | 11, — 1 |, i.e., for the difference in the average 
weights of items produced by two processes, are 


lx, = = X_ 1+ 2-58 S.E. (xX; +X) = 4+ 2-58 x 1-034 
= 4 + 2-67 (approx.) = 6-67 and 1-33 
1-33 <1, — pH, | < 6-67 

Exam ple 12-27. The mean height of 50 male students who showed 
above average participation in college athletics was 68-2 inches with a standard 
deviation of 2-5 inches; while 50 male students who showed no interest in such 
participation had a mean height of 67-5 inches with a standard deviation of 2-8 
inches. 

(i) Test the hypothesis that male students who participate in college 
athletics are taller than other male students. 

(ti) By how much should the sample size of each of the two groups be 
increased in order that the observed differénce of 0-7 inches in the mean heights 
be significant at the 5% level of significance. 

Solution. Let X, and X2 denote the height (in inches)‘of athletic 
participants and non-athletic participants respectively. In the usual notations, we 
are given : 

n, =50, x; =68-2, sy=2-53;n2=50, x2=67-°5, s2=2-8 

Null hypothesis, Ho: [ly = fl. 

Alternative hypothesis, H, : 1, >(12 (Right-tailed). 

Test Statistic. Under Ho, the test statistic for large samples is : 


~ N (QO, 1) 


Z ee.) Gat.) ae ~ N (0, 1) 
2 2 
Vn) 
my mj 
Z = —082-8f5._ = 0-7 = 0-7 = 1:32 


fase aay ‘oon 1 
530° ~ =50 
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For a right-tailed test, the critical (significant) value of Z at 5% level of 
significance is 1-645. 

(i) Since the calculated value of Z(1-32) is less than the critical value 
(1-645), it is not significant at 5% level of significance. Hence the null 
hypothesis is accepted and we conclude that the college athletes are not taller 
than ofher male students. 

(ii) The difference between the mean heights of two groups, each of size n 
will be significant at 5% level of significance if Z > 1-645 


ea 682-675. 4s 
(2 5) 2-84 
n nr 
_ 0-7 0-7 
= VF 31.645 = —“4— > 1.645 
V14.09/n 3-754) n 
- n> (ESss 3-754)" = (8-8219)? = 77-83, = 78 


Hence the sample size of each of the two groups should be increased by at 
least 78 — 50 = 28, in order that the difference between the mean oe of the 
two groups is significant. 

12-15. Test of Significance for the Difference of “Standard 
Deviations. If s; and sz are the standard: deviations 6f two independent 
samples, then under null hypothesis, Ho: 6, = 62, i.e., i.e., the sample 
standard deviations don‘t-differ significantly, the statistic 

Z= - ~ N(O, 1) for large samples. 


But in case of large samples; the SE of the difference of the sample standard 
deviations is given by ‘ 


6,7 o,2 
SE, (5, — 52) = on * on: 
Z=2=—1 2 _ _~NO,1) (12-12) 
GP 52? 
2n, 2m 


6,* and 6,2 are usually unknown and for large samplés, we use their 
estimates given by the corresponding sample variances. Hence the test statistic 
reduces to 
$1 - 52 


sy 
(2 7 om) 


Example 12:18. Random samples drawn from two countries gave the 
following data relating to the heights of adult males : 


Z= ~ N (0, 1) ...(12-13) 
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Country A Country B 
Mean height (in inches) 67-42 67:25 
Standard deviation (in inches) 2:58 2°50 
Number in samples 1000 1200 


(i) Is the difference between the means significant ? 
(tt) Is the difference between the standard deviations significant ? 
Solution. We are given : 
n,= 1000, x, =67-42 inches, 5, = 2-58 inches, 
no = 1200, X2=67-25 inches, s2 = 2-50 inches. 
As in the last examples (since sample sizes are large), we can take 
G, = 5, = 258, Sp=5.=2-50 
(i) Ho: pb, = 2, i.e., the sample means do not differ significantly. 
H, : by # Wy (Two tailed). 
Under the Null hypothesis Ho, the test statistic is 


X,—X2 . is 
=> —— -~ N(0, 1), since samples are large. 
V (S42/n1) + (S2/n2) 
Now Z = 0182 — 67-25 __ oper) 1 eee 7 
9.58 9. 50)2 666 6-25 
oie 1200 (i000 * 1200 


Conclusion. Since |Z | < 1-96, null hypothesis may be accepted at 5% 
level of significance and we may conclude that there is no significant difference 
between the sample means. 

(ii) Under Hp : that there is no significant difference between sample standard 
deviations, 

5; -S 


“SE. (5;-52) N(0, 1), since samples are large. 


Now Sv. (5; — 52) = 


< _A A 
if O, and oO, are not known and 6) = 5), 62 = 52. 


_ . | f_@ssy (2-502 | _ 
S.E. (5; —52) = 3x 1000t 2x 1200f = 007746 


“~ 


_258-2-50_ 0.08 _ 
Hence 2=""0.07146 0.07746 ~ } 3 


Conclusion. Since |Z | < 1:96, the data.don’t piovide us any evidence 
against the null hypothesis which may be accepted at 5% level of significance. 
Hence the sample standard deviations do not differ significantly. 


Example 12:29. Two populations have their means equal, but S.D. of 
one is twice the other. Show that in the samples of size 2000 from each drawn 
under simple sampling conditions, the difference of means will, in all 
probability, not exceed 0-150, where o is the smaller §.D. What is the 
probability that the difference will exceed half this amount ? 

Solution. Let the standard deviations of the two populations be o and 20 
respectively and let js be the mean of each of the two populations. Also we are 
given n, = nz = 2000. If x, and x, be the two sample means then, since samples 


are large, 
_ (%1 — X2) - E(x, — x2) 
SB. (#1 - 8 
N E(x, —X2) = E(x,)- E(x2)= 44 -y2=0 and 


Ow ; 
S.E. vt {f+ (5565 300) = 0-05 
(x1 ~ %2) = ny 2000 * 00) 
: — Xo 


Foal ~ NO, 1) 
S.E. (x; — X2) 


Under simple sampling conditions, we should in all probability have 
IZi<3 > lx, —~x,1<3 SE. (x ~—X>) 


~N(0O,1) ~ 


= (x, —X21< 0-150, 
which is the required result. 
‘We want .p=P[ 1x, ~¥31>2x 0-156] 


p =P[0-0501Z1>0-0750] Zz Z=- =~ yn (0:1)| 


‘0.059 
=P[IZl>15}=1-P[IZ1<.1-5] 
=1-2P(0<Z< 1-5) =1-2 x 0.4332 = 0-1336 


EXERCISE 12-2 
1..Define sampling distribution and standard error. Obtain standard error of 
mean when population is large. 
2. Find the standard error of a linear function of ainumber of variables. 


Deduce the standard error of the mean of n uncorrelated variables following the 
same distribution. 

3. Derive the expressions for the standard error of 

(i) the mean of a random sample of size n, and 

(ii) the difference of the means of two independent random samples of sizes 
ny, and np. 

4. (a) What is meant by a statistical hypothesis ? What are the two types 
of errors of decision that arise in testing a hypothesis ? Briefly explain how a 
statistical hypothesis is tested. 

The manufacturer of television tubes knows from past experience that the 
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average life of a tube is 2,000 hours with a standard deviation of 200 hours. A 
sample of 100 tubes has an average life of 1950 hours. Test at the 0-05 level of 
significance if this sample came from a normal population of mean 2,000 hours. 
State your null and alternative hypothesis and indicate clearly whether a one- 

tail or a two-tail test is used and why ? Is the result of the test significant ? 
{Calcutta Univ. B.Sc. (Maths. Hons.), 1990] 


(6) A sample of 100 items, drawn from 4 universe with mean value 64 and 
S.D. 3 has a mean value 63-5. Is the difference in the means significant ? What 
will be your inference, if the sample had 200 items ? 

{Madras Univ. B.E., Nov. 1990] 


(c) A sample of 400 individuals is found to have a mean height of 67-47 
inches. Can it be reasonably regarded as a sample from a large population with 
mean height of 67-39 inches and standard deviation 1-30 inches ? 

Ans, Yes, Z = 1-23. 


(dq) The mean breaking strength of cables supplied by a manufacturer is 
1800 with a standard deviation 100. By a new technique in the manufacturing 
process it is claimed that the breaking strength of the cables has increased. In 
order to test this claim a sample of 50 cables is tested. It is found that the mean 
breaking strength is‘ 1850. Can we support the claim at 0-01 level of 
significance ? 

Ans, Ho: 1 = 1800, H, : 2 > 1800, Z = 3-535. 

(e) An ambulance service claims that it takes on the average 8-9 minutes to 
reach its destination in emergency calls. To check on this claim, the agency 
which licenses ambulance services has them timed-on 50 emergency calls, 
getting a mean of 9-3 minutes with a standard deviation of 1-6 minutes. What 
can they conclude at the level of significance a = 0-05 ? 

Ans, Z = 1-768. 


(f) A paper mill in U.P. has agreed to buy waste paper for recycling from a 
waste collection firm, under the agreement that the waste collection firm will 
supply the waste paper in packages of 300 kg each, for which the paper mill 
will pay by the package. To speed up their work the waste collection firm is 
making packages by some approximation procedure. The paper mill does not 
object to this procedure as long as it gets 300 kg. per package on the average. 
The waste collection firm has an interest not to exceed 300 kg. per package, 
because it is ae paid for more, and not to go under 300-kg. because the 
paper mill might terminate the agreement if it does. To estimate the mean 
weight of waste paper per package, the waste collection firm weighed 75 
randomly selected packages and found. that the mean weight was 290 kg and 
standard deviation was 15 kg. Can we infer that the mean weight per package in 
the entire supply was 300 kg ? [Delhi Univ. M.A. (Eco.), 1987) 


Ans, Ho: p= 300 kg; H, -p # 300 kg. (Two-tailed). 


Z = 220 = 300 | 5.77 Significant. 


15N75 
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(g) The wages of a factory’s workers are.assumed to be normally distributed. 
with mean 2 and variance 25. A.random sample of 25 workers gives the total 
wages equal to 1250 units. 

Test the hypothesis : 11 = 52, against the alternative : 11 = 49, at 1% level of 
significance. 


1 [7232 
a } exp (- + u2) du =0-01. 


V2n 
[Calcutta Univ. B.Sc.(Maths. Hons.), 1988] 
Ans. Ho: 1 = 52, H,: = 49 < 52, (Left-tailed test). 
Z = -2, Not significant. 

5. (a) A sample of 450 items is taken from a population whose standard 
deviation is 20. The mean of the sample is 30. Test whether the sample has 
come from a population with mean 29. Also calculate the 95% confidence limits 
for the population mean. 

(6) A sample of 400 observations has mean, ©5 and standard deviation 12. 
Could it be a random sample from a population with mean 98 ? What can be the 
maximum value of the population mean ? 

6. (a) If the mean age at death of 64 men engaged in an occupation is 52-4 
years with standard deviation of 10-2 years, what are the 98% confidence limits 
for the mean age of all men in that population ? 

[Calicut Univ. B.Sc. (Subs.), 1989) 

(b) The weights of 1500’ ball bearings are normally distributed with mean 
22-40 and standard deviation 0-048. If 300 random samples of size 36 each are 
drawn from this population, determine the expected mean and. standard deviation 
of the sampling distribution of means, if sampling is done with replacement. 

How many of the random samples in the above problem would have their 
means between 22-39 and 22-41 ? [Madras Univ. B. - , April 1989] 


Hint. E (X) = = 22-40; SE, (X) =o/Vn = 0.048/V 36 = 0-008 
Required number of samples (out of 300) is :.300 x P (22-39 < X<22. -41) 


22-39 — 22-40 22-41 - 22-40\ 
= 300 x P ~ 0.008 <Z< 9.008 )sz NCO, 1) 


= 300 x P(-1-25 < Z < 1-25) = 600-x P(0 < Z< 1-25) ~ 237 

Vs (a) A random sample of 500 is drawn from a large number of freshly 
minted coins, The mean weight of thé coins in the sample is 28-57 gm. and the 
standard deviation is 1-25 gm. What are the limits which have a 49 to 1 chance 
of including the mean weight of all the coins ? How large a sample would have 
to be drawn to make these limits differ by only 0-1 gm, assuming that the 
standard deviation of tlie whole distribution is 1-25 gm. 

(6) A research worker wishes to estimate the mean of a population by using 
sufficiently large sample. The probability is 0-95 that the sample mean will not 
differ from the true mean of a normal population by more than 25% of the 
’ standard deviation. How large a sample should be taken ? (Ans. n = 62.) 


8. (a) A normal distribution has mean 0-5 and standard deviation 2-5. Find : 
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(i) The probability that the mean of a random sample of size 16 from the 
population is positive. 

(ii) The probability that the mean of a sample of size 90 from the 
population will be negative. 

(b) Thé mean of a certain normal distribution is equal to the standard error of 
the mean of a random sample of 100 from that distribution. Find the 
probability, (in terms of an integral), that the mean of a sample of 25 from the 
distribution will be negative. (Ans. 0-3085.) 


(c) The average value x of a random sample of observations from a certain 


population is normally distributed with mean 20 and standard deviation Sn. 
How large a sample should be drawn in order to have a probability of at least 


0-90 that x will lie between 18 and 22. (Karnataka Univ. B.E. 1991) 


9. (a) From a population of 169 units it is desired to choose a simple 
random sample of size n. If the population standard deviation is 2, determine the 
smallest ‘n’ for which the probability that the sample mean differs from the 
population mean by more than 0-75 is controlled at 0-05. 

(b) An economist would like to estimate the mean income (1) in a large 
city. He has decided to use the sample mean as an estimate of jt and would like 
to ensure that the error in estimation is not more than Rs. 100 with probability 
0-90. How large a sample should he take if the standard deviation is known to be 
Rs. 1,000 ? [Delhi Univ. M.A. (Eco.), 1986] 


2 2 
lego] _ pee eee _ ; 


(c) The management of a manufacturing firm wishes to determine the 
average time required to complete a certain manual operation. There should be 
0-95 confidence that the error in the estimate will not exceed 2 minutes. 

What sample size is required if the standard deviation of the time needed to 
complete the manual operation is estimated by a time and motion study expert 
as (i) 10 minutes, (ii) 16 minutes ? Explain intuitively (without referring ‘to the 
formula) why the sample size is large in (éi) than in (@). 

(Given Zo.975 = 1:96 and Zpo5 = 1-645) 

(Delhi Univ. M.C.A., 1987] 


2 2 2 
‘: -96 10 ° 
Ans. (n= (5) -(sp 2 ~ ) = 96, (ii) m= (pesas a 8) = 246, 


10. (a) Two populations have the same mean, but the standard deviation of 
one is twice that of the other. Show that in samples of 500 each drawn-under 
simple random conditions, the difference of the means will, in all probability, 
not exceed 0-30, where o is the smaller standard deviation, and assuming the 
distribution of the difference of the means to be normal, find the probability that 
it exceeds half that amount. (Ans. 0-1336.) 

(b) A simple sample of heights of 6,400 Englishmen has:a mean of 67-85 
inches and S.D. 2-56 inches, while a simple sample of heights of 1,600 
Australians has a mean of 68-55 inches and a S.D. of 2:52 inches. Do the data 
indicate that Australians are, on the average, taller than Englishmen ? 
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Ans. Ho : [ty = Uo, A; : [ty < fo, Z = 9-2, (significant). 

c) In a random sample of 500, the mean is found to be 20. In another 
independent sample of 400, the mean is 15. Could the samples have been drawn 
from the same population with standard deviation 4 ? 

11. (a) The following table presents data on the values of a harvested crop 
stored in the open and inside a godown : 


Sample size Mean ZL (x- xy? 
Outside 40 117 8,685 
Inside 100 132 27,315 


Assuming that the two samples are random and they have been drawn from 
normal populations with equal variances, examine if the mean value of the 
harvested crop is affected by weather conditions. 

Ans. Z = 0-342; Not significant. 

(b) Samples of students were drawn from two universities and from their 
weights in kgm., means and standard deviations are calculated. Make a large 
sample test to test the significance of the difference between the means. 


Mean S.D. Size of sample 
University A 55 10 400 
University B 57 15 100 


Ans. Z.= 1-2648; Not significant. 

(c) A storekeeper wanted to. buy a large quantity of light bulbs from two 
orands labelled ‘one’ and ‘two’. He bought 100 bulbs from each brand and found 
by testing that brand ‘one’ had mean lifetime of 1120 hours and the standard 
deviation of 75 hours; and brand ‘two’ had mean lifetime of 1062 hours and 
standard deviation of: 82 hours. Examine whether the difference of means is 
significant. 

12. The mean yield of two sets of plots and their variability are as given- 
below. Examine 

(i) whether the difference in the mean yields of the two sets of plots is 
significant, and 

(ii) whether the difference in the variability in yields is significant. 

Set of -40 plots Set of 60 plots 
+ Mean yield per plot 1258 Ib. 1243 Ib. 

S.D. per plot 34 Ib. 28 Ib. 

Ans. (i)Z=2:3, (ii) Z= 1:3. 

13. (a) In a survey of incomes of two classes of workers, two random 
samples gave the following détails. Examine waether the differences bétween the 
(¢) means and (éi) the standard deviations, are'‘significant. 


Mean annual Standard 
Sample Size income (in rupees) deviation (in rupees) 
I 100 582 24 
0 100 546 28 


Examine also whether the first sample could have come from a population 
with annual mean income of 500 rupees. 


_ 
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(6) The electric light tubes of manufacturer A have a lifetime of 1400 hours, 
with a standard deviation of 200 hours, while of manufacture B have a mean 
lifetime of 1200 hours with a standard deviation of 100 hours. If raridom 
samples of 125 tubes of each batch are tested, what is the probability that the 
brand A tubes will have a mean time which is at least (2) 160 hours more than 
the brand B tubes, and (ii) 250 hours more than the brand B tubes ? 


Hint. Under the assumption of normal population, the sampling 
distribution of (x, — x2) would have mean; 1, — HW, = 1400 -- 1200 = 200 hours 
and standard deviation : 


2 2 
SE. (%, -¥,)= ay = ) af {oe a | eco oe ts lees 


(i) The required ws is given by : 


(1 -%2)-(Mi-H2) _ 160 - 200 
—— ee 20 
S.E. (x1; - X2) 
= P(Z > -2) = 0-5 + P (-2<Z <0) 
(ii) The required probability is given by : 
P (( xX; —X2) 2250} =P (Z 22-5) = 0-5 -PO0<Z < 2-5) 
= 0-5 — 0-4938 = 0-0062 . 
14. A random sample of 1,200 men from one State gives the mean pay as 
Rs. 400 p.m.with a standard deviation of Rs. 60, and a random sample of 1,000 


men from another State gives the mean pay as Rs. 500 p.m., with a standard 
deviation of Rs. 80. 


Discuss, (stating clearly the result or theorem used), whether the mean 
levels of pay of men from the two States differ significantly. 


15. (a) A normal population has a mean 0-1 and a standard deviation 2:1. 
Find the probability that the mean of a sample of size 900 will be negative, it 
being given that the probability that the absolute value of a standard normal 
variate exceeds 1-43 is 0-153. 


(b) A random sample of 100 articles selected from a batch of 2,000 articles 
shows that the average diameter of the articles is 0-354 with a standard deviation 
0-048. Find 95% confidence interval for the average of this batch of 2,000 
articles. 


Hint. We are given n = 100, N = 2,000, x = 0-354, s =0-048. 


The. Standard Error of sample mean x in random sampling from the batch of 
N = 2,000 is given by : [c.f. (16-23)]. 


S.E(%) a Pe = Ya" x 7c S = s, for large n) 
A _ - 100 0-048 
= “2000-1. x ana = 0-00468 


Hence $5% confidence limits for 1 are given by : 


P {(x, —X2) 2 160} -?| 
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xX + 1-96 S.E. (x) = 0-354 + 1-96 x 0-00468 = (0-3448, 0-3632) 
16. (a) Explain the terms : 
(i) Statistic and Parameter 
(ii) Sampling distribution of a statistic, and 
(iii) Standard error of a statistic. 
(b) Explain why a random sample of size 30 is to be preferred to a random 
sample of size 25 to estimate the-population mean. 
17. (a2) Obtain the expressions for the standard error of sampling 
distributions of : (i) sample mean (x), and (ii) sample variance (s?), 
in random sampling from a large population. Assume that n, the sample size, is 
large. 
(6) Let X;, X>, ..., X, be a random sample from a population which has a 
finite fourth moment p, = E( X; — )’, r = 4; E(X;) =) and Var (X;) = 0? ; and 


- wha 2.1 
lt: X= DX and Stat 


z (X;-X )2. 


; -—!_ + 5 2 
Show that : OS= TaD 25% (X; - Xj)’, 


(i vars) =4[ p, -(3)"} 


(iit) Cov (X , 52) =pt3/n 
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Exact Sampling Distributions 
(Chi-square Distribution) 


13-1. Chi-Square Variate (Pronounced as Ki - Sky without S). The 
square of a standard normal variate is known as a chi-square variate with 1 degree 
of freedom (d.f.) 


Thus if X ~ N (1, 62), then Z -A=b~ N(O, 1) 
and Z= (x=4} , is a chi-square variate with | d.f. 


In general, if X,, (i = 1, 2; ..., n) are m independent normal variates wit! 
mean; and variance 67, (i = 1,2, ..., n), then 


n 2 pene 
y2 = 2 - al , is a Chi-square variate withnd.f. —...(13-1) 
13-2. Derivation of the Chi-square Distribution. 

First Method—Method of Moment Generating Function. 


If X;, i = 1,2, ..., n) are independent N(u1;, 62), we want the distribution of 


n en 8 a aa: 
yaa y (ABN SY U2, where U; 57 
isl 0; iw l G; 
Since X;’s are independent, U;’s are also independent. 
M apht) = M ru2 = U, M U? (¢) = (My 2)", 
since U;’s ~ N (Q, 1) are identically distributed. 
Now 


My a() =Elexp WU?) =| exp(u?)f ddr, 


7 1 
= exp(tu?) —= exp (- (x; - )?/207)} dx; 
J ex) eo Gs -wot 


= iz J = (tu?) exp ~ u7/2) du; E = feel 
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__1 Nn" 


V2n (: -: au" 
2 


= (1-21-72 


M, (0) = (1 20)-*”? (13-14) 


ere ] 
which is the m.g.f. of aGamma variate with parameters ; and 57. 


Hence by uniqueness theorem of m.g.f.’s, 
. » ae LL; 2 
oe ai ~ Bi 
=D ( = ) 


is a Gamma variate with parameters ; and zn. 
_ (t/2)r? _1 (n/2) - 1 by? 
. aP(x’) _ T(n /2) ° [exp ( 2 x] (x?) ay 


which 1s the required probability density function of chi-square distribution fan 
n degrees of freedom. 

Remarks 1. If a‘random variable X has a chi-square distribution with n 
d.f..we write X ~42,,) and its p.df, is given by : 


fix) = pane” X02) -1 0S x < 00 ,..(13-2a) 


2.16 X ~ x%q, then (X/2) ~ y(h/2). 
Proof, The p.d.f. of Y = +X, ts given by : 


eo) =). (FI > 


eee) ee tnf2)-1 
= PGI? enNP-1.2 


1 
| r nye 7 yy ; OS y< o 
=>  Y =(X2)~ y(n2) 
Second Method—Method of Induction 


If X;isaN (0, 1), then X;7/2 is a es so that X7 is a y2-variate with 
df. 1. 


° J exp (~ a?x?) dx = vn 
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If X, and X, are independent standard normal variates then X ? +X,7 isa 
chi-square variate with 2 d.f. which may be proved as follows : 
The joint probability differential of X, and X, is given by : 


=50 exp as + x2?)/2} dx, dx, —0o < (x), X2) S$ < co 


Let us now transform to polar co-ordinates by the substitution . x, =rcos 8, 
X, =r sin 8, Jacobian of transformation J is given by 


Ox; OX» : 
> + cos 8 sin 6 
ag 3 3 = = 7 
Ax, dX» ee 
7 «(00 rsin® rcos@ 


Also we have r2 = x,? + x? and tan @ = x2/x,. AS x; and x2 range from 
— oo tO + co, 7 varies from 0 to » and 9 from 0 to 2x. The joint probability 
differential of r and 8 now becomes 
dG(r, 0) =a exp (~ r2/2) rdrd@;0<rs0,0<O0<2nr 


Integrating over 9, the marginal distribution of r is given by 


-|* cy 2% 
diy) =} dG, ) =r exp (- P72) «| ms in 


= ra (-—r/2) rdr 
= dG’) = 5 exp (- r°/2) dr* 


=H Hp PCP) 2)! dep) 


2 X,7+X,? . 
Thus a er is a ¥(1) variate and hence r2 = X ,2 + X,* isa 


x?-variate with 2 d.f. 


For n variables X,;,@ = 1, 2, .... 2) we transform (X,, X2, ..., X,) to 
(x, 91, 92, ..., 84-1); (1 ~ 1 transformation) -by 
= X cos 6; cos 8, ... cos 9, 
X2 = X% cos 8, cos O2 ... cos 0,_5 sin 9, 
"X3.= % cos 0, cos 0, ... coss 8,_3 sin 6,_» 
(133 


x;= % cos 0, cos 8, ... cos 6,_; sin 8,_;,; 


X, =X sin 0, 
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where x > 0, —- 2 < 0, < 2 and —-n/2 < 0;< n/2 for i = 2, 3, ... (n— 1). 
Then X74 Xq? +... + XQ? =X? 
and IJ l=" cos*26, pe QO, ... cos 6, > 
(cf. Advanced Theory of Statistics Vol 1, by Kendall and Stuart.) 
The joint distribution of X,, X2, .... X, viz., 


dF (x1, X, 00) %q) = (Im) exp (->, x2/2) tr dx. 


transforms to 
dG(y, 91, 2, ..., O.1) = exp (-2/2) x! cos *? 8; cos" @, ... cos 9,2 
dyd8,d8, ee d8,, 
Integrating over 0,, 62, ... 6,-1, we get the distribution of 72 as 
P(x?) = k exp (~x7/2) (12)! dy?, 0 < x? < 0 
The constant k is determined from the fact that thé total probability is unity, 
L.é., 


Jo dp og=1 = kf expan O27! dy? = 1 


=> k= 


l 
2”? T(n/2) 
aP(y’) ran) exp (-42/2) (x2)? " ,0 Sx? < e 


; a ee ; 
Hence => >» X;7 is a y(n/2) variate. 
i=l 


=> x? = Y X/; isa chi-square variate with n degrees of freedom 
i=l 
(d.f.) and (13-2) gives p.d.f. of chi-square distribution withn df. _ 
Remarks I. If X;; i= 1, 2, ..., n are n independent normal variates with 
a X | oa i 2. ‘ , 
mean }; and S.D. 6;, then »y ( = P is a y?-variate with n d.f. 


se] 


2. In random sampling from a normal population with mean jt and S.D. 6, 
x is distributed normally about the mean pt with S.D. o/Vn . 


A! LN @,1) 


-_, |2 
=> eal is a y?-variate with 1 4.f. 


3. Normal distribution is a particular case of 1?-distribation when n = 1, 
since forn =}, 
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1 | 
PAX?) == exp 47/2) (x7)? ° dyx?, 0 $7? < 
V2 r(1/2) 


1 
=——= exp (-y2/?2) dy,-~<y <0 
ime x x x 


Thus x is a standard normal variate. 
13:3. M.G.F. of y?-distribution. Let X ~ 72,,). then 


My(t) = E(e%)= J, e fix)dx : 


| to] 
ae ; e® | e- V2 x(ni2)-1 dy 
ee eee a 1~-2¢ 2 
“satan J, °?|- (2) 2] eo 
2 1 T'(n/2) : 
* De T(n]2) [dd - 2/2"? (Using Gamma Integral] 
= (1 -2¢)-"? ,12t1< 1 ...(13-4) 


which is the required m.g.f. of a y?-variate with n d.f. 
Remarks 1. Using Binomial expansion for negative index, we get from 
(13-4) iflel<>. 


2! 
nen 
a 19+ ] > + 2 a+r - l 
+ = a Cty tix 
ut, = Coefficient of in the expansion of M(4) 
n(n n {n 
-v3 (t+1)(t+2)..@er-1) 
= n(n + 2)(n +4)... (n + 2r -2) ...(13-4a) 
Remark. If n is even so that n/2 is a positive integer, then 
u, =2 Tn) +r/Tmp) ...(13-4b) 


13-31. Cumulant Generating Function of  y?-distribution. If 
X ~ x(q), then 


Kx*(t) = log My() =- 5 log (1- 22) 
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(21)? (293 ( 4 : 
-2[ ars 20 + 2s + 24 + “a 


K, = Coefficient of tinK() =n 


2 
Ky = Coefficient of “— in K(1),= 2n 


2! 
3 
kK, = Coefficient an in K(t) = 8n 
4 
Ky = Coefficient of 75 in K(t) = 48n 
In general, 
K, = Coefficient of inK() =n2™\(r—1)! (13-40) 
Hence 
Mean =k, =n, Variance = [ly = Kz = 2n 
[ig = K3 = 8n, [ty = Ky + 3X2? = 48n + 12n? .--(13-4d) 
37 _ 8 My _ 12 
== and =tto-=43 
Bs Hye 7m Ba iy? on 


13-3-2. Limiting Form of x? Distribution for Large Degrees of 
Freedom. If X ~ 72), then My(#) = (1-21)-*2, lt I< ; 


The m.g.f. of standard y*-variate Z is given by 
Mx -p(d =e WOM | (do) (u = 2, o? = 2n] 
o 
or M(t) =e (1 - 2a) *!? 


=€ 


V2n 


NE n 2 

Kx =log Mj) =-t ion biog (1-2 ) 
_ aft 2,2 2,8(2\” 
=e Bete [240.244 7 


nth ( _ Tm) : 


= - + O(n-'), 


where 0(n7!) are terms containing n'2 and higher powers of n in the 
denominator. 


=> M(t) = e712, as n 00, 
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which is the m.g.f. of a standard normal variate. Hence by uniqueness theorem of 
m.g.f. Z is asymptotically normal. In other words, standard’ y? variate tends to 
standard normal variate as n — oo. Thus, %?-distribution tends to normal 
distribution for large d.f. 


In practice for n 2 30, the y2-approximation to normal distribution is fairly 
good. So whenever n 2 30, we use the normal probability tables for testing the 
significance of the value of 72. That is why in the tables given in the Appendix, 
the significant values of x? have been tabulated till n = 30 only. 


Remark. For the distribution of y?-variate for large values of n, see 
Example 13-7 and also Remark 2 to § 13-7-1. 


13-3-3. Characteristic Function of y?2-distribution. 
If X ~ x7(,), then 


x(t) = Elexp (ix) = | “exp (ita) flx) dx 


0 


1 7 1 — 2i = 
=sarary | o|- (a2) <fwt a 


= (1 — 2it)-"2 ..(13-4e) 
13-3-4, Mode and skewness of y?-distribution. 
Let X ~ x7(,), SO that 
f@) “Sry x2)-1 OSx <0 »eo(*) 
Mode of the distribution is the solution of 
f’@)=0 and f%(x)<0 
te peters wi. : x in (*) ae 


os -34 G : 0 —— (135) 
Since f(x) #0, f@)=0 > x=n- a 
It can be easily seen that at the point, x = (n — 2), f* (x) < 0. 
Hence mode of the chi-square distribution with n d.f. is (# — 2). 
Also Karl Pearson's coefficient of skewness is given by — 


< - Mean = Mode _ ne (n - 2) _ \/2 
Skewness == SD. ae zs ...(13-6) 


Since Pearson’s-coefficient of skewness is greater than zero for 2 2 1, the 
7%?-distribution is positively skewed. Further since skéwness is inversely 
proportional to the square root of d.f., it rapidly tends to symmetry as the df. 
increases and.consequently as n — oo, the chi-square distribution tends to normal 
distribution. 

13-3-5. Additive Property of 1~?-variates. The sum of independent 
chi-square variates is also.a x?-variate More precisely, if X;, (i= 1,2, ...,k) are 
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k 
i;dependent x?-variates with n; df. respectively, then the sum & X;is also a 


i=l 
k 
chi-square variate with & n; df. 
i=J 
Proof. We have 
My = (1 -2¢)-"/?;i= 1,2, ...,k. 
k 
The m.gf. of the sum > X; is givén by 
i=l 
My @) = My () My () ee «My (0) [-.. X,’s are independent] 


= (1-20) (1 -— 28)" ... (1 — 28)? 
= él fe Qt) t+ vee A,)/2 


which is the m.g.f. of a x*-variate with (n, + no +... + nt) df. Hence ey 


uniqueness theorem of m.g,f.’s, x X; is a y?-variate with bs n; af. 


Remarks 1. Converse is also true, ie., if X;; i= 1, 2,..., k are 
k 


y2-variates with n;;i=1,2,...,kd,f. respectively and if ¥ X; is a y?-variate 
iz] 


k 
with > n; df., then X;’s are independent. 
iz l 


2. Another useful version of the converse is as follows : 

If X and Y are independent non-negative variates such that X + Y follows 
chi-square distribution with n, + 2 df. and if one of them,:say, X is a ¥?- 
variate with n; df. then the other, viz.,Y, is a x?-variate with n2d/f. 

Proof. Since X and Y are independent variates, w2 have 

My yO =Mx® My) 


=> (1-2) *%? 211-277 . My) 
ee 3 2 aay: 
[xX4+Y WP aeny and X Gay’ 
=> M(t) = (1-20) 2” 


which is the m.g.f. of y?-variate with nz df. Hence by uniqueness theorem of 
m.g.f.’s, Y ~ es 


3. Still another form of the above theorem is “Cochran theorem” which is 
as follows: «*: 


Let X,, X2, ...,X, be independently distributed as standard normal variates, 
i.e., N(O, 1). Let 


2 Xi7=Q1+Qo+ ... + Oe 
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where each Q; is a sum of Squares ot linear combinations of X4, X2,..., X,, with 
n; degrees of freedom. Then if nj +7.+...+n,=n, the quantities Q;,Q>..., 
Q, are independent ¥2-variates with n,, ies .--, my GF. respectively. 


13-4. Chi-square Probability Curve. We get from (13-5) 


f= ja=2=8| fl). 13-7) 
Since x > 0 and j(x) being p.d.f. is always non-negative, we get from 
(13-7): 
f(x) <0 if @-2) <0, 


for all values of x. Thus the y?-probability curve for 1 and 2 degrees of freedom 
is monotonically: decreasing. 


When n> 2, 
| > 0, if x< (n — 2) 
f@M=4 =O0,ifx=on-2 
<Q, if x > (nm - 2) 


This implies that for n > 2, f(x) is monotonically increasing for 
0 < x < (n— 2) and monotonically decreasing for (n -— 2) < x <o, while at 
x=n-—2, it attains the maximum value. 


For n = I, as x increases, f(x) decreases rapidly and finally tends to. zero as 
x — oo, Thus for n > 1, the y?-probability curve is positively skewed [c,/f. 
(13-6)] towards higher values of x. Moreover, x-axis is an asymptote to the 
curve. The shape of the curve forn= 1, 2,3, ...,6 is given below. 


f(x} 


05 


For n = 2, the curve will meet 
the y = f(x) axis at x = 0, i.e., 
at f(x) = 0-5. For n = 1, it will 
be an inverted J-shaped curve. 


x 
PROBABILITY CURVE OF CHI-SQUARE DISTRIBUTION 


Theorem 13-1. [f ~;? and y2* are two independent y?-variates with 
n, and nz df. respectively, then 
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2 
om isaBa (3. ir variate 


(Gauhati Univ. M.Sc., 1992) 


Proof. Since y,7 and x27 are independent y2-variates with n, and n d.f. 
respectively, their joint probability differential is given by the compound 
probability theorem as 


AP(X 17, X2”) = dP, (%,7) aP2 (x22) 


S 1 wy, (n,/2) -1 
= Ee p) exp (-x17/2) (11) a? | 


a yD 2) (m/2)-1 4 2 
«| a ce om He) 2?) a2 | 


a a: (aes _ 2 2 
ae ae + n,)/2 T'(n,/2) T'(n/2) exp (- (1 + X2 2} 


Ls a 
. a1’) as (X2’) dx "dX, 05.47, X12”) <0 


Let us make the transformation : 


ue =YX2/Xo" and v =X," 
so that y,2 =uv and Yo? =v 
Jacobian of transformation J is by 
= (x1? ’ 12 2) = u =y 
d(u, v) 0 1 


Thus the joint distribution = random variables U and V becomes 


dG(u, v) ‘aetna {—(1 + u)v/2} 


x (uv) ty?" Ud db, 


1 
= 30 + Fn /2) F(ngl2) {—(1 + u)v/2} 


uk Ay + Ay 
xu2-'y 2 -! du dv, 0 < (u, v) <0 
Integrating w.r.t. v over the range 0 to oe, we get the marginal distribution 


of Vas: dG,(u) = | "dG v) 


-——_—___! >) ry-1 
“2A Timmy 


im exp -(2 + “\, yllmy +m)/2)-1 
P 2 
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et _ em) __ 
~ 242 Tn /2) P(np/2) © [C1 + u)/2]™* 9? 


ee eee de gs CL, Su < co 
mn No [+ uj * "2? 
Ba 2 


i , 
Hence U = isaB.(2 ; 3) variate. 
Theorem 13-2. [f x2 and x2? are independent y?-variates with n, and nz 
df. respectively, then 
-— xr 
Xa X2 
set m R2 2 vari 
are independently distributed, U asa B, ( ar) ) variate and V as a x? variate 
with (n, + no) df. 
Proof. As the Theorem 13-1, we have 


ee ee! es a2 2 
aP (x7, X2”) = 90 +2 T(n /2) F(ny/2) exp (- Qi? + X2")/2) 


and V= 2+ x2 


X (17) H (002) dy? dyz?, 0S (112, 12) < 00 
Let us transform to u and v defined.as follows : 
oo a3 2 
Me + Xe and v= 1° + X2 
so that y,?2 =uv and yo? =v—-4x;7 =(1-u)v 


As x, and x»? both range from 0 to °°, u ranges from 0 to 1 and v from 0 
tO 00, 


Jacobian of transformation J is 


Vv u 
J= =y 
-v l-—u 
1 (n/2)-1 
.. di R =m cn Une ce = te | 
ey) 2+"? rn /2) T(n/2) a 
x [a uy) a du dv 
1 (n,/2) -1 (n,/2) - 1 
= Ate? a ae 1 - 
2% * 2 Fn 12) PinJ2) oe 


x exp (-v/2). vi #722) -1 ay ay 


-| Dir + nal2) cn 12) - 1 uy! dy 
P(1,/2)0'(p/2) 


1 ((ny + 5/2) -1 
MN ce le LL -—y/2 Vv 1* ¥ dy 
Fou P{((n, + 2)/2) Bo 
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Since the joint probability differential of U and V is the product of their 
respective probability differentials, U and V are independently distributed, with 
1 


dG.) =—@——— ur" aw)" du, Osu 
B my Re 
2.2 
and 
1 ((n, + 0,)/2} -1 
dG = eo Oe GS rma dv, 
7) amr + aie r{(n, + n>)/2)} p ( v/2) v 
0 Sv <0 
ie, UasaB, § ; 2) variate and V as a x2-variate with (n, + n2) df 


Remark. The results in Theorems 13-1 and 13-2 can be summarised as 
follows * 


If X ~ Kin and Y ~ x2 yar independent chi-square variates then : 


i ¥ = 42 
OX+K~ x, a) 
variates is also a chi-square variate. 


i.e., the sum of two independent chi-square 


(it) * ~ Bo e ; "Bie. the ratio of two independent chi-square 


variates is a B2-variate. 


(iit) aa a Bi tx 4 


Theorem 13-3. In a random and large sample, 
k ey} 
a) — (13-8 
ee | al | (13-8) 
follows chi-square distribution approximately with (k - 1) degrees of freedom, 
where n;‘is the observed frequency and np;is the corresponding expected 
k : 


frequency of the ith class, (i = 1, 2,..., 4), nj = Nn. 


Proof. Let us consider a random sample of size n, whose members are 
distributed at random in k classes or cells. Let p; be the probability that sample 
observation will fall in the i th cell, @= 1, 2, ..., &). Then the probability P of 
there being n; members in the ith cell, (i= 1, 2, ..., k) respectively is given by 
the multinomial probability law, by the expression 


R 
n.! P * py"? ... De’, 
ke 


k k 
where 2» n=nand & p;=1. 
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13-13 
_ Ifn is sufficiently large so that n;, @ = 1,2 


, 2, «.., K) are not small then using 
Strling’s approximation to factorials for large n, viz 


lim (nt)=V2netn"* +3 » We get 
n > co 
V2n e-"n"*3 


n n n 
Pr ' P22... Pe’. 
1 n 
+i +! 
np2 2 MPEVNa*, 
ny No eee ny 


W2 2n)E} eA ntti +t) (Dn. .., p,)'2 


ell (ay 


1 
where C = k-1 -1 9 
(20) S— 0? n &- OP" (pip,...pp)!? 
is a constant independent of n;’s 


= x 
(V2n)te™ +nat+... + ™) 


e-" nt *? én AP, 


k 
log P =logC + 27 (n; + 5) log by 


=> log (P/C) = Oa ios). 


-»(*) 
i=] 
where A; = np; is the expected frequency for the ith cell, i.e 
E(n;) =,Npj = Xi» (7 = I, 2, ooey k). 
Let us define 
é _ nj- Xi 
t VA; » 

so that n; - A; = END; => nrzA;+6; Vi; woe”) 
Substituting in (*), ig get 


log (PIC) = x _ Ose Bs VR +5) tog [ I 


- E (A;-+ &, V2; +4) log L/{1 + &/VI) 


— 
= E Ort &i VAs +5) log (1+ G/VE)) 


If we assume that €; is small compared with A;, the expansion of 
log 1 + (€/VA,)} in ascending powers of €,/V/A; is valid. 
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log Pic ~- 5, (A; +E Vit pw) [ Fe 7“ +0 na”) | 


a E [ewa;- 5&2 +62 + 007%), 


neglecting: higher powers of EN); if ©; is small compred with A,. 
Since n is Jarge, so is A; = np; Hence O(A;"”) — 0 for large n. 


k k k k 
Also 2 ENA = 2 a al ni- 2 i 


k k 
= Enon E pen-n=0 (Een, I=) 


im] 


k k k 
log (PIC) =-[ E ENA +5 E E2+00"%] =-5 Eb 
k 
=> P =C exp (-3 z 8) 
i=l 


which shows that €;, = 1, 2, ...,&) are distributed as independent standard 
normal variates. : 


k 
Hence >> E2 = , ean ac 


iwl i=l 


being the sum of the squares of k independent standard normal variates is a {?- 
variate with (k — 1).d.f., one d.f. being lost because of the linear constraint 


k k 
Eee La-N=0% Emm Em ce 


Remarks 1. If O; and E; é = 1, 2, ..., &), be a set of observed and 
éxpected frequencies, ae 


k k 
i 5 he alt (2 O= 2 EF) ...(13-8a) 


i=l 


follows chi-square distribution with (k — 1) d.f 
Another convenient form of this formuta is as follows : 


z 0,2 + E?- 20 ;E; O7 
,( E; }=,2, (z+ £+- 205) 
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k 
-) & )-™, ..(13-8b) 


i=} 


k k = 
where > O;=  E;=N (say), is the total frequency. 
isl iw] 


2. Conditions for the Validity of y?-test. y7-test 1s an 
approximate test for large values of '\n. For the validity of chi-square test of 
‘goodness of fit’ between theory and experiment, the following conditions must 
be satisfied : 

(@) The sample -observations should be independent. 

(ii) Constraints on the cell frequencies, if any, should be linear, e.g., 
Ln; = x A; or Z O;= ZX E;. 

(iit) N, the total frequency should be reasonably large, say, greater than 50. 

(iv) Ng theoretical cell frequency should be less than 5. (The chi square 
distribution is essentially a continuous distribution but it cannot maintain its 
character of continuity if cell frequency is less than 5): If any theoretical cell 
frequency is Jess than 5, then for the application of 72-test, it is pooled with the 
preceding or succeeding. frequency so that the pooled: frequency is more than 5 and 
finally adjust for the d.f. lost in pooling. 

3. It may. be noted that the ?-test depends only on the set of observed and 
expected frequencies ‘and on degrees of freedom (d/f.). It does not make any 
assumptions regarding the parent population from which the observations are 
taken. Since x? defined in (13-8) does not involve any population parameters, it 
is termed as a statistic and the test is known as. Non-Parametric Test of 
eda ree Test. 

4. Critical Values. Let y,7(a) denote the value of.chi-square: for. n.d_/f. 
ail that the area to the right of this point is q, i.e., 


PIX? > Xa (a) = a .«.(13-8c) 


Critical values 


Rejection ee 
Zo one 


The value x,7() defined in (13-8c) is known as the upper (right-tailed) 
a-point or Critical Value or Significant Value of chi-square forndf. and has 
been tabulated for different values of n and @ in Table VI in the Appendix at the 
end of the book: From these tables we observe that the. critical values of y2 
increase as n (d.f.) increases and level of significance (a) decreases. 
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The critical values for left-tailed test or two tailed tests can be obtained from 
the above table, as discussed in Remark 1 to § 16-7-4. 


13-6. Linear Transformation. Let us suppose that the given 
set of variables X’ = (x,, X2,,... X,) iS transformed to a new set of variables 
Y’ = ()), Yo: «-+> Ya) BY means of the linear transformation : 

Vy = AyyXy + Ay QXo + «0. + AiyX, 
Voy A2j{X) + Aq2X%_ + 1... + Aan X, 
.-.(13-9) 


Ya BniX) * An2X2 + ... + Ann Xn 
i.é., Yi =H AiyX, + AQXQ + 00. HQink, 1 = 1,2, ...,0 


In matrix notation, this system of linear equations can bé expressed 
symbolically as 


Y = AX . ...(13:10) 
y1 x) Qi; Fj2--- Qin 
x a G72 ... Arn 
where = - ¥=[° | xel 2? Lasl 2% 
Ya \ Xn Qn} An? ..: Ann 


From matrix theory, we know that the system (13- 10) has a unique solution 
iff | A | #0. In other words, we can express X uniquely. in terms Y if A is non- 
singular and the solution is given by 

. X=A"'y ...(13-10a) 
where A-! is the inverse of the square matrix A. 

The linear transformation defined in (13-9) or -(13-10) is said to be 

orthogonal if 


X’X = Y’Y (13-11) 
=> X’X = (AX)’ AX = X‘(A’A)X 
=> A’A =I, : . (13-114) 
= A isan orthogonal matrix. 
More elaborately 

a Y’Y 
=> 7 x? = , y7 = a (GX, + ApXo + 2. + igXa)*,’ ..(*) 


i=l i=l 
for every set of variables, (x,, x2, ..., Xa): 
Rn 
EE we write oy = 2 Biz Aes (i,j j= 1,2,..., A”), 
* hea (*) implies that re is a Kronecker delta so she 


lisj 
$;:= ...(13- 
: {f tej ta 


whence it follows that A is‘an orthogonal matrix. 
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Linear Orthogonal Transformation. Def.:.A linear transformation 
¥ = AX, is said to be otthogonal if A is an orthogonal matrix. 

Remarks 1. It is very easy to verify the equivalence of the following two 
definitions of an orthogonal matrix. 

Def. 1. A square matrix A (nxn) is said to be orthogonal if 
A’A =AA‘ -I,. 

Def. 2. A square matrix A is said to be orthogonal if the transformation 
Y¥ = AX transforms X’X to Y’Y. 

2. If ¥Y = AX is an orthogonal transformation, then Y’Y = X’X and 
A’A =AA’ =I, 

Theorem 13-4. (Fisher's Lemma). If X;, (i= 1,2, ...,n) are independent 
N(O, 0) and they are transformed to a new Set of variables Y;, (i =1, 2, ...,n), 
by means of a linear orthogonal transformation, then Y; (i= 1,2, ...,n) are also 
independent N(0, 0). 

Proof. Let the linear orthogonal transformation be 

Y = AX so that Y’Y = XX and A’A =I, 

Since X;, (i = 1, 2, ..., 2) are independent N(0, a7), their joint density 

function is given by 


1 ¥ te 
X1> Xo» .06» X_) = (—— ) . exp] - x;*/202 |, — 00 < (x}, Xo, «00, Xp) < 
Passo od) = (Te) ( 7 - 


i=l 


= (—LY exp (-X’X/202) 
OV 2r 


The joint density of (Y;, Y2, ..>, Y,) becomes 


‘| 
eisyo, sees Yn) = 
ae 


) exp (-Y’ Yio?) IJ | 


1 _ oy, Y2> sos Yn) _ 
my J ~ 0(x1, x2, ce 
Now A’A =I, 
= [A’AL =I1,1=1 
=> -IA’IIAIL =1 
= IAI? =1 GslAI=IAL) 
=> [Al =+1 
I\Jl =tt1)=1- 


2 . 1 ag S 
2(V15 Yor ees Ya) = ‘) exp (- Y’Y/20?) 
(a) 


. 7 
1 ee 
= Th [—heeww coe] 
ios 1 | 2n 
Hence Y,, (= 11,72, ..., i) arerindependont N(0, 02). 
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Theorem 13-5. Let X,, Xz, ...,X, be a random sample from a normal 
population with mean piand variance o*. Then 


@) X x ~ NU o7/n), 


y \2 
(ii) >> (=X x) is a 7*-variate with (n -1) df., and 4 


sal 


is a 
a 2 = 
(iit) X =4 > X; and —_ > (t= x) are independently distributed, 


‘  Dethi Univ. B.Sc. (Maths Hons.) 1987; 
Sardar Patel Univ. B.Sc. 1992} 


Proof. The joint probability differential of X,,X2, ..., X,q iS given by 


1 a 
dP 9 o e0r9 A = ns e 8 2 dx aX, ; 9 
(x; x2 Xn) (; oa) o| - 3 20? , y (x; - - jt) | 1 4X. 


— 00 < (xX), Xz, ..-, X,_) < 00 
Let us transform to the variables Y;, (i = 1, 2, ..., 2) by means of a linear 
orthogonal transformation (Y = AX) (cf. § 13-6, page 13-16). 


Let us choose in particular 
Qy, =@Q\2 =... =Qy,= yn 
=> NN en +%2+..,; +X,) = Vn x .»(*) 
Vn 


(It can be easily seen that the above choice of @;;,,@12, ..., @1, Satisfies the 
a 
condition of orthogonality; viz., 2 a;? = 1). 
| 


Since the transformation is orthogonal, we have 


+ y,* . > nD (x; -x *? +nx? 


| an 


= Led ey? [From (*) 
=> Bs yi = De -3? -i(**) 


Also 2 -yP = 2 a -¥ + kp d (x; - FP +(x - py? 


A ©-- 
= 2 yi? + n(x — py? [From (**)] 


AS in Theorem 23-4, the Jacobian of transformation J = + 1. 
Thus the joint density function of X,, X2, ..., X, transforms to 
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1 a 
dG()y1, ya, :30% Yad “(nal exp| - 3 262 12, yi? + n(x- wr | 


xl Ji dy, dyz... dy, 
= — me a =! / a 
F 2n(a/V n) i 207 anyon 


1 n-1 = ye 
x ( - exp |. z x dy2 tyad| 


(.- dy, =Vndx) 


Thus X und © Y¥;2 = 2D (X;-X)? =ns?, (where 5? is the sample 
iw2 


isl 
variance), are independently distributed, which establishes part (iii) of the 


Theorem. Moreover X ~ N (ut, o2/n) and Y,, i = 1, 2,.3, ..., 2) are independent 
N(0, 62). Hence 


i2x2 i=] é 


being the sum of squares of (nm — 1) independent’standard normal variates is 
distributed as y?-variate with (n — 1) df. 


Aliter. The alternative proof of the above Theorem is based on the use of 
m.g.f.’s and is given below. 
We shall first prove that : 
xX =X and Kim KX, i =1,2,...” .» (L) 
are independently distributed. 
The joint m.g.f. of X and (X; -X) is given by : 


M(t, t>) =| eit iy (X,- | = Ee -t,) x + iy x,| 


=e em {45 > i+ ok 

is] : 

-elo| (12+ 12) x, + 4% z «| 
j= 
UY #1) 


13-20 Fundamentale of Mathematical Statistics 


-z[ew{ (852+4)x1].2 em Hs x, | 


U * i) 


i ...(ii) 
(-: X1,X2, ...,X, are independent) 


Now U = 2 X;, being the sum of (n - 1) iid. N (1, 02) variates 
j= 1 


isaN{(n—- 1), (n ns 1) 0} variate. 


MAd) =exp[t. (an - 1) p + 2. (n - 1) 62/2] (iti) 
Hence 
E| exp {at bs X j|-e[ex (a ; v) 
Gai 
= My T(t, — &)/n) 
-ex|(* maa ~1)p+ (" = 42.) (n - 1) =| 
: [On using (iii)] .. (EV) 


and E| exp{ (* - f2 + i) Xi My. (* = f2 + a) 


oe = = 2.g2 
=exp| formes oq + eee 2) e] 


[-X;~N (i, 029] el) 
Substituting from (iv) and (v) in (i), we get 


M (t), t) -exo[{(* aa IG = 1) + es + a} | 
: 2) .(n-1) + ana Ried 
= exp g ye 5 2S -) x exp| Ei? (Fs L) 02] 
| z 


; (On simplification) 
= M(t). M(t2)* 


= (@), X and X;-X;i= 1,2,...,n are independently, distributed ... (vi) 
ad = (b) X ~ N.(p, 62/n) ; r - .-. (vii) 
and "X;-X ~N (0." 0?) ...(viii) 
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Since X and X;-X ;i= 1,2, ..., are independently distributed, 
X and 3? =! = -Xy (viii a) 
i=1 


are independently distributed. is rn! 
To derive the distribution of s2, we note that : 


pb (X; -p)? -> (X; —X+X- ju)? ; 


-> (X;- XP +n (X-p), 


the product term vanishes since _ x (X;-X)=0. 


n -— pt \2 (X; x) X—- WU, “s 
z a) 2 Bea (5.9) 
=> V =< W+Z, *  ,..(ix.a) 


where V = >> 


ial 


standard normal variates is a ~?(,) variate. Hence _ 


2 
a , being the sum of squares of n independent 


My) =(1 -22)° s It 1 < 1/2, : .-(X) 
Also X=N (;, 62/n). => X - H NO. 1) ' 
on 
—_ 2 
pref 
= M(t) = (1 -28)712 Qa) 


Further, since X and s? are independent, [see viii (a)], W and Z’are 
uMepiceny distributed. 


Mt) =My.z) =Med) . M20) 
(.- W and Z are independent). 
=> (1-21)*? = My(t) (1 - 20-1? (From (x) and (x)] 
=> My) = (1 -2)-"- Sith < 1n 


which is the m.g.f. of 1?-variate with (n — 1) d.f. Hence by uniqueness theorem’ 
of m.g.f. 


* (X;-X)?_ ns? 
W = 2 (Xi a = me Xn -1) + (adi) 
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Remarks 1. p.d/f. of the sample variance s?=n- 1 (X;- XY. 
iwl 


Since >» (% — Xo nso 
inl oe) 3 
is a y2-variate with (n — 1) d.f., we have ¢ b 
1 -agne (ns? Fo+_, 
dP(ns*/o*)_ = ———————-—- .e ( = | 2 ~*d(ns*/o?) 
20-D2TT(n - 1)/2] o 
(n= 1)/2 
=> dP(s*) - “2h eee (52) 92 G52 Qc 52 < 00, 
Vij - 1)/2) 
ase 
2. We have oo” 7,1 
ns2 
E (= =n-l 
=> 3 Es) =(n-1) : 
n-1)_, ~1).2 0 
=> E(s)= , io = ] 5° ~ 0”, for large n. »o(*) 
‘ag? 
Also Var. oe = 2(n-— 1) 
n? ae 
=> ot Var (s”) = 2(n— 1) t 
=> Var (s?) 22 ( = 7 ot fr largen.. —...€**) 
n n nr mk 
= S.E(st) =02x Vn -(***) 


Theorem 13-6. Let X;, @ =,1, 2, ..., n) be independent N(O, 1) variates, 
Then the conditional distribution of x? = 2.x i, subject to m (< n,) (say), 
iol 


independent homogeneous linear constraints viz., 


C11X1 + €12X2q +...... + €1,X, = 0 
C21X1 + 22X27 +...... aXn,=0 
. \ + C22 - t C2 (13-12) 
CmiX1 + CmoX2 + osecee 7 CmaXn= 0 
is also a*y?-distribution with (n — m) degrees of freedom. 
Proof. Equivalently, the constraints (13-12) can be expressed as 
11X14 + A 2X2 +...... + 41,X,=0 
4>1X1 + An2X> +...... + a2,X, = 0 
a ae (13-12) 


4 
° 


Ami X1 + Om2X2 Viescess + OmaXn =-0 
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where @; = (4;1, 2j2, ..-» Gin)} i = 1,2, ...,m are m unitary, mutually orthogonal 
vectors. 

Let us now transform the. variables 

(X1, Xo, ee, ey. rr rrr, OF ae (Y1, | Car Gare ar pares Yn) 
by means of a linear.orthogonal transformation 


Y=AX ...(13-125) 
Y, ay) Q\2 eae Ain. xX, 
Y> Q) an ooo Dy, : Xo 
Y=. ) ea , As Qn 20 eee Onn and X = Xm 
Ymei Grvi,1 FOmel,2 +++ Ameiva Xmol 
Yn Ons oY Xn 
(13-125) implies that the constraints (13-12a) are equivalent to 
Y,=0, (=1,2,...,m) ,..(13-12c) 


By Fisher’s Lemma (Theorem 13-4) Y,, (i = 1, 2, ..., 2) are also independent 
N(O, 1) variables and 


2X? = 2 ¥? [-.- Transformation (13-25) is orthogonal] 


tal 


= ) ¥Y? [Using (13-12c)] 


toms 


Thus the conditional distribution of 2 X;? subject to the conditions 
i=l 


(13-12) is same as the unconditional distribution of | »y Y;?; where Y; 


sam + 
. ) eae 
(i=m+1,m +2, ...,.2) are independent standard normal variates without any 
constraints on them. Hence 


x? = by X? = » Y,;2, 


i=l iam-+i 


¢ 


being the.sum of squares of (7 ~ m) independent standard normal variates follows 
x2-distribution with (n —,m) degrees of freedom. 


Example 13-1. (a) Show that for 2 df. the probability P of a value of x 
greater than Aor is exp (- + x0"); and hence that 


Xo% = 2 log, (LIP) 
Deduce the value of Xo” when P = 0-05. 
[Sardar Patel Univ. B.Se., 1901) 
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(b) Given different probabilities P,, P>, ..., P, obtained from n independen 
tests of significance, explain how you will pool them to get a single probability 
in order to decide about the significance of the aggregate of these tests. 

[Delhi Univ. B:Sc. (Stat. Hons.), 1990) 

Solution. (a) The p.d.f. of y2-distribution with 2 d.f. is 


fH) = pareaay exp. (-17/2).(x7)@) =! 


an2d 


=5 exp (-42/2),0$ 42 <ee 


P =PQ2>x)={  Lexp(-x32) dr? %) 
o 
2 J | ep) |" = exp (52/2) 
-t iz 
log, P =-X/2 


=> Xo? = -2 log, P =2 log, (1/P) 
When P =0.-05, we get %o* = 2 log, 20 = 3-012 


Remark. The value Yo” of x? defined in (*), is known as the significant or 
critical value [c,f. Remark 4, to Theorem 13-3, page 13-15] of 2 corresponding 
to the probability level P. Thus if P is the significant probability, then 


x? = -2 log, P =2 log, (1/P) 441313) 
is a x?-variate with 2 d.f. 


(b) -2 log, P; (f= 1, 2, ..., n) are independent y?-variates each with 2 df. 
(cf. Remark above and the fact that P;’s obtained from independent tests of 
significance are. independent). Hence by additive property of een 
distribution 


1 : 
2 = E -2 log, P) = 2 log, {| =——— (13-134 
his )=2108.| 5 pp, (13-134) 
is a chi-square variate with 2n d.f. 
If x? > x05 for 2n d.f., then we conclude. that the pooled result (aggregate 


of the tests) is significant at 5% level of significance. 


Example 13-2. (Pearson's P,-Statistic). The variables X,, Xo, «...X_ are 
independently distributed in the rectangular form 


dF =dx,0Sx<1 
Then if P. = X; X>...X,, Show that -2 log,P has %?-distribution with 2n 
degrees of freedom. (Aligarh Univ. B.Sc., 1992) 


Solution. -—2log,P =-2 log, (X; Xo, ... X,) 
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=O: + G+... +65 2 Cis 


where £;=-2logX; => X;=exp (-&,/2). 
The probability function of €; is given by 


aE) =f) 
Since dF(x) =dx, ffx) =1V xin (0, 1) 
{1 
gt&)=1. | exp(- 8/2) x ~ 5)] =pexp 2 
which is the spa function of %2-distribution with 2 4.f. 


. (= ., k) are independent X?-variates each with 2 d.f. Hence by 
additive lt 2 i? -disribution, 


210, P= % E:, ; 


is a y?-variate with 2n df. 

Remark. The significance’ of’ this result lies in testing of hypothesis as 
explained in Example 13-1. 

Example 13°3. Show that if v is even, 


1 oo 
P=20-DRr~yp f exp (-X2/2) x"- \dy 


neon + (%2/2) + Ls xq t+.* ra=| 


and hence the values of P for a given ¥2 can be derived from tables of Poisson's 
exponential limit. 


Solution. Let us consider the incomplete Gamma: integral 


ee gees 
bal e~'t" dt, 


where 7 is a positive integer. Integrating by parts, we get 


etn {@ a ee e7 
— "+ a, ett dt-= +1, _ 1 


ae r! 


which is a reduction formula. eae application of this gives 


[= ot Ps a Bel Sat +...4+ eB? e*B 
ro ort *Or- Ibi" 2! 1! 


But = | etdt= 
B 


+I 
~ em! | i =e-B 
B 
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1 = —1 Br : 
pt e’’' dt=e ji+B +24 ot ST 
Putting: B = 2/2 and r=" 223 1, (since r.is an integer, v =2r + 2 must 


be even), we get 


Wt (2) -1d¢ 


ee (See 
{(v/2) — 1) 2h 
_ : x2 ral ae 2 
= exp ( 7/1 +5 +944 246 tab! 0 
Taking t = 7/2 in the integral on the L.HLS., ‘we. get 


] ag : : 
L.H.S. = {od -)! f ; exp (2/2) (7/2) *2! d(x2/2) 


“TOR f , exp (-y7/2) x"! dy, -(**) 


From (*) and (**), we-get the required result. 
Let the given value of x? be 72, then 


1 oe 
P=PQ?> 4X0) =F0-DA P(/dy I exp (x22) x7 dy 


2 4 v—-2 
7 ey ee aes Ce ——Xo 
w exp (x9)|1 F224 te 24. 2) 


y 
wz a3 y27 | 
—_ —A ing auwe- ~~ —w« ee 


where X =X%?/2. 
2 

The terms on the right hand side viz.,e-*, A e-4, + e->...etc. are the 
successive terms of the Poisson distribution with parameter A = X7/2. 

Hence the result. 

Example 13-4. If X and Y are independent normal variates with means 
Ly, [lp and variances 6,”, 6,” respectively, derive the distribution of 

Z=(X = Wy )/(Y - Wo). 

What is the name of the distribution so obtained ? Mention one importan 
property of this distribution. F 
(X-)? — of _, [X-n)/o] 


Solution. Here Z? = (You)? => g2°°* [cy — ) jel 


But {(X —2,)/0,}? and {(Y — 1,)/6,}2, being the squares of independent 
standard normal variates, are independent x?-variates with 1 d.f. each. 
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0,722 
Thus “ear ; 
d.f. is a By G, 3) variate (c.f. Theorem 13-1). 


Hence its probability differential is given by 


being the quotient of two independent y?-variates each with J 


2 22 ep. 2\(1/2) - 1 
az ea Cory Beaks W mee d(o,2z2/0,2) 
B35 (a4 )2 2 
G,? 
1 
1,1 oie 2- 
Ms+ sa G2 
“Tere (, a or? 
2: 2 ( + ar) 
_~—_ 9192s Qe 52 = ” 
= Toba otgy Ut OK 20 [--Pd2) = Vx) 
Thus the probability differential of Z ts given by 
eee) 7 ae as 
aF(z) =fa)de = TG 2 2+ 0,222) dz, -~<z< 
If 6, = 02=1, . it conforms to standard Cauchy distribution, 
] dz s => 
dF(z) = - ‘42’ +2) —o<z< 
(For its properties c.f. Chapter 8). 
Aliter zg aol 
Y- 2 
=> 27 = ( ~ Hi Joy 
(Y ~ W2)/o2 


Now = Z, being the ratio of two independent standard normal variates is a 
GO} 
standard Cauchy variate 


d sa 
a a (Ge) = 
OC}. 
RK + Gall 
Oj 
0192 


“Ha? + ogee Mh TES 
Example 13-5. X;, i= 1,2, ...,n) are independently and normally 
distributed with zero mean and common variance 0°. 
a n 


Let €;= »y cyX;,; i= 1,2, ...,n, where 2; cy cj = Oi; 
j » ] j =} 
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where §,; is Kroneker delia. Show that 


2, ae z 7 . 


is distributed as y? -variate with'(n — p) degrees of freedom. 
- [Delhi Univ. M.Sc. (Stat.); 1990] 
: 


Solution. Since §,,;’ = x Ci Ci; : 
jel’ 


is a Kroneker delta, we have 
= , {0,84 i 
> Ci CEZ= 
jel | 
i.e., X;’s are transformed to &;’s by means of a linear orthogonal transformation. 
Hence by Fisher’s Lemma, €;, @ = 1, 2, ..., 2) are independent normal variates 
with zero mean and common variance 07. 
Since the transformation is orthogonal, we have 


A a 
dL xz = 2 & 


“ és] izl 


p a -. ; 
2, x? - 2, | 2 Si? z Si >» E? 


lizi’ 


-_> 


On ee 
3 : eo * @ 
: p> ate = 
Now oar al = Eso), 


being the sum of the squares of (n ~ p) independent standard normal variates is a 
x?-variate with (n - p) degrees of freedom, Hence the result. 


Example 13-6. Show that the m.gf. of Y = log x?, where x? follows 
chi-square distribution with n df., is given by 


My(t) = 2! rf; + ‘) va T(n/2) 


If x? and 2? are independent y?-variates each with n.d f. and U = y,7/y%27, 
deduce that for positive integer k, 


EU) = (a+ k)P (5- A [r( 2 i 


Solution. y= logy? => y2=e” =. dy2=e dy. 
The probability differential of x? viz., 


: 1 f- 1, 
dP) = Raye WO)? aX, <2 <0 
transforms to 
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ny 
~e% + =| dy, -2 < yp < oo 


dG(y) = ann aD exp |~! 7° +%5 


M Xt) Sea) {- exp |- der + z + y| dy. 
‘mae |. ecg, (2z = eY) 
“v5 |, e~ oe dz 
-2r(2+1) /r(2) 0) 
EU) =E lea =E exp {oe (i) 


Ele! log x, - k log x2? | 


= E(eé log x1") _E (.-* log x2”) 
[-. 12 and x2? are independent] 


= Mog 12 ® Mog x2 *) 
2 (4 + k) 2p (2-«) 
pee eee ee [From (*)] 
T/2) T(n/2) 


-r(f+4)r(-«)/[r()] 


Example 13:7. /f X is chi-square variate with n.d@f., then prove that for 
large n, V 2x ~N (V2n J) [Delhi Uniy. B.Sc. (Stat. Hons.), 1989] 
Solution. We have E(X) =n, Var (X) = 2 
Z= zak 09 N(O, 1), for large. 1. 


Ox 2n 


Consider, 


* J =P(X<n+z2V2n) 
= P[V2x <(2n + 22 V 2n)!2] 
-4N2x s Vin (1 +24)? x) | 
zt 


=| Vax 5 V0 (1 + i+ “i 
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= P (Vox < V2n + 2], for large n. 
a PIN2x - V2n $2], for large n. 
Since for large n, (X - nN 2n ~ N(O, 1), we conclude that 
2x -V2n ~N(0, 1) for large n. : 
= _—-V¥2X is asymptotically N(V2n, 1). 
Remark. This approximation is often used for the value of n larger than 


30. This result does not reflect anything as to how good the approximation is, 
for moderate values of n. R.A. Fisher has proved that the approximation is 


improved by taking V (2n — 1) instead of V2n . A still better approximation is 
(2/n)"?~ N ( -2.= 
Example 13-8. For a chi-square distribution with n df. establish.the 
following recurrence relation between the moments : 
rer = 2r(U, + A, 1), 72 1. 
Hence find 8, and pb. 
[Delhi Univ.B.Sc. (Stat. Hons.), 1991) 
Solution. If X ~ 77;,) then its m.gf. about ozigin is 
M,(t) = E(e®) = (1 -21)-*2: ¢ < 4 ..(*) 
Also E(X)=n= 1 (say). 
Hence m.g.f. about mean, say, M(¢) is 
M(t) = My_ y(t) = E(e%-) =e -™, E(e%) 
=e-™% (1-2) 
Taking logarithms of both sides, we get 


log M(t) = —nt - * log (1 — 2f) [Using (*)] 
Differentiating w.r. to t, we have 
M%® n 2 2nt 


M() ~~ "2° =2) (1-2) 
= (1 —2t) M() = 2nt M(t) 
Differentiating r times w.r. to t by Leibnitz theorem, we get 
(1 — 21) M™*!(0) + r(-2) M(t) = Ant °4'"(t) + 2nr M7“) 
Putting ¢ = 0 and using the relation, 
br =[ ge MO], _ = MO). we get 


Mei - 2r py, =2nr py 
=r Mrai = 2r (UU, + mp,,), 721. 
Substituting r = 1, 2,3; we get 
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Py = 2p =2n 
Hs = 4(H2 + mp))-= 8n (+. 4, = O and po = 1) 
[ly = 6(fl3 + Ay) = 48n + 12n? 
_ Hs’ 8 143,12 
B, ibe A and Boi ada 


EXERCISE 13(a) 
1. (a) Derive the:p.d.f. of chi-square distribution with n degrees of freedom. 
(b) If X.has a chi-square distribution with n d.f., find m.g.f. My(#). 
Deduce that : 
() pw, = EX’ = 2’ T[(n/2) +r] / F(n/2). 
(ii) k,=rth cumulant =n 27! (7-1)! 
(iii) k, k, = 2kp*, 2B. = 3B, -6=0. 
2. If X ~ X2¢q) » show that : 
(i) Mode is at x =n — 2. 
(i) The points of inflexion are equidistant from the mode. 
Hint. Points of inflexion are at x = (nm ~ 2) + (2(n — 2)]'!2 
3. If X ~ 77m), obtain the m.g.f. of X. Hence find the m.g.f. of standard 


chi-square variate and obtain its limiting form as n —> oo. Also interpret the 
result. 


4. (a) Let X ~ N(O, 1) and Y = X2. Calculate E(Y) in two different ways. 
Ans. E(Y) = 1. (Use Normal distribution and chi-square distribution). 
(b) Let X, and X2 be independent standard normal variates and let 
Y = (X, -X;)*/2. Find the distribution of Y. 
Ans. Y ~ X71). 
5. If X,, Xz, ...,X, are i.i.d. exponential variates with parameter A, prove 
that: 


2A x X; ~ X72) 

6. (a) If X ~ U [0 1], show that -2 log X-~ x%2.9. 

Hence show that if X,, Xo, ..., X, are iid U (0, 1] variates, and if 

P =X, X2... Xj, then — 2 log, P ~ ¥7(2) 

Hint. Find m.g.f. of — 2 log X. 

(b) If X,, Xz, ...,X, are independent random variables with continuous 
distribution functions F,, F2, ..., F, respectively, show that 

~2 log (F1(X1) . Fo{X2) ... FX] ~ X72) 
Hint. Use F(X) ~ U (0, 1] and Part (a) above. 


_ 17,(a)1etX and Y be two independent random variables having chi-square 
distributior: with degrees of freedom m and n respectively. 
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_X__ 
X+Y°* 

(ii) When m = n, show that the distribution of U is symmetrical about : 
Hence or otherwise derive the rth moment about the mean of U when m= n. 

(iif) Deduce the distribution of U when m = n= 1. ‘ 

(b) If X and Y are independently distributed chi-square variates with m and n 
degrees of freedom respectively, show that U = X + Y and V = X/Y-are 
independently distributed. ; (Gujarat Univ, B.Se., 1992) 

(c) If x,2 and x2? are independent y? variates with n, and'n2 degrees of 
freedom respectively, then show that : 

(i) x2 = x1? + Xz" is a x?-variate with (n, + nz) degrees of freedom 


2. mn , 
w= Es is ap, (3, a variate 


@) Obtain the distribution of U = 


[Delhi Univ. B.Sc. (Maths. Hons.), 1987] 
8. If X; @= 1,2, ...,n) are n independent normal variates with zero means 


a aA 
and unit variances, show that 2, X; and 2, (X ,; — X)* are independently 
8 a= i= 


distributed. 
Hence or otherwise obtain the distribution of . 
2X; 
U . i=l 


9. (a) Prove.that _ is distributed like 2 with (n — 1) degrees of.freedom, 


where S2 and o? are the variances of sample (of size n) and the population 
respectively. (Burdwan Univ. B.Sc. (Maths.) Hons.), 1992) 

(b) Let X/o,? and Y/a,7 be two independent chi-square variates with n and m 
degrees of freedom respectively. Find an unbiased estimate of (0,/0,)* and find 
its variance. Show that X/Y and (X/o,”) + (Y/o.*) are independently distributéd. 
Name the distributions of X/Y and (X/0,7) + (Y/o,”). 

10. X denotes the random variable with chi-square distribution having n 
degrees of freedom. Show that for suitably chosen constants a, and b,, the 


, X-a 
moment generating function of i ‘ends to that of the standard normal 
a 


distribution as nm —» o. From this what would you conclude about the 


behaviour, for large n, of P f ; Fe y ) ? 


A ] 
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11. Show that 


oo —Adr 
Plata s22 2a) =o [ €- y"dy = rex A 


Explain the uses of this result.[Déelhi Univ. B.Sc. (Stat. Hons.), 
1990) 
12. X is a Poisson variate with parameter A and x? is a chi-square variate 
with 2k d.f. Prove that for all positive integers k, 
Sk-—1) =P{xy2> 2A) 


Hint. P(x? > 2A) = FET (k) a i exp (-5 x) (y2)*-! ay? 


l t-] 
“ami J, 77 '4 
1 ee Oo nae seis 
“gin be*+(k-1) f e-Y y# > ay} 
By repeated integration, we get the réquired result. 
13. Let X,, Xo, ..., Xm, and Y;, Yo, ..., Y, be independent random samples 


from a normal population with mean zero and variance 07. Let their means be X 


and Y and their variances be Sy” and Sy” respectively. 
Let the pooled variance S,” be defined as : 
52 (in — 1)Sx2 + (n — 1) Sy? 
(m+n — 2) 


Prove that (X — Y) and (m+n —2)S '>*/? are independently distributed, the 
former as a normal variate with zero mean and variance o? {(1/m) + (1/n)} and 
the latter as a chi-square variate with (m + n — 2) d.f. 

[Nagpur Univ B.E., 1992} 

14. If X is a random variable following Poisson distribution with 
parameter A, and A is also a random variable so that 2a is a chi-square variate 
with 2p degrees of freedom, obtain the unconditional distribution of X. Give the 
name of this distribution and find its mean. 

15. X,, Xo, and X3 denote independent central chi-square variates with v,, 
v, and v3 d.f. respectively. 

(i) Show that X,/(X; + X2) is independently distributed of 

(X, + XX + Xo + X3). 
(ii) Obtain the joint density function of the distribution of 
X= XX, + X2 + X3) and Y= X(X, + Xo + X3) 

(iti) Hence or otherwise obtain the mean and variance of X and Y and 
Cov (X, Y). r 

16. Prove that each linear constraint on (f,), i= 1,2, ..., n reduces by 
unity the number of degrees of freedom of the chi-square, 
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2. (fi; -€))7/e;), 
~ where e; = E(fj). 
17. If X follows a chi-square distribution with n d.f. so that E(x) = n and 


V(X) = 2n, prove that (X - n)/V2n is aN (0, 1), for large n. 
18. If ydx is the probability that X lies between x and x + dx and if y is 
given by the solution of the differential equation 


dy _ y(a-x) 
dx bx+ec 


show that, (for suitable values of the constants a, b and c), a certain linear 
function of X has the x?-distribution with n degrees of freedom, where 


n=2(1 +o + 5) 


19. If X, and Xz are independently distributed, each as y? variate with 
2 d.f., show that the density function of Y = +(x 1 ~X>) is 


gy) =4 el! wo cy<en, 


20. If X and Y are independent r.v.’s having rectangular distribution in the 
interval (0, 1), show that 


U =V-2 log X cos 2xY and v=V-2 log X sin 2nY 


are independently distributed as N(0, 1). Hence show that U2 and V2 are 
independently distributed as ?-variates, each with 1 4.f. 


‘ 1 _ouv) 2% 
Hint. Jax) x 


and u*+v*=-2logx => x=exp E  (u? + | 
21. Find the p.d.f. of x, = + Vx42, where x,2 is a y2-variate with n d.f. 
and show that 
is ~ orn Li(n + n/2} 
Hence establish that for large n, 
3 EQ”) = (E(x)? 


(Hint. £(y,2)=n and E(y,) =)’ = 2!2 ta 


Now use ait) = n*, for large values of n. [c.f. Remark to § 14-5-7] 
22. Let 


x 
P J wlt-2)2, e-w2 dw, x >0. 


1 
** 2 T(n/2) J 
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Show that 


x< a 
Ls 


ae 
[Delhi Univ. B.Sc. (Stat. Hons.), 1990) 
23. Let X;, Xz, ....X, be a random sample from N (1, 6), and k be a 
positive integer. Find E(S?4). In particular, find E(S*) and Var (S?). 


[s* = (n - 1)! z (X; - xX)? - X=n7! 2 x, 


rhk+ 7 = 
202 ( 2 
Ans. E(S) = ( | ;k>O,n>1 


E(S*) = 0? , Var (S*) = 204/(n — 1). 

24, Let X, and X, be independent random variables, each N(O, 1). Find the 
joint distribution of Y, =X,? + X,? and Y, = X,/X2. Find the marginal 
distributions of Y, and Y,. Are Y, and Y, independent ? 

Ans. Y; ~ X22) and Y2 is standard Cauchy variate. Yes. 

25. Let X, and X2 be independent standard normal variates. Let 

Y, =X, + X2 and Y, = X,? + xX,*. 
(i) Show that the joint m.g.f. of Y, an Y, is: 


pare Sees af oI. i 
M (¢, f2) = 1 ~ 21, OP Fel -O<h < 9,-m<h <5 


(ii) Hence or otherwise, show that 
Y, is a normal variate and: Y> is a chi-square variate. 


(iii) Are Y, and Y; independent ? If not, find the correlation coefficient of 
Y, and Y>. 


[Delhi Univ. B.Sc. (Maths. Hons.), 1989] 
Ans. (Y,,¥2) are not independent. p(Y,, ¥2) = 0. 


26. If X,,X>, ...,X, are independently and normally distributed with the 
same mean but different variances 67, 637, ..., 6,2 and assuming: that 


& (X;/67) 2 [(xX;- U)? 
a7 d (1/6;”) ae ve >| or | 


are independently distributed, show that U ~N (0, 1/(26;7)) and Vhas x? 
distribution with (n — 1) d.f. 


27. If X,, Xz, ...,X, is a random sample from N (1, 6”), find the mean 
and variance of 
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A = if2 
§= 2, - X)2/(n - »| 
[Delhi Univ. B.Sc. (Maths. Hons.), 1988) 


28. Let X;, Xz, ....X, be a random sample from N (0, 1). Define : 


, 
A » 


x. 


— y & os 
X= 2% sie Xn-k= ak ink+l 


(a) What is the distribution of 3 (X,+Xq-4)? 


(b) What is the distribution of k X,2 + (n —k) X2,_4? 
(c) What is the distribution of X?/X7?, i #j? 
-@) What is the distribution of X;/X;, i #j 7 


Ans. @N (0. anh (6) x72) 


(c) p(3 : ,] or F(1,1) [See § 14-5} 
(a) Standard Cauchy distribution. 


OBJECTIVE TYPE QUESTIONS 


I. Choose the correct answer from B and match it with each item in A. 


A B 
(a) ®, fora chi-square distribution (1) (1 —2i0) 2 
(6) B, for a chi-square distribution (2) 8/n 
(c) Mean for a chi-square distribution (3) 2n 


(@) Variance for a chi-square distribution (4) (1-21) -"2 
(e) Characteristic function for x? distribution (5) (12/n) + 3 


(f) Mode of y?-distribution (6) V2In 
(2) M.G.F. of y?-distribution (7) n 
(h) Skewness of y2-distribution (8) (n—-2) 


State which of the following statement are Truc and which are False. In 
case of false statements, give the correct statement. 
({) Normal distribution is particular case of 1?-distribution for one 
d.f. 
(ii) For large degrees of freedom, chi-square distribution tends to 
normal distribution. 
(iti) The sum of independent chi-square variates is also a chi-square 
variate. 
(iv) For the validity of x2-tcst, it is always necessary that the sample 
observauons should be independent. 
(v) The chi-square distribution maintains its character of: continuity 
if cell frequency is less than S. 


- 
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(vi) Each linear constraint reduces the number of degrees of freedom 
of chi-square by unity. 
(vii) In a chi-square test of goodness of fit, if the calculated value of 
x? is zero then the fit is a bad fit. 
Jil. Mention the correct answer : 
(@) The mean of a chi-square distribution with n d.f. is 


(a) 2n, (b) n?, () Va, @n 
(ii) The characteristic function of chi-square distribution is 
(a) (1 — 2 it)*2, (b) (1 + 2 it)*2, (c) (1 -2 it) ™ 
(iii) The range of ?-variate is 
(a) -— to +00, (6) 0 toc, (c)0 to 1, d)-- to 0. 
(iv) The skewness in a chi-square distribution will be zero if 
(a)n—»co, (b)n=0, (c)n=1, dn<O0 
(v) The moment generating function of a y2-distribution with n. 
degrees of freedom is 
(a) (1~1)-*?, (6) (1-22), (c) Coe (d) (1 - 22"? 
(iv) Chi-square distribution is 
(a) Continuous, (5) multimodal, © symmetrical. 
IV. Mention some prominent features of the chi-square distribution with n 
degrees of freedom. 
V. IfX and ¥ are independent random variables having chi-square 
distribution with m and n degrees of freedom respectively, write down 
the distributions of () X + Y, (i) X/Y, (ii) XIX + Y). 
VI. @ For how many degrees of freedom does the x?-distribution reduce 
tO negative exponential distribution ? 
(b) Give.an example of two independent variates none of which is a 
chi-square variate, although their sum is a chi-square variate. 
13-7. Applications of Chi-square Distribution. 7?-distribution 
has a large number of applications in Statistics, some of which are enumerated 
below : 
(i) To test if the hypothetical value of the population variance is 6% = do? 
(say). 
(ii) To test the ‘goodness of fit’. 
(iii) To test the independence of attributes. 
(iv) To test the homogeneity of independent estimates of the population 
variance. 
(v) To combine various probabilities obtained from independent 
experiments {o give a single test of significance. 
(vi) To test the homogeneity of independent estimates of the population 
correlation coefficient. 
In the following sections we shall briefly discuss these applications. 
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13-7-1. Chi-square Test for Population Varian¢ée. Suppose we 
want to test if a random sample x;, (i = 1, 2, ..., 2) has been drawn from a 
normal population with a specified variance 62 = do’, (say). 

Under the null hypothesis that the population variance is 0? = Go’, the 
Statistic 


42 = bs [i-3¥ = | % x2 - a =nst/do? —...(13-14) 


follows chi-square distribution with (n — 1) d.f. 

- By comparing the calculated value with the tabulated value of x? for (n - 1) 
d.f. at certain level of significance, (usually 5%),. we may retain or reject the null 
hypothesis. 

Remarks. 1. The above test (13-14) can be applied only if the 
population from which sample is drawn is normal. 


2. If the sample size n is large (>30), then we can use Fisher’s 


approximation 
aye N(N2n -1, 1) 
i.e., Z= V2y2 -V2n-1~N (0, 1) . (13-14) 
and apply Normal Test. 


3. For a detailed discussion on the significant values, (critical values), for 
testing Ho : 6? = Op? against various alternatives : {i) 62 > G92; (ii) a? < do? 
and (iii) 0? # Go*, see Remark 1 to § 16-74. 


Example 13-9. It is believed that the precision (as measured by the 
variance) of an instrument is no more than 0-16. Write down the null and 
alternative hypothesis for testing this belief. Carry out the test at 1% level, 
given 11 measurements of the same subject on the instrument : 

2:5, 2:3, 2:4, 2:3, 2:5, 2:7, 2:5, 2-6, 26, 2:7, 2:5, 

[Calicut Univ. B.Sc. (Main Stat.), April 1989) 

Solution. Null Hypothesis, Ho : 07 = 0-16 

Alternative Hypothesis : H, : 07 > 0-16. : 

COMPUTATION OF SAMPLE VARIANCE 


x -%)? 


0-0001 
0-0441 
0-0121 
0-0441 
0-0001 
0-0361 
0-0001 
0-0081 
0-0081 
0-0361 
0-0001 


ee Ea: 


< 


2 [YH HOYHHHYNHOYNY 
AlnIrannnarnaw awn 


Pel 
T 
tr 
> 
u 
LS) 
= 
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Under the null hypothesis Hy : 0? = 0-16, the test statistic is : 


which follows x2-distribution with df (11 -— 1) = 10. 

Since the calculated valuc of y? is less than the tabulated value 23-2 of x? 
for lO df at 1% level of significance, it is not significant. Hence fg may be 
accepted and we conclude that the data are consistent with the hypothesis that the 
precision of the instrument is 0-16. 

Example 3:10. Test the hypothesis that 6 = 10, given that s = 15 fora 
random sample of size 50 froin a normal population. 


Solution. Null Hypothesis, Ho: 6 = 10. 
We are given n= 50, s=15 


Since # is large, pio the test statistic is 
2v2 — V2n-1 ~N (0, 1) 


Now, 1 ~ ale = 15 —9-95 = 5-05 

Sine |Z|> 3, it is significant at all levels of significance and hence Ho is 
rejected and we conclude that o # 10. 

13-7-2. Chi-square Test of Goodness of Fit. A very powerful test for 
testing the significance of the discrepancy between theory, and experiment was 
given by Prof. Karl Pearson in 1900 and is known as “Clii-square test of 
goodness of fit.” It enables us to find if the deviation of the experiment from 
theory is just by chance or is it really due to the inadequacy of the théory to fit 
the observed data. 

If O;, (= 1, 2, ..., n) is a set of observed (experimental) frequencies and E; 
(j= 1, 2, ..., 2) 1s the corresponding set of expected (theoretical or hypothetical) 
frequencies, then Karl Pearson's chi-square, given by 


n O:;-E. 2 n n 
ro = — ’ (2 O; = 2 E; ) .. (13-15) 


follows chi-square distribution with (2 — 1) df. 


Remark. This is an approximate test for large values of n. The 
conditions for the validity of the y2-test of goodness of fit have already been 
given in § 13-5 on page 13-15. 

Example 13-11. The following figures show the distribution of digits in 
numbers chosen at random from a telephone directory : 

Digits : 0 2 3 4 5 6 7? 8 9 Total 


Frequency: 1026 1107 997 966 1075 933 1107 972 964 853 10,000 
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Test whether the digits may be taken to occur equally frequently in the 
directory. [Osmania Univ. M.A. (Eco.), 1992] 


Solution. Here we set up the null hypothesis that the digits occur 
equally frequently in the directory. 


Under the null hypothesis, the expected frequency for each of the digits 
0, 1,2, ..., 9 is 10000/10 = 1000. The value of x? is computed as follows : 


CALCULATIONS FOR x? 


Observed Expected . 
Frequency Frequency (O -E)* (O- E)*1E 
(O) (E) 


0 
1 
2 
3 
4 
5 
6 
7 
; 


ee ae oe 
Y= oe = = 58-542 


The number of degrees of freedom = 10 — 1 = 9, (since we are given 10 
frequencics.subjected to only one linear constraint © O = ¥ E = 10,000). 
The tabulated 72.95 for 9 d.f. = 16-919 


Since the calculated 2 is much greater than the tabulated value, it is highly 
significant and we reject the null hypothesis. Thus we conclude that the digits 
are not uniformly distributed in the directory. 

Example 13-12. The following table gives the number of aircraft 
accidents that occurs during the various days of the week. Find whether the 
accidents are uniformly distributed over the week. 

Days .-« Sun. Mon. Tues. Wed. Thus. Fri. Sat. 
No. of accidents ... 14 16 8 12, «11 9 14 
(Given : the values of chi-square significant at 5, 6, 7, af. are respectively’ 
11-07, 12-59, 14-07 at the 5% level of significance. 

Solution. Here we set up the null hypothesis that the accidents are 
uniformly distributed over the week. 

Under the null hypothesis, the expected frequencies of the accidents on each 
of the days would be : 

Days - Sun. Mon. Tues. Wed. Thus. Fri. Sat. Total 
No. of accidents ... 12 12 12 12 12 12 12 84 
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2 (14 — 12)2 (16 — 12)? (8 — 12)2 (12 — 12)? 
ne 7 i | ain | ae | 


(11 — 12)? (9-12)? (14 - 12 
+"? * 2 T' 


a2 (4 +16 +16+0414+944)= 20 


= 4.17 
The number of degrees of freedom 
= Number of observations — Number of independent constraints. 
=7~1=6 
The tabulated 720.95 for 6 d.f. = 12-59 
Since the calculated x? is much less than the tabulated value, it is highly 
insignificant and we accept the null hypothesis. Hence we conclude that the 
accidents are uniformly distributed over the week. 


Example 13-13. The theory predicts the proportion of beans in the four 
groups A, B, C and D should be 9: 3 : 3: 1. In an experiment among 1600 
beans, the numbers in the four groups were 882, 313, 287 and 118. Does the 
experimental result support the theory ? [Agra Univ. B.Sc., 1991] 

Solution, Null Hypothesis : We set up the null hypothesis that the 
theory fits well into the experiment, i.e., the experimental results support the 
theory. 

Under the null hypothesis, the expected (theoretical) frequencies can be 
computed as follows : 

Total number of beans = 882 + 313 + 287 + 118 = 1600 

iaiachtess eae the ratio9: 3:3: 1 


E(882) = =x 1600 = 900, eae 1600 = 300 
E(287) =2 x 1600'= 300, E(118) = 7x 1600 = 100 


2 =| @= =H] 
_ 900)? 313-300 air 287 — 300)? | 118-100)? 
300 300 100 
= 0- ae + 0-5633 + 0-5633 + 3-2400 = 4-7266 
df.=4-1 =3, and tabulated 720s for 3 d.f. = 7-815 

Since the calculated value of 7? is less’ than the tabulated: value, it is not 
significant. Hence the null hypothesis may be accepted at 5% level of 
significance and we may conclude that there is good correspondence between 
theory and experiment. 

Example 13-14. A survey of 320 families with 5 children each revealed 
the following distribution : 

No. of boys : 5 4 ke 2 I 0 

No. of girls : 0 | 2 3 4 bp] 

No. of families: 14 56 110 88 40 12 
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Is this result consistent with the hypothesis that malé and female births are 


equally probable ? 
Solution. Let us set up the null hypothesis that the data are consistent' 


with the hypothesis of equal probability for male and female births. Then under ° 
the null hypothesis : 


p = Probability of male birth = : =9g 
p(r) = Probability of ‘y’ male births in a family of 5 A 


Ce) 


The frequency of r male births is given by : 
fir) =N. ptr) = 320x( * «(5 J 


5 
= 10 x/ ; ) ai) 


Substituting r = 0, 1, 2, 3, 4 successively in (*), we get the expected 
frequencies as follows : 
f0) =10x1 =10, ft) =,10 x 5C, = 50 
fQ2) =10x5Cz = 100, KG) = 10 x 5C3 = 100 
f4) = 10x5C, = 50, KS) = 10 x 5Cs = 10 


CALCULATIONS FOR 7? 
Observed Expected 


See 


Tabulated 720.95 for 6 ~ 1 = 5 d.f. is.11-07. 

Calculated value of x? is less than the tabulated value, it is not significant 
at 5% level of significance and hence the null hypothesis of equal probability for 
male and female births may be accepted. 

Example 13-15. Fit a Poisson distribution to the following data and test 
the goodness of fit. 

X: 0 I 2 3 4 5 6 

f: 275 72 30 7 5 2. 1 
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Solution. Mean of the given distribution is « 


— YSi%i 189 


X= N “We = 399 = 0-482 


In order to fit a Poisson distribution to the given data, we take the mean 
(parameter) m of the Poisson distribution equal to the mean of the given 
distribution, i.e., we take 


m =X = 0-482 
The frequency of r successes is given by the Poisson law as : 


0-482 (4. Pr 
f~ = Np(r) = 392 x — OBEY, 0, Fe? Er © 


Now (0) = 392 x e°482 = 392 x Antilog [— 0-482 log e} 
= 392 x Antilog [— 0-482 x log 2:7183] (-+ ¢ = 2-7183) 
= 392 x Antilog [~ 0-482 x 0-4343] 
= 392 x Antilog (- 0-2093] 


= 392 x Antilog [1-7907] = 392 x 0-6176 


= 2A2-1 

fll) =mx flO) = 0-482 x 242-1 = 11669 

f?) =7% fQ) = 0-241 x 116-69 = 28-12 
f3) = 2 x2) = 2% x 28-12 = 4.518 

fid) =™@xp3) = x 4.518 = 0.544 

(AS) = Bx 4) = 0-544 = 0-052 

RO) = m9) = 0, 0-052 = 0-004 


Hence the eee saa frequencies correct to one decimal place are as 
given below : 


Expected 
Frequency 242-1 116-7 28-1 


CALCULATIONS FOR CHI-SQUARE 


Observed Expected 


frequency a aes ~ (O-E)? (O -E{E 


1082-41 
1998-09 
3-61 
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x? = YO= EY - 40.937 


Degrees of freedom = 7-1-—1-3=2 

(One d.f. being lost because of the linear constraint © O = > E; 1 d.f. is 
lost because the parameter m has been estimated from the given data and is then 
used for computing the expected frequencies; 3 d.f. are lost because of pooling 
the last.four expected cell frequencies which are less than five.) 

Tabulated value of x? for 2 d.f. at 5% level of significance is 5-99. 

Conclusion. Since calculated value of %? (40-937) is much greater than 
5-99, itis highly significant. Hence we conclude that Poisson distribution is not 
a good fit to the given data. 


EXERCISE 13(b) 


1. (a) Define Chi-square and obtain its sampling distribution. Mention 
me prominent features of its frequency curve. Obtain the mean and the 
variance of the chi-square distribution. 

(6) Show that the sum of two independent variates having chi-square 
distributions, has a chi-square distribution. 

2. (a) Write a short note on the Chi-square test of goodness of fit of a 
random sample to a hypothetical distribution 

(b) Describe the Chi-square test of significance and state the various uses to 
which it can be put. 

(c) Discuss the %?-test of gocdness of fit of a theoretical distribution to an 
observed frequency distribution. How are the degrees of freedom ascertained when 
some parameters of the.theoretical distribution have to be estimated from the 
data ? 

3.:(a) The following table gives the number of aircraft accidents that 
occurred during the seven days of the week. Find whether the accidents are 
uniformly distributed over the week. 

Days : Mon. Tue. Wed. = Thur. Fri. Sat. Total 
No. of accidents : 14 18 12 11 15 14 84 

Ans. Ho: Accidents are imiformly distributed over.the week. y7= 2-143; 
Not significant. Hy may be accepted. 

(6) A die is thrown 60 times with the following results. 

Face : 1 2 3 4 5 6 
Frequency e 8 7 12 8 14 11 

Test at 5% level of significance if the die is honest,. assuming that 

P (y2 > 11-1) = 0-05 with 5 d.f. (Burdwan Univ. B.Sc. (Hons.), 1991] 


Exact Sampling Distributions (Chi-equare Distribution) 13-45 


4. (a) In 250 digits from the lottery numbers, the frequencies of the digits 
0, 1, 2, .... 9 were 23, 25, 20, 23, 23, 22, 29, 25, 33 and 27. Test the 
hypothesis that they were randomly drawn. 

(b) 200 digits we chosen at random from a set of tables. The frequencies of 


the digits were : 
Digits : 0 1 2 3 4 5 6 7 8 9 
Frequency: 18 19 23 #21 #16 #25 22 #20 21 «15 


Use x? test to assess the correctness of hypothesis that the digits were distributed 

in equal numbers in the table, given that the values of x? are respectively 16-9, 

18-3 and 19-7 for 9, 10 and 11- degrees of freedom at 5% level of significance. 
[Delhi Univ. B.Se., 1992] 

Ans, x2 = 4-3. Hypothesis seems to be correct. 

5. Among 64 offsprings of a certain cross between guinea pigs, 34 were 
red, 10 were black and 20 were white. According to the genetic model these 
numbers should be in the ratio 9.: 3 : 4. Are the data consistent with the model 
at 5 per cent level ? 

[You are given that the value of x2 with the probability 0-05 being exceeded 
is 5-99 for 2 df. and 3-84 for 1 df.) 

6. In an experiment on pea-breeding, Mendal obtained the following 
frequencies of seeds : 315 round and yellow, 101 wrinkled and yellow; 108 round 
and green, 32 wrinkled and green. Total 556. Theory predicts that the frequencies 
should be in the proportion 9 : 3 : 3 : 1 respectively. Set up proper hypothesis 
and test it at 10% level of significance. 

Ans. x2 = 0-51. There seems to be good correspondence between theory 
and experiment. 


7. (a) Selfed progenies of a cross between pure strains of plant segregated 
as follows : 


Early flowering Late flowering 
Tall ; 120 48 
Short : 36 13 


Do the results agree with the theoretical frequencies which specify a 
9:3:3: 1 ratio? 

(b)Children having one parent of blood-type M and the other type N will 
always be one of the three types M, MN, N and average proportions of these 
will be 1:2: 1. 

Out of 300 children having one M parent and one N parent, 30% were 
found 10 be of ‘type M, 45% of type MN and the remaining of type N. Use x? 
to test the hypothesis. [Patna Univ. B.Se., 1991) 


(c) A genetical law says that children having one parent of blood group M 
and the other parent of blood group N will always be one of the three blood 
groups M, MN,N ; and that the average number of children in these groups 
will be in the ratio 1 : 2: 1. The report on an experiment states as follows : “Of 
162 children having one M parent, and one N parent, 28-4% were found to be of 
group M, 42% of group MN and the rest of the group N”. Do the data in the 
report conform to the expected genetic ratio 1:2:17? 
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(d) A bird watcher sitting in a park has spotted a number'of: birds belonging 
to 6 categories. The exact classification is given below ° 
Category : 1 2 3 4 5 6 
Frequency : 6. 7 13 17 6 5 
Test at 5% level of significance whether or not the data is compatible with 
the assumption that this particular park is visited by birds belonging to these six 
categories in the proportion 1: 1:2:3:1:1. , 
[Given P (x2 > 11-07) = 0-05 for 5 degrees of freédom] 
(Calcutta Univ. B.Sc. (Maths. Hons.), 1991) 
(e) Every clinical thermometer is classified into one of four categories A, B, 
C, D on the basis of inspection and test. From past experience it is known that 
thermometers produced by a certain manufacturer are distributed among the four 
categories in the following proportions : 
Category : A B Cc D 
Proportion : 0-87 0-09 0-03 0-01 
A new lot of 1336 thermometers is submitted by the manufacturer for 
inspection and test and the following distribution into four categories results : 
Category : A  &B C D 
No. of thermometers reported : 1188 91 47 10, 
Does this new lot of thermometers differ from the previous experience with 
regards to proportion of thermometers in each category ? 
8. (a) Five unbiased dice were thrown 96 times and the number of times 4, 
5 or 6 was obtained is given below. 
No. of dice showing 4, 5 or 6: 5 4 3 2 1 0 
Frequency : 8 18 35 24 10 1 
Fit a suitable distribution and test for the goodness of fit as far as you can 
proceed without the use of any tables and state how you would proceed further. 
(Gauhati Univ. B.Sc., 1992] 
(6) In 120 throws of a single die, the following distribution of faces was 
obtained : 
Faces : 1 2 3 4 5 6 Total 
Frequency: 30 25 18 §=10 22 15 120 
Compute the statistic you would use to test whether the results constituted 
refutation of the “equal probability” (null hypothesis). Also state how you 


would proceed further. {Nagpur Univ, B.Sc., 1992) 
(c) Given below is the number of male and female births in 1,000 families 
having five children : 
’ Number of Number ¥ Number of 
male births female births families 
0 5 40 
1 4 300 
2 3 250 
3 2 200 
4 1. 30 


Test whether the given data is consistent with the hypothesis that the 
binomial law holds if the chance of a male birth is equal to that of a female 
birth. 


Exact Sampling Distributions (Chi-<quare Distribution) 13-47 


(d) Five-pig litters Six-pig litters 
Number of males Number of Number of males Number of 
in litter litters in litter litters 

0 2 0 3 
1 20 1 16 
2 41 2 53 
3 35 3 78 
4 14 4 53 
5 4 5 18 

6 0 


Test whether each of the above two samples is a binomial sample (i) with p 
= 0-5, given a priori and (ti) with p determined from the data. Test the 
significance of the difference between. the two sample p’s. 

9. (a) The following table gives the count of yeast cells in square of a 
cyclometer. A square millimeter is divided into 400 equal squares and the number 
of these squares containing 0, 1, 2, ...cells are recorded— 

Number of cells : 0 1 2 3 4 5 6 7 8 9 10 
Frequency : 0 20 43 53 86 70 54 37 18 10 £=5 
Number of cells: 11 12 13 14 #15 += 16 
Frequency: 2 2 0 0 O 0 

Fit a Poisson distribution to the data and test the goodness of fit. 

(b) The following is the distribution of the hourly number of trucks arriving 
at a company’s warehouse. 

Trucks arriving per hour: 0 1 2 3 4 5 6 7 8 
Frequency : $2 151 130 102 45 12 #«§ 1 2 

Find the mean of the distribution and using its mean, (rounded to one 
decimal) as the parameter A, fit a Poisson distribution. Test for goodness of fit 
at the level of significance a = 0-05 

[Madras Institute of Technology, 1992] 

(c) Obtain the equation of the normal curve that may be fitted to the 
following data : 

Class : 60-65 65—70 70—75 75—80 80—85 85--90 90—95 95—100 
Frequency : 3 21 150 335 326 135 26 4 

Obtain the expected normal frequencies and test the goodness of fit. 

10. Aitken gives the following distribution of times shown by two 
samples of 504 watches each, displayed in watch-maker’s windows : 


Class interval for Frequency of watches Frequency of watches 
time shown from sample I from sample Il 
0—2 75 83 
2—4 93 86 
4—6 94 94 
6—8 76 72 
8—10 80 82 
10—12 86 87 


— = ———— 


Total 504 504 
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Calculate the expected frequencies of watches in the various class intervals 
under the hypothesis that the times shown are uniformly distributed over the 
interval (0, 12), separately for the two samples and also for the combined sample 
of all the 1,008 watches. 

Test the goodness of fit for the two samples separately and for the. combined 
sample. Test also the significance of the sum of the values of y? for the two 
separate samples. ; 

11. The following independent observations were made on the price of 
grain in 10 consecutive months : 

Month a | 2 3 4 5 6 7 8 9 10 
Price (in Rs.): 115 118 120 140 135 137 139 142 144 150 

Test the hypothesis that the expected price in the ith month is 
Rs. (100 + 38,i= 1, 2,..., 10 with a standard deviation of Rs. 5 under the 
assumption that the prices are normally distributed. 

12. To test a hypothesis Hg, an experiment is performed 3 times. The 
resulting values of chi-square are 2:37, 1-86 and 3-54, each of which corresponds 
to one degree of freedom. Show that while Hp cannot be rejected at 5% level on 
the basis of any individual experiment, it can be rejected when the three 
experiments are collectively counted. [Poona Univ. B.Sc., 1991] 


(Hint. Use additive property of chi-square variates.] 

13. (a) Describe the chi-square test for testing a hypothesis that a normal 
population has a specified variance o?. 

(b) Give the approximation to the test statistic in (a) if n, the sample size, 
is sufficiently large. 

(c) A sample of 15 values shows that the s.d. is 6-4. Is this compatible 
with the hypothesis that the sample is from a normal population with s.d.. 5 ? 

Ans, Hy: 6 = 5, x? = 24-58; Significant. Population s.d. is not 5. 

(d) Test the hypothesis that o = 8, given that s = 10 for a random sample of 
size 51 from a normal ' population. 


Ans, Z = V 2x2 - V(2n -le= V2 x 79-69 ~V101 = 2-57. Significant 
at 5% level of significance. 

14. (a) A manufacturer claims that the life time of a certain brand of 
batteries produced by his factory has a variance of 5000 (hours)*. A sample of 
size 26 has a variance of 7200 (hours). Assuming that it is reasonable to treat 
these data as a random sainple from a normal population, test the manufacturer’s 
claim at the a = 0-02 level. 

Hint. Ho : o? = 5000 (hours)*; H, : o? # SC00 (hours)? (Two-tailed) 

Critical region is : x? < 42,95) (0-99) ar. i x2 > %2¢5) (0-01). 

(6) A manufacturer recorded the cut-off bias (Volt) of a sample of 10 tubes 
as follows : 

12-1, 12-3, 11-8, 12-0, 12-4,.12-0, 12-1, 11-9, 12-2, 12-2 

The variability of cut-off bias for tubes of a standard type as measured by 
the standard deviation is 0-208 volts. Is the variability of the new tube with 
respect to Cut-off bias less than that of the standard type ? 
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Hint : Ho: 6? = (0-208)? (Volts)? = 625 (say) ; H; : 0? < do? 
Critical region is : %? < X2¢q-1) (1. — @) = X29) (0-95); a = 0-05 
13-7:3. Independence of Attributes. Let us consider two attributes 
A and B,A divided into r classes Aj, Az, ..., A, and B divided into s classes 
B,, Ba. +++, Bs. Such a classification in which attributes are divided into more 
than two classes is known as manifold classification. The various cell 
frequencies can be expressed in the following table known as r x s manifold 
contingency table where (A;) is the number of persons possessing the attribute 
A;, i = 1, 2,...., 7),(@,) is the number of persons posse. sing the attribute B; 
pe Ve Ze 2c. s) and (A ;B;) is the number of persons possessing: both the 
atone A; and oe {i= 1, "2, .7;J/21,2,..., 5]. Also 


z i= p> (B;) =N, is the total frequency. 


rxs CONTINGENCY TABLE 
n = 


wey Co 


(A,B 8 co (A; ap | oe | 


aa eterere 


The problem is to test if two attributes A and B under consideration are 
independent or not. 


Under the null hypothesis that the attributes are independent, the theoretical 
cell frequencies afe calculated as follows : 


P{A;] = Probability that 4 person possesses the, attribute A; 
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P[B;] = Probability that a person possesses the attribute B; 


(B;) } 
— +42. —_ 
—_ N ry j= 1, Zz. eae 


P{AB,) = ean that a person possesses the attributes A; and B; 


(By compound ie theorem, since the attributes A; and B; are 
independent, under the, null hypothesis). 


fo By). 


PIAB =“. b= 1,2, SFE 12, 
. A;Bio = dieeiaas number of persons possessing both the attributes 
A; and B; 
(A) 6;) 
= N.P(A;B) = 
= (A; Bio _ Ane) WOH d, 2..0g 63S 2h Ze 20855) - (13-16) 


By using this Balle we can find out expected frequencies for each of the 
ceil- frequencies (A;B;), @ = 1,2, ....73j = 1, 2, ...,.5), under the. null 
hypothesis of independence of attributes. 

The exact test for the independence of attributes is very complicated but a 
fair degree of approximation is given, for large samples, (large N), by the x?-test 
of goodness of fit, viz., 


2 © y | LA4)D- eenl) anni: 
x = 2 2 k (AB;)o Noten 


which is distributed as a y?-variate with (r — 1)(s — 1) df. [c.f Note below on 
degrees of freedom]. 


Remark. ¢7=¥?/N is known as mean-square contingency. 


Since the limits for ~2 and ¢? vary in different cases, they cannot be used 
for establishing the closeness of the relationship between qualitative characters 
under study. Prof. Karl Pearson suggested another measure, known as 
“coefficient of mean square contingency” which is denoted by € and is given by 


2 2 
Ce ee ee 4403-17) 


Obviously C is always less than unity. The maximum value of-C depends 
on r and s, the number of classes into which A and B are divided. Inarxr 


contingency table, the maximum value of C = V (r—1)/r. Since the maximum 
value of C differs for different classification, viz. rxXr(r=2,3,4,...), strictly 
speaking, the values of C obtained ‘from different types of classifications are not 
_ comparable. 
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Nore on Degrees of Freedom (d.f.). The number of independent 
variates which make up the statistic (e.g., %7) is known as the degrees of 
freedom (d,f.) and is usually denoted by v (the letter ‘Nu’ of the Greek alphabet). 

The number of degrees of freedom, in gencral, is the total number of 
observations less the number of independent constraints imposed on the 
observations. For example, if k is the number of independent constraints in a set 
of data of n observations then v = (n — k). 


Thus in a set of n observations usually, the degrees of freedom for x? are 
(n — 1), one d.f. being lost because of the linear constraint }; O; = XE; = N, on 
4 t 
the frequencies (c:;f. Theorem 13-3, page 13-12.) 
If ‘r’ independent linear constraints are imposed on the cell frequencies, then 
the d.f. are reduced by ‘r’. 


In addition, if any of the population parameter(s) is (are) calculated from the 
given data and used for computing the expected frequencies then in applying 
y?-test of goodness of fit, we have to subtract one d.f. for each parameter 
calculated. Thus if ‘s’ is the number of population parameters estimated from 
the sample observations (n in number), then the required number of degrees of 
freedom for x?-test is (n — s — 1). 


If any one or more of the theoretical frequencies is less than 5 then in 
applying x?-test we have also to subtract the degrees of freedom lost in pooling 
these frequencies with the preceding or succeeding frequency (or frequencies). ° 

In ar Xs contingency table, in calculating the expected frequencies, the row 
totals, the column totals and the grand totals remain fixed. The fixation of ‘r’ 
column totals and ‘s’ row totals imposes (r + 5) constraints on the cell 
frequencies. But since 


2X (A)= 2 BEN, 
iel jel 


the total number of independent constraints is only (r + s — 1). Further, since 
the total-number of the cell-frequencies is r x s, the required number of degrees 
of freedom is : 
v=rs—~(r+s—l)=(r-1)(s-1) 

Example 13:6. Two 
sample polls of votes for two 
candidates A and B fora 
public office are taken, one 
from among the residents of 
rural areas. The results are 
given in the table. Examine 
whether the nature of the area 
is related to voting preference 
in this election. 


[Gujarat Univ. B.Sc., 1990] 
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Solution. Under the null hypothesis that the nature of the area is 
independent of the voting preference in the election, we get the observed 
{requencies as follows : 


1170 x 1000 830 x 1000 


£(620) = PED 1OOO = 585, E(380) = SPO XTOO0 = 415, 
E(550) = aeons = 585, and £(450) = Sonu = 4154 


Aliter. In a 2 x 2 contingency table, since: 
d.f.=(2~1)@Q-1)=1, 
only one of the cell frequencies can be filled up independently and the remaining 
will follow immediately, since the observed and theoretical marginal totals 
are fixed. Thus having obtained any one of the theoretical frequencies, (say), 
E(620) = S85, the remaining theoretical frequencies can be easily obtaincd as 
follows : 


E(380) = 1000 — 585 = 415, E(S50) = 1170 — 585= 585. 
and E(450) = 1000 — 585 = 415 


2 = Y] e= 2) _ (620 — 585)? | (380. : AI 5)? 
7 E ~ 585 


, 050 - ne , (450 - 4159 
585 415 


A: ees Cs 
= (35) Es * 415 * 585° * a3 | 


= (1225)[2 x 0-002409:+ 2 x-0-001709] = 10-0891 


Tabulaied 7299s for (2 — 1) (2-1) = 1G.f. is 3-841. Since calculated x? is 
much greater than the tabulated value, it is highly significant and null 
hypothesis is rejected at 5% level of significance. Thus we conclude that nature 
of arca is rclated to voting preference in the election. 

Example 13-17. (2 x 2 contingency table). For thé 2 x 2 table, 


ce fae 


prove that mail test of independence gives 
N(ad — bc)? 


13-18 
YGENesDGsyey es ..(13-18) 


[Gauhati Univ. B.Sc., 1992} 
Solution. Under the hypothesis of independence of attributes, 
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E(a) = (a + 2 (a + C). 


(a + b) (b+d) 
E(b) = a+) Ore 


(a +c) (c +a) 
E(c)= N 
and E(d) = ee derd 


_[a-E@P , (0-E®? ,le= E(c)}?  [d- E(d)}? 


2 8 
EG) E(b) E@ ~ Ed) “2 
{a+ b) (a+ c) 
G=bG sas a+ Oars 
_afa+b+c+d)-(a*+ac+ab+ bc) ad-be 
~ N ~ ON 


Similarly, we will get 


ad — bc _ad—be 
N =c-E(c);d-E(d)= N 


Substituting in (*), we get 


_(ad-bePP 1 1 ta 
ay Eo * Eb) * EO * za | 


b-E(b) =- 


(ad —bc)* | i: | 
N (a+b\(a+c) (a+b)(b+ a) 


\eraeea* oracrSy| 


_ (ad -bcy b+d+a+c | b+dt+a+c 
~ ON kK +b)(a+c)\(b+d) (at+c)(c+d)(b+ D| 


_c+d+a+b | 
(a + b)(a + c)(b + d)(c + ad) J. 


_ N(ad — bc)? 
(a+ by(a+c)(b + d)(c + d) 


Example 13-18. A random sample of students of Bombay University 
was Selected and asked their opinions about ‘autonomous colleges’. The results 
are given below. The same number of each sex was included within each class- 
group. Test the hypothesis at 5% level that opinions are independent of the class 
groupings :-— 


= (ad - ve] 


13-54 Fundamentals of Mathematical Statistics 


Favouring Opposed to 
‘autonomous ‘autonomous 
colleges’ colleges’ 


FLY. B.A./B.Sc./B.Com. 
S.Y. BAJ/B.Sc./B.Com. 
T.Y. B.A./B.Sc./B.Com. 
M.A/MSc.1M.Com. 


[Bombay Univ. B.Sc., April 1989] 


Solution. We set up the null hypothesis that the opinions about 
autonomous colleges are independent of the class-groupings. 

Here the frequencies are arranged in the form of a 4 x 2 contingency table. 
Hence the d.f. are (4 - 1) x (2-1) 33 x 1 = 3. Hence we need to compute 
independently only three expected frequencies and the remaining expected 
frequencies can be obtained by subtraction from the row and column totals. 

Under the null hypothesis of independence : 

400 x 200 400 x 200 


£(120) = 7026200 _ 13333 ; £(130) = O09 _ 133.33 
E(10) = AOU 100 - 96.67 


Now the table of expected frequencies can be completed as shown below : 


Favouring Opposed to 


‘autonomous ‘autonomous 
colleges’ colleges’ 


133-33 200 — 133-33 
= 66-67 

S.Y.B.A./B.Sc.{B.Com. 133-33 200 -— 133-33 
= 66-67 

T.Y.B.A./B.Sc./B.Com. 66°67 100 — 66-67 
= 33.33 

M.A./M.Sc./M.Com. 66-67 ~ 100 — 66-67 
= 33.33 


ee ee 


F.Y.B.A./B.Sc./B:Com. 
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CALCULATIONS FOR CHI-SQUARE 


177-6889 


11-0889 0-0832 
11-0889 0-1663 
177-6889 2:6652 
177-6889 2:6652 
11-0889 0-1663 
11-0889 0-3327 


177-6889 


x? => (0-5 = 12-7428 


Tabulated (critical) value of x? for (4 — 1) x (2-1) =3- df. at 5% level of 
significance: is 7-815. 

Conclusion. Since calculated value of y? is greater than thé tabulated value, 
it is ‘significant at 5% level of significance and we reject the null hypithesis. 
Hence, we conclude that the opinions about autonomous colleges’ are dependent 
on the class-groupings. 

Example 13-19. Two researchers adopted different sampling techniques 
while investigating the same group of students to find the number of students 
fallirig in different intelligence levels. The results are as follows : 


No. of students in each level. 
Below Average Average Above Average Genius 


Would you say that the sampling techniques adopted by the two researchers 
are significantly different ? (Given 5% value of x for 2 df. and 3 df. are 5-991 
and 7-82 respectively.) 


Solution. We set up the null hypothesis that the data obtained’ are 
independent of the sampling techniques adopted by the two- researchers. In 
other words, the null hypothesis is. that their is no significant.difference between 
the sampling techniques used by the two researchers for collecting the required 
data, 
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Here we have a4 <2 contingency table and df. = (4-1) x(2-1)=3x1 
= 3. Hence we. need to compute only 3 independent expected frequencies and the 
remaining ‘expected frequencies can be obtained by subtraction from the marginal 
row and column totals. 


Under the nult hypothésis of independence, we have 


| 69x 200. , 
E(44) = 300 = 46 


The table of expected frequencies can now be.completed as shown below : 


No. of students in each level 


Researcher 
Below Average Average 


_ Since we cannot apply ‘the re straightway here as the last frequency is 
less than 5, we should use the technique of pooling in this case as given below : 


CALCULATIONS FOR CHI-SQUARE 


After pooling, y2= aes). 0.923 


andthe df. = (4-1) (2-1)~1=3-1=2, since 1 df. is lost in the 
method of pooling. 
Tabulated value of x? for 2d,f. at 5% level of significance is 5-991. 
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Conclusion. Since calculated value is less that the tabulated value, null 
hypothesis may be accepted at 5% level of significance and we may conclude 
that there is no significant difference in the sampling techniques used by the 
two researchers. 

13-8. Yates’ Correction. In a 2 x 2 contingency table, the number of 
df. is (2-1) (2-1) =1. If any one of the theoretical cell frequencies is less 
than 5, then the use of pooling method for y?-test results in x? with 0 df. 
(since 1 df. is lost in pooling) which is meaningless. In this case we apply a 
correction due to F. Yates (1934), which is usually known as “Yates’ Correction 
for Continuity”. [As already pointed out, x? is a continuous distribution and it 
fails to maintain its character of continuity if any of the expected frequency is 
less than 5; hence the name ‘Correction for Continuity’J. This consists in 
adding 0-5 to the cell frequency which is less than 5 and then adjusting for the 
remaining cell frequencies accordingly. The y?-test of goodness of fit is then 
applied without pooling method. 


For a 2 x 2 contingency table, , we have 


a N(ad - bc)* 
X= (a+ c) (b+ d) (a+b) (c +d) 


According to Yate’s correction, as explained above, we subtract (or add) ; from a 


and d and add (subtract) + to b and c so that the marginal totals are not disturbed 
at all. Thus, corrected value of x? is given as 
N[ot hat )-OtH (e+) 
Numerator =N [ (ad — bc) ¥ ; (a+b+c +d) 


2 
=n} tad — bc | 
2 
> -—N[lad-bel -N/2)?_. oe 
x" =(@ +c) (b+ d) (a+b) (c +d) epi Te4) 

Remarks I. If'N is large, the use of Yate’s correction will make very 
little diffcrence in the value of x. If, however, N is small, the application of 
Yates’ correction may overstate the probability. 

2. It is recommended by many authors and it seems quite logical in the 
light of the above discussion that Yates’ correction be applied to every 2 x 2 
table, even if no theoretical cell frequency is less than 5. 

13-9, Brandt and Snedecor Formula for 2 x k Contingency 
Table, Let the observations a, (i = 1, 2; j = 1, 2, ..., k) be arranged ina 2 xk 
contingency table as follows : 
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Under the hypothesis of independence of attributes, we have 
Fay) =; B(ay) =, i= 1,2,. 


; 
k 
_ (a,;-E(a,))? | (ay - E(a,,))? 
e- >| Hla) * Elan) a 


ge ha 

ZUF ber) 116) } 
where Pi = ge l-pe 
and pay =t-p=~ 


=D 0, (>: -P | *4 [sp+q=1] 
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k 
-1| x n,p? + p?N - 2p wp | 
PO Li = 
L| 
=—_ Nn; 2=N | 
+) a : 
k k k 
But 2 np = 2 Nj Pi-Pi = 2 1;Pi 
gs § = 


k 
x? 1/5 a\i:p;—- Np 2|-2 3 > ane np?| ..-(13-19) 


int Nj 


k 2 k 2 , 

My ae | al 5 fie a 

= - = — .»L13-19(@ 

“Re ne Lis nN ! )) 

Example 13:20. The following table shows three age groups of boys 

and girls, (a) the number of children affected by a non-infectious disease and (b) 
the total number of children exposed to risk. 


Boys Girls 
I pT | It I I i 
(a) 60 25 48 96 18 42 
(b) 240 470 350 530 200 210 


@) Test whether there are differences between the incidence rates in the three 
age groups of boys. 
(ii) Test whether the boys and girls are equally susceptible or not. 
Solution. (i) We set up the null hypothesis (Ho) that there is no 
significant difference between the incidence rates in the three age-groups of boys. 
In the notations of § 13-9, we have 
Q,, = 60, a,2 = 25, a)3 = 48, m, = 133 
n, = 240, a, = 470, nz = 350, N = 1060 
- 7 133_ 
~ 1060 
eaieails these values in (13-19a), we get 


[15-00 + 1-33 +-6-58 - 133 x 0-1255] 


===. = 0.1255, g = 1 -p = 08745 


x ae oe 
(0-1255)(0-8745) 


6.2187 _¢, 
= 0.1097 = 70-688 


Here vy =(3 — 1) (2-1) =2. 


13-60 Fundamentals of Mathematical Statistics 


The tabulated value of x? for 2 degrees of freedom at 5% level of 
significance is 5-991. Since calculated valuc of y? is much greater than tabulated 
value, we reject the null hypothesis and conclude that the incidence rates in the 
three age-groups of boys differ‘significantly. 

(ii) Here we set up the null hypothesis that the boys and girls are equally 
susceptible to the disease. In the usual notations, we have 

ay,= 60+ 25+ 48 = 133 and a).= 96+ 18+ 42= 156, mt = 289 

= 240 + 470 + 350 = 1060 and nz = 530 + 200 + 210 = 940, N = 2000 


_ 289 _ = = 0. 
= 5000 = 0-:1445, g=1-—p=0-8555 


= 289 x 0-1445 = 41-76 
> = ianiealicate [ 16-69 + 25-89 — 41-76 | = 6-605 

Here vy =(2-1)(2-1)='1 

From the tables, the value of x? for 1 degree of freedom at 5% level of 
Significance is 3-841 which is much less than the calculated value. We, 


therefore, reject the null hypothesis and conclude that boys and girls are not 
equally suscepuble to the disease. 


Example 13-21. Two samples of sizes N,,Nz have respectively 
frequencies fi, fo, ....f, And fy’, fo’, ....f, under the same headings. Show that 
x? for such a distribution is equal to 


[ (fe. Sf , 
as N, Ny 
> NN ; 
r=] si ie + ft; 
[Allahabad Univ: B.Sc., 1992] 


Solution. The 2 x n contingency table for which y2 is to be calculated is 
given below : 


Under the hypothesis of independence of attributes, we have 
By) = MELD py) Nae fed 
+N 2 
2: fast a if - | 
‘ 3 EG) ~ EG) 


ral 
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siz 5 Fey if a 
ral NiG,+f,) © ‘i N,+N.2 
, +N) ee | 
* No, +r) |" N, +N 


=) E (N2f-- Ni f,)? (Ni fh, — Nof,)? 
N,(Ni+ N)6-+f-) * NAN, + N2) (f, +f) 


: piel tte | 1 | 
721 (Ny + Naf, +f) N, 


: | _NNa_ fey) 
reiL Se +r \N1 N2 
EXERCISE 13(c) 


1. (2) What is contingency table ? Describe how the x? distribution may be 
used to test whether the two criteria of classification in an m x n contingency 
table are independent. 

(b) State the hypothesis you test using the Chi-square statistic in a 
contingency table. 

(c) Describe the x? test for independence of attributes, stating clearly the 
conditions for the validity. Give a rule for calculating the number of degrees of 
freedom to be assigned to x. Illustrate your answer with an m x n contingency 
table explaining the null hypothesis that is being tested. 

2. Of ‘A’ candidates taking a certain paper, ‘a’ are successful, of ‘B’ taking 
another paper, ‘b’ are successful. Show how the significance of the difference 
between the ratios a/A, b/B may be tested (i) by a x2 test on contingency table 
and (ii) by comparing the difference with its standard error assessed by means of 
a binomial distribution. (You may assume all frequencies are sufficiently large.) 
Prove algebraically that the value of x? is the square of the ratio of (@/A — b/B) 
to its standard error. 


3. (a) Show that for the entries in the following 2 x r contingency table, 
A, A» A; A, Total 


= 
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a ny; b 
where pire pes, oat and q;=1-pi q=7 


(Madurai Univ. B.Sc., Oct. 1988] 
(b) Given x? contingency table representing two independent samples : 


Total 
Sample I [4 [Hy dan H, | m 
Sample II Vv} V2 ag vp n 
Total H+ vy [lo + Vo see Up,+v, m+n, 
show that 
l td 
) a ee = 
xc) ea mo | 
where OF - —Fi_ and w=—” , 


HL; + V; m+n 

can be used to test whether the samples are drawn from the sample population. 
Clearly state the underlying assumptions, and give the number of degrees of 
freedom. 

(c) In a 2 x 3 contingency table if N=x+y+z,N’=x’ + y’+2z’ and 
N =N’, show that 

yx (x - x’)" (y- hae ie 20 
x+x’ yty z+2 
(Poona Univ. B.Sc., 1990) 

(d) Show that for a 2 x 2 table, the value of %?, after applying Yates’ 

correction for continuity is 


N NY? N N\? 
5, (ad - bc -4) or 5 (ad - bc +5) 


according as ad — bc > 0 or < 0 respectively, where 
D=(a+b)(a+c)(b+d) (c +d). 

(e) What is Yates’ correction ? Show that for a 2 x 2 contingency table, the 

value of y? after applying this correction is : 
N[ lad - bc !-N/2]? 
“(a +b) (a+c)(b+d)(c +4) 
[Marathwada Univ. M.Sc., 1991] 

4. Consider the following 2 x 2 table of observed frequencies based on 

random samples (with replacement) of sizes n.; and n.. from two populations : 


Population!  Populationil Total 
Class A Ry ny My. 
Class B Mo, No» No. 
Total Nn. N.o n 


(i) Define the 2-statistic to be used for test of homogeneity of the two 
populations. 


(ii) Show that 


2 . 2Cri1 222 — My2N1)? 
(ny. ni, N2. N.2) 
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(iii) Let 

y otf 
ny no 

Calculate the mean and variance of u and indicate how you may estimate 
them. 

5. (a) In an epidemic of certain disease 92 children contracted the disease. 
Of these, 41 received no treatment and of these 10 showed after-effects. Of the 
remainder who did receive treatment, 17 showed after-effects. Test the hypothesis 
that treatment was not effective. 

(b) Can vaccination be regarded as a preventive measure of small-pox as 
evidenced by the following data ? 

“Of 1482 persons exposed to smallpox in a locality, 368 in all were: 
attacked, Of these 1482 persons, 343 were vaccinated and of these, only 35 were 
attacked”. 

6. (a) Define x2. Cite some statistical problems where you can apply x? for 
testing Statistical: hypothesis. 

In an experiment on immunization of cattle from tuberculosis the following 
results were obtained : 


Affected Unaffected 
Inoculated 12 28 
Not inoculated 13 7 


Examine the effect of vaccine in controlling the incidence of the disease. 
(b) What are contingency tables ? What is tested there ? Explain the test 


procedure therein> 
The following data is collected on two characters : 
Cinegoers Non-cinegoers 
Literate 83 57 
Illiterate 45 68 


Based on this, can you conclude that there is no relation between the habit 
of cinema going and literacy ? 

7. (a) To find whether a certain vaccination prevents a certain disease or 
not, an experiment was conducted and the following figures in various classes 
were obtained, A showing vaccination and B attacked by the disease. 


Using x?-test, analyse the results of the experiment for independence 
between A and B; examine whether Yate’s correction modifies the conclusion or 
not. Test also the significance of the difference between the proportions of 
persons attacked by the disease among vaccinated and non-vaccinated which are 
69/160 and 10/40. 


(b) A theory in finance known as Random Walk Theory suggests that short 
term changes in stock prices follow a random pattern. According to this theory, 
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yesterday’s price change can tell us virtually nothing of value about to-day's 
price change. Let us denote the change in price of a stock on day ¢ by A P, and 
the change on the next day by A P,,,. Suppose we observe price changes of 240 
stocks that have been randomly selected and obtain the results shown in the table 
below : 


Test the hypothesis that the change in stock price on day (¢+1) is 
independent of that on day . 


[Dethi Univ. M.A. (Eco.), 1987] 
8. (a) Show that the value of 7? for 2 x 2 contingency table 


Eike Nad =p 


: = Ge) (b+d)(at+b)(e+d)’ 


where N=a+b6+c+d. 


(6) Let X and Y denote the number of successes and failures respectively in 
n independent Bernoulli trials with p as the probability of success in each tnal. 


Show that 
(X —np)?  [Y¥ —n(1- p)}? 
+ 
np n(1 - p) 
can be approximated by a chi-square distribution with one degree of freedom 
when n is large. . [Delhi Univ. M.A. (Eco.), 1986] 


9. Show that for r x s contingency table : 
(a) Number of degrees of freedom is (r — 1) x (s — 1) 
(b) x2 =N (s ~1) or x2=N (r — 1), whichever is less 
(c) EQ?) =N(r- 1) (s- DAN- 1) 
@ max (C) = [(s- 1)/s]'?, r= s, 
where C is the coefficient of contingency and N is ths total frequency. 


10. (a) 1072 college students were classified according to their intelligence 
and economic conditions. Test whether there is any association between 
intelligence and economic conditions. 


Intelligence 
Excellent Good Mediocre Dull 
Economic Good 48 199 181 82 
Conditions Not good 81 185 190 1,06. 


(b) Below is given the distribution of hair colours for either sex in a 
university : 
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(i) (2) (3) (4) (5) 


Hair colour Fair Red Medium Dark Jet black Total 
Boys 592 119 849 504 36 2100 
Girls 544 97 677 451 14 1783 
Total 1136 216 1526 955 50 3883 


Test the homogeneity of hair colour for either sex. If the result is 
significant at 5 per cent level, explain the reason why it should be so. 

11. (a) The following data are for a sample of 300 car owners who were 
classified with respect to age and the number of accidents they had during the 
past two years. Test whether there is any relatinnship between these two 
variables. 


Accidents 


3 or more 


(6) For the data in the following table, test for independence ‘between a 
person’s ability in Mathematics and interest in Economics. 


Ability in Mathematics 


Average 
42 


61 


Economics 


47 


State clearly the assumptions underlying your test procedure. 
[Delhi Univ. M.A. (Eco.), 1988 


12. The following table gives for a sample of married women, the level of 
education and marriage adjustment score : 


Marriage—adjustment score 


Low High Very high 


College 


62 38 


Level of 


Education 
High school 


Middle School 


Can you conclude from the above, ‘the higher the level of education, the 
greater is the degree of adjustment in marriagé’ ? 
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13. (a2) The table below shows results of a survey in which 250 

7 respondents were categorized according to level of education and attitude towards 

Students’ demonstrations at a certain college. Test the hypothesis that the two 
criteria of classification are independent. Let a = 0-05. 


Attitude 
Education Against Neutral For 
Less than high school 40 25 51 
High school 40 20 5 
Some college 30 15 30 
College graduate 15 15 10 


(b) Test the hypothesis that there is no difference in the quality of the four 
kinds of tyres A, B, C and D based on the data given below. Use 5% level of 
significance. 


Tyre Brand 
A 
2 


Failéd to last 40,000 kms. 


e res ; 6 
Lasted from 40,000. kms. to 118 
60,000 kms. 

Lasted more than 60,000 kms. 56 


(c) The results of a survey regarding radio listeners’ preference for different 
types of music are given in the following table, with listeners classified by age 
group. Is preference of type of music influenced by age ? 


{Bangalore Univ. B.E., 1992) 


Type of music Age group 

preferred 19—25 26—35 Above 36 
Nationa] music 80 60 9 
Foreign music 210 325 ; 44 
Indifferent 16 45 132 


14. (a) If x1, X, ..., X_ represent the respective number of successes in k 
samples cach of n tials, by considering a suitable 2 x & contingency table, 
derive an expression for 7 to test the homogeneity of this data. 

(6) It was decided to check the dental health of children in 8 districts of a 
town. The condition of the teeth of 36 children from each district was examined 
and classified as either good or poor. The number of children with teeth in a poor 
condition froin each of the districts was 9, 14, 12, 18, 7, 10, 15, 11. Can it be 
concluded that the dental health of children does not vary between districts ? 

13-9-1. y?-test of Homogeneity of Correlation Coefficients. 
Let r), 72, ..., 7% be k estimates of correlation coefficients from independent 
samples of sizes nm), mo,’,.., Mg respectively. 

We want to test the hypothesis that these sample correlation coefficicnts are 
the ‘estimates of the same corrclation coefficient p from a bivariate normal 
population. 
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Obtain the values of z,, 22, ..., 2, from the Table of Fisher’ z - 
transformation or from 


1 +r; 
z=} log, ( or tanh-!7y;; i= 1,2,...,&% .--(13-20) 


These z;’s are normally distributed about a common mean 


at 1+p eae 

5 =41og, (4 _ 2) and variance = 7 = 
The minimum variance estimate z of the common mean & of Z’s is Obtained 

by weighting the values z;’s inversely with their respectively variances. The 

estimate of z is, therefore, 


.. (13-21) 


22; (nj - 3) 
ge eee 


7="S@,-3 (c.f. § 14-7-2) 


so that (z;- 7) ¥n;-3;i=1,2,..., & are independent standard normal 
k 

variates. Hence > (n; —3) (z; -z )* is a y?-variate with (& — 1) d.f. [By 
i=l 

additive property of ?-distribution, one d.f. being lost since z has been 


determined from the data.]} 


If x? value thus abtained is greater than 5 per cent value of x2 for (k — 1) 
d.f., the hypothesis of homogeneity of correlation coefficients is rejected. If not, 
the correlation coefficients are supposed to be homogeneous in which case we 


e ° e ° e A : 
combine the sample correlation coefficients to find the estimate p of the 
population correlation coefficient p. 


We have z= ; log tear] 
1-p 
= (l+p) =(1-p)e# 
=> (1+) =e_] 
me ae ae (13-22) 
e= +] 


Remark. For testing the homogeneity of indepcndent estimates of the 
parent partial correlation coefficient, the above formulae hold with the only 
difference that for a partial correlation coeffi, ient of order s, n; will be replaced t-v 
n- S. 


Example 13-22. The correlation coefficient between daily ration of green 
grass and rate of growing calves on the basis of observations taken on 10, 14, 
16, 20, 25 and 28 cows at six farms were found to be 0-318, 0-106, 0-253, 
0-340, 0-116 and 0-112. Can these be considered homogeneous ? If so, estimate 
the common correlation coefficient. 
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Solution. Ho : The given values of sample correlation coefficients are 
homogeneous or the samples are from equally correlated populations. 


Using (13-20), we get 
= 0-3294, 2=0-1063, 23> 0:2586 
z4=0-3541, 2z5;=0-1165, and 2,=0-1125 ., 
z= 2 2; (nj — 3)/2 (nj — 3) = 01919 : 
Now x? = ¥ (n; - 3) (2; -z )* = 0-1008 


Tabulated value of x? for (6 - 1) = 5, degrees of freedom at 5% level of 
significance is 11-070. 

Since the calculated value is less than the tabulated value, we may accept 
the null hypothesis that the sample correlation coefficients are homogeneous. 


A 
If p is the pooled estimate of the population correlation coefficient, then 
using (13-22), we get 


13-10. Bartlett’s Test for Homogeneity of Several Independ- 
ent Estimates of the Same Population Variance. Let 
“3 
S22 >» (Xj-X)% @=1,2,..50 


é j= 


be the unbiased estimate of the population variance, obtained from the ith 
sample Xj, = 1, 2, ..., nj) and based on v; = (n; - 1) dégrées of freedom, all the 
k samples being independent. 

Under the null hypothesis that the samples come from the same population 
with variance 07, i.e., the independent estimates 57, (i = 1, 2, ..., k) of o? are 
homogeneous, es proved that the statistic 


>} (v; log =)/ E *sa oD = (3) a) (13-23) 


iu] 


$2 §2 
where g2 = 2UiSE 2MSE yy L, 


Lv; = v ta si) 


follows chi-square distribution with (k — 1) degrees of freedom. 
Remarks 1. S? defined in (*) is also an unbiased estimate of o?, since 
_ 2 ES; om Gv) O° | 


2. Let S;2 and S?; i#j,1¢ 4 j) <k be the smallest and the. largest 
values of thé estimates respectively. If on the basis of F-test (cf. Chapter 14), 
these do not differ significantly, then all the estimates 5 which lie between 57 
and S won’t differ significantly either and consequently all the estimates can be 
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reasonably regarded as homogeneous, coming from the same population. In this 
case, therefore, there is no need to apply Bartlett’s test. 

13-11. ~2-Test for Pooling the Probabilities from Independent 
Tests to give a Single Test of Significance (P,-Test). See Examples 
13-1 and 13-2 for detailed discussion. 


EXCERCISE 13 (d) 


1. Define y2-statistic. What are its uses ? What is Fisher’s z-transformation 
for correlation coefficient and what are its properties ? How is it used to combine 
the correlation coefficients between two random variables computed 
independently from different sources. 

2. Explain the use of the chi-square statistic for testing the homogeneity of 
several independent estimates of population correlation coefficient, clearly 
stating the underlying assumptions. 

3. (a) The correlation coefficients between wing length‘and tongue length 
were eStimated from 2 samples each of size 44 to be 0-731 and 0-690. Test 
whether the correlation coefficients are significantly different or not. If not, 
obtain the best estimate of the common correlation v vefficient. 

(b) Test for equality of the correlation co-efficients between the scores in 
two halves of a psychological test .applied to different groups of sizes 30, 20 and 
25 if the corresponding sample values are 0-63, 0-48, 0-71, respectively. 

4. (a) Independent samples of 21, 30, 39, 26 and 35 pairs of values yielded 
correlation coefficients 0-39, 0-61, 0-43, 0-54 and 0-48, respectively. Can these 
estimates be regarded as homogeneous ? If so, find an estimate of the correlation 
coefficient in the population. 

(b) Test whether the following set of correlation coefficients between stature 
and sitting heights obtained for persons from 8 districts can be regarded as 
homogeneous. 

Sample size 130 60 338 78 $4125 $299 170 139 

Corr. coefficient : 0-718 0-961 0-825 0-685 0-700 0-548 0-793 0-687 

(c) The correlation coefficients between fibre weight and staple lehgth in six 
cotton crosses were estimated as : 

— 0-129, 0-1138, — 0-2780, 0-0033, 0-2331 and 0-0550 
based on, samples of sizes 73, 81, 67, 83, 71, 57 respectively. Test the 
homogeneity of 7;’s and obtain their best esumate. 


13-12. Non-central ‘y?2-distribution. The x2-distribution defined as 
the sum of the squares of independent standard normal variates is often referred to 
as the central y2-distribution. The distribution of the sum of the squares of 
independent normal variates each having unit variance but with possibly non- 
zero means is known as non-central chi-square distribution. Thus if X;, i = 1, 2, 
..-, M) are independent N(j,, 1), r.v.’s then 


x7= x X;, ..(13-24) 
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has the non-central x? distribution with x d.f. Intuitively, this distribution would 
seem to depend upon the n parameters [1;, 2, ..., U, but it will be seen that it 
‘epends on these parameters only through the non-centrality parameter 


= 5 (a + [ly? T sa0 ut,”) .-.(13-24a) 


and we write,y ~ 72 (n, A). 
13-12-1. Non-central y?-distribution with Non- -centrality 


Parameter i. The p.df. is given by 
e*N 
lias Ft PkPns >) ...(13-25) 


where P(X2n12;) is the p.d.f. of (central) x?-variate with n + 2j df. 
Thus fa (A) is the mixture of central y?-distributions with d.f. n, n+ 2, 


n+4, ..., the corresponding weights being the succéssive terms of the Poisson 
distribution with parameter A. 

Derivation .of p.d.f. of x’. We shall obtain the p.d.f. of non-central 
1?-distribution through moment generating function (m.g.f.), by using the 
uniqueness theorem of m.g.f. 

13-12-2. Moment Generating Function vf Non-central ?- 
Distribution. If X ~N (u, 1) then 


My(=—= |] ee WR 
M ya wef 


exp [or- bow? = xp [- - = 1)x?- px cad 
= exp | - (45%) { 2- ian Toad] 
= exp | - CA (G- aT REY, + S- ao } | 
=exp (5 a) L- 3") -745)1 
~ Myat) =exp (5 fore! __ex [- CHYE- tu) Je 


sexed |) (-} 2) 4 
=o (HS) EI “ap 24 Va 22 


| a 1 
2 (12 exp (7HS;) 11-2120 => t<5 oo o*) 
If X;@=1,2,. ssi), She IMME PERERE AN GIs 1) then the m.g.f. of the non- 
central 7?-variate 7’? = x X? is given by 


Exact Sarnpling Distributions (Chi-square Distribution) 13-71 


a 
My) 2M Ss y= I] My2@) — Since X?,s are independent) 
a 


iz] 


= |] [a ~ 2t)-'/2 exp [From (*)] 
| 


= (1 — 2s)-"? exp ata 2 x sil 


= (1 - 20)-*2, exp [2a /(1- 20], ¢< 3 ..(13-26) 
where X =} Er 17, is the non-centrality parameter. 
i=] 


(13-26) can be re-written as° 
l 
M, At) = (1 — 2)-* exp. [a(- 1+ l- a) | 


~7) 
2t 


= (1 —22)-A2 9 -2 
= (1 —2s)"““e _ 5 (4) 


= (1 — 2s)? e-exp 


r} 


= -A4 r 
Di: ev a ...(13-26a) 
r=aQ . 

Thus the m.g.f. of a non-central x? distribution is seen to be a convex- 
combination of y? m.g.f.’s with df. n,n+2,n + 4, ... The coefficients 
appearing in the convex combination are merely the Poisson probabilities. 

Hence by the uniqueness theorem of m.y.f.’s the p.d.f. of non-central y?- 
distribution with n d.f, and with — parameter A is given by 


fx) = _ 


A ply 244+ 2r)s 


~  ._ ‘ 
Ma + 2/2 7 ("3") 


is the p.d.f. of central x?-distribution with (n + 2r) d.f. 
Remarks 1. We can also write the m.g.f. of non-central x? distribution 
with non-centrality parameter A as 


E[(a-2)) 2” 
where Y is 4 Poisson variate with parameter i. 

2. If we take A =O >p;=O0Vi=1,2,..., , the m.g.f. of the non- 
central x? distribution reduces to the m.g.f. of centfal x? distribution; viz., 
(1 — 2¢)-*2 

3. Taking 2 = 0 in the p.d.f. of non-central x?-variate,, i.e., in (13-25), 
we get 


1.2 x 
where. pO? rs2) = “2¥ YA)2* 7 OS 42 <0 
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ao 
fo?) = Pu = sary & 22?! 0s 42 <0 


[-- we get contribution only when r = 0, the other terms vanish when A = 0]; 
which is p.d.f. of central y?-distributioa with n df. 


13-12-3. Additive or ee ped get of Non-central 
Chi-Square Distribution. /f Y;, (i = .. k), are independent non- 
k 


central 7°-variates with n; 4.f. and non-centrality element A,, then L Y; is also a 
isl 
: k k 
non-central 7°-variate with ¥ n; df.and non-centrality elementX= Y i,. 
jel ial 


Proof. We have from (13-26), 
My (0) = (1 — 21) *t? exp (204; / (1 - 20], G = 1, 2, ..., © 


 Msy(0= -TIm,0- (1 = 2 


which is the m.g.f. of anon-central ¥?-variate with Yn; d.f. and non-centrality 
8 
parameter A, = 2A,;. Hence by uniqueness theorem of m.g.f.’s 
8 


éxp (2¢ Ea, /(1 - 20), 


k 
2 Yi ~ En, Ad 


13-12-4, Cumulants of Non-central Chi-square Distribution. 
Cumulant generating function is ial by 


K, At) = log M, A(t) =- 5 log (1 = 20) + 20 (1 = 2¢)"! 
2. a\r 
5 | 2 + Cor. ie Gor + ./ 2a| 14204 (21)? +...4 (207+ al 


the expansion peing valid for ¢ < 1/2. 
° ‘ r-l 
K,/2(1) =(n+2A)\t+(n+4A) 2+... 4 (= ne 242"-1) +... 


=> K,, = Coefficient of in K. 72 = r| ¢ + 22) 271 
~ 2 2F-Mp 1) a oe .-.(13-27) 
Kp, = 272 (r—-2)! [m+ 2A (r- 1] 
& (K,-1) = 22 (7-2)! Ar-1) = 24 (r- DI 
K 
=n + 2An). [From (13-27)] 


=> K, = (n+ 2Ar) £ (Kr) . ..6(13-28) 
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(CONTINUED) 
(t, F AND Z DISTRIBUTIONS) 


14-1. Introduction. The entire large. sample theory was based on 
the application of “Normal Test” (cf. § 12-9). However, if the sample size n 


is small,the distribution of the various statistics, e.g., Z = 7% o 
o/Nn 


Z=(X ~ nP)IN nPQ etc., are far from normality and as such ‘normal test’ cannot 
be applied if n is small. In such cases exact sample tests, pioneered by W-S. 
Gosset (1908) who wrote under the pen name of Student, and later on developed 
and extended by Prof. R.A. Fisher (1926), are used. In the following sections we 
shall discuss 

‘@ t-test, (ii) F-test, and (iii) Fisher's z-transformation. 

The exact sample tests can, however, be applied to large samples also 
though the converse is not true. In all the exact sample tests, the basic 
assumption is that “The population(s) from which sample(s) are drawn is (are) 
normal, i.e.,the parent population(s) is (are) normally distributed.” 

14-2. Student’s ‘t’. Definition. Let x;,@ = 1, 2, ...,n) be a random 
sample of size n from a normal population with mean 1 and variance o?, Then 
Student's ¢ is defined by the statistic 


X- 
t= (14-1) 
SN n 
where Fa D x is the sample mean and 
%=—— Ff (x; -x )*, ,..(14-1a) 
n—-l1 on | 


is-an unbiased estimate of the population vanance o?, and it follows Student's 

t-distribution with v = (n — 1) df. with probability density function, 

f= nm tices ~-(14-2) 
vit 

Wa(} +) [i+] 


Remarks 1. A statistic ¢ following Student’s ¢-distribution with n d.f. 
‘will be abbreviated as ¢ ~ f,. 
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2. If we take v = 1 in (14-2), we get 


1 ] 
Kid ee ee ‘Gd +P)’ 
(3.3) 
1 1 1 
“Cen oe [-- PG) = Vx] 


which is the p.d.f. of standard Cauchy distribution. Hence, when v = I Student’s 
t distribution reduces to Cauchy distribution. 
14:2:1. Derivation of Student’s t-distribution. The expression 
(14-1) can be re-written as 
fa2 x — 1)? _ n(x — p)* 
$2 ns?/(n — 1) 
is 2 (e-p)? —1 (x-pr/(or/n) 
(n-1)  o4Yn ns2/o? ~ ns?/o* 
Since x;, @ = 1, 2, ..., 2) is a random sample from the normal population 
with mean 1 and variance o?, 


[-.- ns? = (n — 1) S?] 


Z~N(u, 0%) = =~ 0,1) 
oNn 


2 
, being the square of a standard normal variate is a chi- 


xX- 
Hence 
o2/n 


square variate with 1 d.f. 
2 
Also, a is a y2-variate with (n - 1) df. (cf, Theorem 13-5). 


Further since x ands? are independently distributed (c.f. Theorem 
13-5), a being the ratio of two independent y?-variates with*l and (n— 1) 


d.f. respectively, is a B. ¢ ; 1) variate and its distribution is given by : 


So 
] (Zhv)2 
aF(t) =———- d(12/v), 0 < 12 < 0 


a3 +) [, * epene 
2’ 2 ve [where v = (n- 1)] 


1 1 


om ft vy 2Yv+ne 
Vv B & 5) E 4 


the factor 2 disappearing since the integral from — ~ to co must be unity. This is 
the required probability function as given in (14-2) of Student’s ¢-distribution 
with v = (nm — 1) df. 

Remarks on Student’s ‘t’. 1. Importance of Student's t-distribution in 
Statistics. W.S. Gosset, who wrote under pseudonym (pen-name) of Student 


dt; — 0 <{<oo 
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defined his ¢ in a slightly different way, viz., t = (x —)/s and investigated its 
sampling distribution, somewhat empirically, in a paper entitled ‘The probable 
error of the mean’, published in 1908. Prof. R.A. Fisher, later on defined his 
own ‘t’ and gave a rigorous proof for its sampling distribution in 1926. The 
salient feature of ‘2’ is that both the statistic and its sampling distribution are 
functionally independent of o, the population standard deviation. 

The discovery of ‘t’ is regarded as a landmark in the history of statistical 
inference because of the following reason. Before Student gave his ‘?’ it was 


customary to replace o” in Z —- , by its unbiased estimate. S? to give 
o/Vn 

t= a and then normal test was applied even for small samples. It has been 

SIND 
found that although the distribution of ¢ is asymptotically normal for large n 
(cf. § 14-2-5), it is far from normality for small samples. The Student’s ¢ 
ushered in an era of exact sample distributions (and tests) and since its discovery 
many important contributions have been made towards the development and 
extension of small (exact) sample theory. 

2. Confidence or Fiducial Limits for p. If to.o5 is the tabulated value of ¢ 
for v= (n — 1) d.f. at 5% level of significance, i.e., 

P (lt'> tos) = 0:05 => PCItIS to.95) = 0-95, 

the 95% confidence limits for p1 are given by : 


lel Stoos, i.e., rrr S t0.05 
7 § gee ow 
= OO eee on ae 
Thus, 95% confidence limits for 1 are : 
GS ace —- [14-2] 
Similarly, 99% confidence limits for 2 are : 
Xt loo = »-L14-2(6)] 


where {o.9; is the tabulated value of ¢ for v = (n — 1) df. at 1% level of 
significance, 

14-2:2, Fisher’s ‘t? (Definition). It is the ratio of a standard normal 
variate to the square root of an independent chi-square variate divided by its 
degrees of freedom. If € is a N (0, 1) and x? is an independent chi-square variate 
with n df., then Fisher’s ¢ is given by 


2 
t=t J NE (143) 


and it follows student’s ‘rf’ distribution with n degrees of freedom. 
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14:2-3. Distribution of Fisher’s ‘t’. Since € and x? are 
independent, their joint probability differential is given by 


Lee exp (-47/2) (x2)? 
aF (&, x2) = exp (— &2/2 2 
Let us transform to new variates ¢ and u by the substitution 
Se and u= 2 => &=1Vuin and y2 =u 


Vx2in 


Jacobian of transformation J is given by 


ix ag, x2) 2) _ Vuln th Vun)|_ gy 


a ay u) 0 , 


The joint distribution of ¢ and u becomes 


1 t ie ; 
dG(t.u) = =e At iia du dt: 
) Von - T(n/2) wa wt Pope ee 


Integrating w.r.t. ‘u’ over the range 0 to oo, the marginal distribution of ¢ 
becomes 


dG ee) ee u (1 “ (a-1)/2 g dt 
ie V2n 2"? r(n/2) Vin {co x? 27 a” 


2 l T[(n + 1)/2] dt 
V2n 282 T'(n/2) ual (1 . a) form 
2 n 


’. dG, (4) = tfn+ DP) dt, — co <I < oo 
Vn P(Qn2)\0Q) E : aie ue 
n 


a ey —0oo < [ < 00 


t, 


- 1 n 12 [e+ 2 
Vn B (5-3) [2 + =| 


which 1s same as the probability function of Student's ¢-distribution with n f. 
Remarks f. In Fisher’s ‘t’ the d.f. is the same as the d.f. of chi-square 
variate. 
2. Student's ‘t’ may be regarded as a particular case of Fisher's ‘t’ as 
explained below. 


Since x ~ N (yu, 02/n), b= Xo N(O, 1) *) 


oNn 
and. 4? = nS yr ,~ x P/o2 ..(**) 
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is independently distributed as chi-square variate with (mn — 1) d.f. Hence Fisher’s 
tis given by 


pe & Nn (=) S 
Vx2H(n ~ 1) ™ VSG; ~2 Pn 1) 


7 Vn (x -u) _x = 1. (*%) 
5 SN'n 
and it follows Student’s t-distribution with (n —- 1) d.f. (c.f. Remark 1 above.) 


Now, (***) is same as rie ‘t? defined in (14-1). Hence Student's ‘?’ is 
a particular case of Fisher’s 


14-24, Constants ‘t-distribution. Since f(t) is symmetrical 
about the line ¢ = 0, all the moments of odd order about origin vanish, i.e., 


\t’o,41 (about origin) = 0; 7=0, 1, 2, ... 


In particular, 
pi,’ (about origin) = 0 = Mean 
Hence central moments coincide with moments about origin. 
ee Hors, = 9, (r= 1, a es) (14-4) 
The moments of even order are given by 
Ko, =’, (about origin) 


= | ” Pr fly ar=2f Pf dt 


oo 2r 
1 n 0 [ + al | A+ 
B (3 : 5 Na - 
This integral is absolutely convergent if 27 < n. 


2 1 n 
Putl+—=- => f=n(l- Le., 2tdt=-—3d 

ae (1 - y)/y yy 
When 0, y = 1 and when ¢ = oo, y = 0. Therefore, 


e 2 o ¢ =n, 
2° 3 
n 

—— | (12)(2r- U2 y {n+ 12) -2 gy 
Vna(t,2)° | 
bead hae 

Vn! (t=2\p-2 
ee (n + 1)/2}-2 
as Sle fy ee 
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a ee (2- er on >2r, ... 14-40) 


P[(n/2) - rl T(r +5) 
PG) Mn/2) 
w-bo@-3... 259PQ Tian -n 


"TE ((n/2) — 1ln/2) ~ 2)...U0n12) — FIPUn?2) — 
2r— 1) (2r — 3)...3-1 ne 


aa ee 2) (n -4).. ae 2r)’ 27 -oL14-4 (6) 

In particular 
tA 
[> aa = a2)” A? ,in > 2} .L14-4(c)] 

are es 114. 

and Ly =n? (n~2)(n~4)~(n-2)(n 4)’ »In>4] ...[14-4@] 
—-2 

Hence Bh = =0 and B= HE ike 


Remarks 1. As n > ©, B, = 0 and 


B, = lim 3 tex )=3 lim [4=Gind| ...[14-4(e)] 
a — oo n—> 0 


2. Changing r to (r — 1) in [14-4(6)], dividing and simplifying, we shall 
get the recurrence relation — the moments as 
_ nar -lI) an 
— a (n—2r) 2 ...[14-4(c)] 
3. Moment Generating Function of t-distr:bution. From 
[14-4(6)] we observe that if ¢ ~ ¢,, then all the moments of order 27 < n exist but 
the moments of order 2r > n do not exist. Hence the m.g.f. of t-distribution does 
not exist. 


Example 14-1. Express the constants yo, a and m of the distribution : 


aro) =y0| 1~ 75] dx, -asxSa se) 


in terms of its [to and Bo. 
Show that if x is related to a variable t by the equation 


_ een: Sane 
~ {2(m +1) +4 2)}12? 
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then t has Student's distribution with 2(m + 1) degrees of freedom. Use the 
sransformation to calculate the probability that t 2 2 when the degrees of 
freedom are 2 and also when 4. (Madras Univ. M.Sc., 1991) 


Solution. First of all we shall determine the constant from the 
consideration that total probability is unity. 


zd x? 
: vf” (1 - a) a =] 
2 nC =r) 1 
=> = -_ = 
Yo ‘ 2 


(-.- Integrand is an even function of x) 


r/2 

=> 2yo | cos" @,.acos@d@ =1 (x =a sin 6) 

1/2 
=> ayo | cos***+!@d@ =1 

0 
But we have the Beta integral, 

n/2 

2 ip sinP@cost0d@0 =B eH ce (A) 


7/2 
ayy.2 J, cos 2"*+1 @sin® @d@ =1 


> ayo B(m + 1,5) =] [Using (1)] 
1 
-3 Yo = 7 . (2) 
aB(m+1,35) 
Since the given probability function is symmetrical about the line x = 0, 
we have as in § 14-2-4. 
Mone 1 =o 41=0;r=0, Li; 2. ooo [-.- Mean = Origin] 
The moments of even order are given by 
Lo, = [ly (about origin) 


=|" x?" f(x) dx = = yo | “eu ~ =) 


2 
= 2Yo xr {i- zy. ax 
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m/2 
= 2yo J (a sin 0)” cos*” 0. acos 0 d@ [ x =a Sin 6) 


m/2 
=yoa%*1.2 J sin” @ . cos***! 6 d@ 


= yoa¥*! Bir +5, m + 1) [Using (1)}) 


| 1 3 
l r+t)r(m +3) 
+= + 2 2 
B(r a> m 1) | Q 


= g¢————_——_=4 


: 
B(m + 1, 9) rm +r+S)r( 5 | 


| P{m + (3/2)) . £1(1/2) ge 
In particular, Ha= a. (372)) Fims B/D) FU/D = 2m 43 
@ = (2m + 3) ,..(3) 


7 T(5/2) T'(m + (3/2)} 
Also Ha = 4" Tim + (7/D)*~ Td?) 
34 


(#8) 


= (2m + 5) (m+ 3) (On simplification) 


B = = 3(2m + 3) 
~ (2m + 5) 


= 5 ; 
= ms = 5b (On simplification) ... (4) 


Equations (2), (3) and (4) express the constants yo, a and m in terms of 
H2 and B,. 
a | ee an ae 
~ [2(m + 1) + 22]'/2 ae 2(m+1)+ 0 


: x? (m+ 1) 
Lé., LL 2°= are Dara -(14' =) (n = 2m + 2) 
dt 1 2tdt 


wal eect 
(ne Py one? 


Os —_—1—__. 
Hence the p.d.f. of X transforms to 


1 
a) = yy ———— .-2 SF _— 
jis =] ee al 


dt 
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e——!t ea 
; Vali pr 
aBlm+1, 35 n 
1 
ny OS <oo ...(5) 
aL 1+ — 
ae) eS 


phi is the probability differential of Student's ¢-distribution’ with 
= 2(m + 1) d.f. Hence the result. 


ss 2 df. i.e., n = 2, we get 2m +1)=2 => m= 0. Hence from (**), 
we get (for m = 0), 


ae = ee when t = 2 
(2+ pn a 


P(t>2)=P (x >VQf)a) = Som 


a l : 
A, NZ NNB END 
2a 3 v3 
Bd, ky = ELEC . _ Tap) 
ie °27" TGR) ~ (1/2) Td/2)~ 2| 


For 4 d.f., i.e., n = 4, we get m = 1. Proceeding exactly similarly we 
shall obtain 


EXERCISE 14(a) 
1. (a) Given that 
(@) wis normally distributed with zero mean and unit variance, 

' (ii) v*hasa chi-square distribution with n degrees of freedom, and 

(iii) uand y are independently distributed, 
find the distribution of the variable 

uNn 
~ oy 

(b) Find the variance of the ¢ distribution with a degrees of freedom, (n > 2). 


(c) If the variable ¢ has Student’s ¢ distribution with 2 degrees of freedom, 
prove that 
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| [Shivaji Univ. B.Sc., 1990] 

2. (a) State, (without proof), the sampling distribution of Student’s ¢t. Who 
discovered it ? 

(b) ‘Discovery of Studént’s ¢ is regarded as a landmark in the history of 
Statistical inference’. Elucidate. . 

(c) Let ¢ be distributed as Student’s ¢-distribution with 2 d.f. Find the 

probability P(- V2 < 1 < V2). 

3. (a) Show that 


P(t 22) = 


fri2 Tr bi a _T k-r 
E(T’) = : if ri f 1 k 
T(/2) . F/2) , ifr is even for-1l<r< 
0, if ris odd 
where T has Student’s t-distribution with k degrees of freedom. 
(6) For the ¢-distribution with n d.f., establish the recurrence relation 
n (2r— 1 
[oy ew Mor2,m> 2r 
(Poona Univ. B.Sc., 1990; Delhi Univ. B.Sc. (Stat. Hons.), 1992) 
(c) For how many d.f. does (i) %?-distribution reduce to negative exponential 
distribution and (ii) t-distribution reduce to Cauchy distribution ? 
4, Suppose X,, X2, ...,X, (m > 1) are independent variates each distributed 
as N (0, 6). Find the p.d.f. of 


12 1/2 
wen / fh z xe 


. 


Why does not W follow the ¢-distribution ? 
[Dethi Univ. B.Sc. (Stat. Hons.), 1988] 
5. Let x), X2, ..., X, be independent observations from a normal universe 
with mean yt and variance o? and let x and s? be the sample mean and sum of the 
Squares of the deviations from the mean respectively. Let x’ be one more 
Ibseryation independent of previous ones. Show that . 


x= E| ale = Df 
S n+1 
has a Student ¢-distribution with (n — 1) degrees of freedom. 
(Delhi Univ. B.Sc. (Stat. Hons.), 1989) 
6. (a) Let X, and X, be two independent normal variates with the same ‘ 
normal distribution N(p, o”). Obtain the distribution of 
? 7 X 1+ », ¢ y ha 2u 
VIX, —X, 2 


Ans. Standard Cauchy distribution. 
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(b) If X is t-distributed with k degrees of freedom, show that 


1 
1 + (X2/k)’ 
has a beta distribution. 
[Dethi Univ. B.Sc. (Maths. Hons.), 1988) 
7. Define Student’s t-statistic and state its probability density function. 
If x; @ = 1, 2, ..., n), is a random sample of n independent observations 
from a normal population with mean 1 and variance o?, show that 


yo Ewan 1) 
7 (x;- x )? 


in] 
conforms to Student’s ¢-variate. If x is an additional observation drawn 
independently from the same normal population, show that 


~—_(*¥-x) ,./niv-1)) 


a n+ 1 
\) 2 (x,;-x ) 


also conforms to Student’s ¢-variate. 


1 a 


t= 1 


8. Let X,,X2, ...,X, be a random sample from N(1, G2), and X and S?, 
respectively, be the sample mean and sample variance. Let X, .; ~ N (ut, 6), 
and assume that X,, Xo, ..., Xn, Xa +1 are independent. Obtain the sampling 
distribution of 


| eee n = 
U = —————— ye fe aS STS x. FP 
i=] 
9. If the random variables X, and X2 are independent and follow chi-square 
Vn (X 1 — X9) 


distribution with n d.f., show that is distributed as Student’s ¢ 


2VX,X> 


[Calcutta Univ. B.Sc. (Hons.), 1992} 
el + x2)/2 x"! xf! : 


with n d.f., independently of X, + X>. 


: 1 
Hint. p(x, x2)= 2 (nae : 
0 SX, <0, 0S x2 < 0% 
Na Nan (% - X2) 
Vx; X2 


and v=x) + x2 


m= L - 
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Jacobian of transformation is J et ee 
(4) Vn [1 + u2/n]3?2 


The joint p.d.f. of U and V becomes 


1° e -wW2 y a-1 
gu, v) = p(-'), X2) [J l= u2/n)(a+i)/2 > 


224-1 F(n/2) T(n/2) Vn (+ 


—-o<cuco, J0Syv<o 


Tn = 27" T(n/2) r(* ae ) V fr => T(n/2) -—Iavn , we get 


na n+ 
pr(tg 
22-1 T(n/2) T(n/2) Vn = pane red \va 
piett : 
2 
=2°Vn VnB G, n/2) [.. Vr=rQ] 
ry Ok ERs ee, | en, cn ce eee eae! FA 
g(u, v) Gua v Je exe ( ay 
1+ a | 


O<v< Co, = 00 < UC o9, 
10. Let X;, X2, ... Xm , and Y;, Yo, ves , Y, be.independent random samples 


Using Legender’s duplication formula, viz., 


from N(1, 6”) and N (to, G7), respectively. if X and Y denote the 
corresponding sample means and if 


(m — 1817s = (&; ~xy, (n - 1) S, = * (Y; ~Y)2, 


obtain the sampling distribution of 
a(X - 1) + b(Y - 2) 


(m - 1) $2 + (n - 1) S27] (@ , o]] 
(m +n — 2) AC n| 


where a and 6 are two fixed real numbers. 
[Delhi Univ. B.Sc. (Stat. Hons.), 1989) 


11. If J, @, qg) represents the incomplete Beta function defined by 


| z 
1. @. o-a5). (1 -at' dt; p>0,q>0, 


show that the distribution function F(.) of Student’s t-distribution is given by 


F(t) = 1-21, h ey 
() = 1-5 = p)ewhere x=(1 + 6 : 


|Delhi Univ. M.Sc. (Stat.), 1990; Nagpur Univ. M.Sc. (Stat.), 1991} 
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Hint. If f(.) is p.d.f. of t-distribution with n d.f, then 
t oo 
FQ) = J flu) du = 1-| flu) du 
— oo t 


= 2 \ (a+ 
=1-——_1—__[ (1+ ey Oe i 
Vn (3 aN " 


y ae} 
0 ~12 
= 1 eee ) 2 Ben A= 2) dz, 
2B(+, 2 (: an 

where E = 1 + a 

2 n 
x 2 *!i1 

= ] —— a i 2(2)-1 () — 2)-12 gz, E = ( + = 


1h 
20(3. 5) 

! n 1/y) 
1-5 4(5> 5) 


12. Show that for ¢-distribution with n d.f., mean deviation about mean is 


given by 
Vn r(25*) [Varo 


(Shivaji Univ. B.Sc. Oct., 1992] 
Hint. E(¢) = 0. 


MD. about mean = J Leif de 


= 1 ~ Ieldt 
Layee ee 
Vn B 5° 7 (1 + 4 
o 2 [ tdt 
0 2 \(ati 
Vn B 4 (1+ ©) : 
2 n 


noe rt (E- >) 


0 


| 


‘ 


it 

_ 
by 
, Ve 
Nn]! 

pat 
feed 
aa 


l earn 
8.5) 
13. If X ~ tq), show that 
(n — 5) log E + | ~x7 0)» 
ee 
.for large n. 
You may assume that for large n, 
n. 1 


27 2 


NE (2) 


14. If X and G? = S? be the usual sample mean and sample variance based 
on a random sample of n observations from ,N(u, 02), and if T = (X -p) 
Vn/S, prove that 
(i) Var (T) = (@@- Di - 3) 
(i) Cov & ;T) =o MAL I= D2} 
V2n TU(n - 19/2} 
ode tee ae 
Gi) r&,H=l@-3) The-aV/riy@ -v) 
14:2-5. Limiting Form of t-distribution. Asn — ©, the p.df. of 
t-distribution with n df. viz:, 


2 \(n +1)/2 
AQ =—_1—— (i+ Sy" bs et Fer oe 
Vn B (7 . is V20 


Proof. lim ——)——= lim +s Pi(n + 1/2) 
R—> oo Vn B (3 A aro VN eaEO) 


1 4(t fe 
Va Va\ 2) Von 
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« lim fo= lim cee es lim (1 1+ as ay 
i Rn—> co 1 nm) 27>” 
Vn B & 5 | 


2\1 
x lim (1+ 2)? 
n— Rn 


— exp (— 12/2), -x2 <t< co 


V2n 


Hence for large d.f. t-distribution tends to standard normal distribution. 
14:2-6. Graph of t-distribution. The p.d.f. of ¢-distribution with 


d.f. is 
2 -@+1)/2 
go=c.[ 1+ “| bai ,- 0 << oo 


Since f (— #) = f(a), the probability curve is symmetrical about the line ¢ = 0. 
As ¢ increases, f(t) decreases rapidly and tends to zero as ¢ — », so that f-axis is 
an asymptote to the curve. We have shown that 
ah. 3(n ~ 2) 
bl, = »n>2; ay 4)? n>4 


Hence for n > 2, {t) > Li.e., the variance of t-distribution is greater than 
that of standard normal distribution and for n > 4, B2 > 3 and thus t-distribution 
is more flat on the top than the normal curve. In fact, for small n, we have 


P[ tela] 2P[iZl2e], Z~N 0; 1) 


i.e., the tails of the t-distribution have a greater probability (area) than the tails 
of standard normal: distribution. Moreover we have also seen [§ 14-2-5] that for 
large n-(d,f.), t-distribution tends to standard normal distribution. 


f(t) 


Normalicurve 
n= 7 


-O-, -3 2 -f) tO 44°42 430 +4 ow 


14-2:7, Critical Values of t. The critical (or significant) values of ¢ 
at level of significance a and df. v for two-tailed test are given by the equation: 


PEiti>t(@]=a .»(14-5) 
=> P(t(tlst()J=l-a ..(14-5a) 
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CRITICAL VALUES’ OF t-DISTRIBUTION 


Rejection 


Rejection 
region («/2) 


aay. g 


The values ¢t, (a) have been tabulated in Fisher and Yates’ Tables, for 
different values of a and v and are given in the Appendix.at.the end of the book. 
Since ¢t-distribution is symmetric about ¢ = 0, we get from (14-5) 
P (t>t,(@)] +P [t<-t,(@)] =a 


=> 2P [t>t,(a)] =a 
=> P(t >t, (@)] =a/2 
= P[t>t,(2a)] =a .»( 14-55) 


ty (2a) (from the Tables in the Appendix) gives the significant value of ¢ for 
a single-tail test, [Right-tail or Left-tail-since the distribution is symmetrical], at 
level of significance a and v d/f. 

Hence the significant values of t at level of significance ‘a’ for a single 
tailed test can be obtained from those of two-tailed test by looking the values at 
level of significance ‘2a . 


_ For example, 
ts (0-05) for single-tail test = tg (0-10) for two-tail test = 1-86 
t:5(0-01) for single-tail test = t,; (0-02) for two-tail test = 2-60. 
14-2-8. Applications of t-distribution. The ¢-distribution has a 
wide number of applications in Statistics, some of which are enumerated below. 
(?) To test if the sample mean ( x ) differs significantly from the 
hypothetical value j1 of the population mean. 
(ii) To test the significance of the difference between two sample means. 
(iit) To test the significance of an ve sample correlation co-efficient 
and sample regression Coefficient. 


(i v) To test the significance of ened partial and multiple corrélation 
coeffirients, 


{n the following sections we will discuss these applications in detail, one 
by one. 

14-2-9. t-Test for Single ren pees we want to test : 

(i) if a random sample, x; (i= ., n) Of size n. has been drawn from a 
normal population with a anon mean, n, say [Uo Or 

(ii) if the sample méan differs significantly from the-hypothetical vahie Lo: 
of the population mean. 

Under the null hypothesis Ho : 
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(i) The sample has been drawn from the population with mean pL 


or (ii) there is no significant difference between the sample mean x and the 
population mean jl, 


the statistic t~ 7 —Ho ...(14-6) 


= (x; —x )?, ...[14-6(a)] 


ea | 
where x= 


follows Student’s ¢-distribution with (n — 1) df. 


We now compare the calculated value of ¢ with the tabulated value at certain 
level of significance. If calculated | ¢ | > tabulated ¢, null hypothesis is rejected 
and if calculated | ¢ | < tabulated ¢, Hy may be accepted at the level of significance 


adopted. 


Remarks 1. On computation of S* for numerical problems. \f x comes 
out in integers, the formula:(14-6a) can be conveniently used for computing S?. 
However, if x comes in fractions then the formula (14-64) for computing S? is 
very cumbersome and is not recommended. In that case, step deviation method, 
given below, is quite useful. 


If we take, d;= x;-A, where A is any arbitrary number then 


S=F i | 1-27] | Lx;7- (2) | .»-L14-6(5)] 
j2 
= | d?- =| .-[14-6(0)] 
since variance is independent of change of origin. 
Also, in this case X= A +e [14-60] 
2. We know, the sample variance 
2 =+5 (5-27 
ns 
ns =(n-1)S? 
S&S $s 
=> 7 oe ..L14-6(e)] 


Hence for numeyjcal problems, the test statistic (14-6) on using [14-6(e)] 
becomes 


Pea. pee de | a [14-60] 
VSin Vs%in-1) *" 
3. Assumptions for Student’s t-test. The following assumptions 
are made in the Student’s f-test : 
(i) The parent population from which the sample is drawn is normal. 
(ii) The sample observations are independent, i.e., thé sample is random. 
(iii) The population standard deviation o is unknown. 
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Example 14-2. A machinist is making engine parts with axle diameters 
of 0-700 inch. A random sample,of 10 parts. shows a mean diameter of 0-742 
inch with a standard deviation of 0-040 inch. Compute the statistic you would 
use to test whether the work is meeting the specifications. Also state how you 
would proceed further. 


Solution. Here we are given : - 


ut = 0-700 inches, x:= 0-742 inches, s = 0-040 inches and n= 10 
Null Hypothesis, Ho : = 0-700, i.e.. the. product is conforming to 
specifications. 
Alternative Hypothesis, H, : 1 # 0-700 
Test Statistic. Under Ho, the test statistic is : 


‘ S2/n 1 s2/(n - 1) 
_ ¥9(0-742 - 0-700) _ 
Now t= 0.040 = 3.15 


How to proceed further. Here the test statistic ‘:’ follows Student’s ¢- 
distribution with 10 — 1 = 9 d.f. We will now compare this calculated value with 
the tabulated value of ¢ for 9 d.f. and at certain level of significance, say 5%. Let 
this tabulated value be denoted by &. 


(i) If calculated ‘t’ viz., 3-15 > t), we say that the value of fis significant. 
This implies that x differs significantly from 1 and Hp is rejected at this level of 
significance and we conclude that the product is not meeting the specifications. 

(ii) If calculated ¢ < to, we say that the value of ¢ is not significant, i.e., 
there is no significant difference between x and pt. In other words, the-deviation 
( x -{1) is just due to fluctuations of sampling and null hypothesis Hy may be 
retained at 5% level of significance, i.e., we may take the product conforming 
to specifications, 

Example 14-3. The mean weekly sales of soap bars in departmental 
stores was 146-3 bars per store. After an advertising campaign the mean weekly 
sales in 22 stores for a typical week increased to 153-7 and showed a standard 
deviation of 17:2. Was the advertising campaign successful ? 

Solution. Weare given : n = 22, x = 153-7, s =17-2. 

Null Hypothesis. The advertising campaign is not successful, i.e., 

Ho: = 146- 3 
Alternative Hypothesis. H, : > 146-3.(Right-tail). 
Test Statistic. Under the null hypothesis, the test statistic is : 


fas nis 
V s(n - 1) 


Nee 2153-7 - 146-3 _ 7-4 x V21 _ og, 


Vazapni . (172 
Conclusion. Tabulated value of ¢ for 21 df. at 5% level of significance 
for single-tailed test is 1-72. Since calculated value is much greater than the 
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tabulated value, it is highly significant. Hence we reject the null hypothesis and 
conclude that the advertising campaign was definitely successful in promoting 
sales. 

Example 14-4. A random sample of 10 boys had the following 1.Q.'s : 
70, 120, 110, 101, 88, 83, 95, 98, 107, 100. Do' these data support the 
assumption:of a population mean IQ. of 100 ? Find a reasonable range in which 
most of the mean I.Q. values of samples of 10 boys lie. 

(Madras Univ. B.E., April 1990] 

Solution. Null hypothesis, Ho: The data are consistent with the 
assumption of a mean I.Q. of 100 in the population, i.e., 4 = 100. 

Alternative hypothesis, H, : 1 # 100. 

Test Statistic. Under Ho, the test statistic is : 


(x=), 
VS2/n (n—1)> 
where x and S? are to be computed from the sample values of I.Q.’s. 
CALCULATIONS FOR SAMPLE MEAN AND S.D. 


t= 


Heicens 10,3 = 22 = 97-2 and $2 = 183500 203-73 


197-2-100! _2-8 2:8 


Tabulated f9.95 for (10 — 1) i.e., 9 d.f. for two-tailed test is 2-262. 


Conclusion. Since calculated ¢ is less than tabulated t.95 for 9 d.f., Ho may 
be accepted at 5% level of significance and we may conclude that the data are’ 
consistent with the assumption of mean I.Q. of 100 in the population. 

The 95% confidencé limits within which the mean I.Q. values of samples 
of 10 boys will lie are given by 


X ttoo5 S/N n = 97-2 £2:262 x 4-514 


14.20 Fundamentals of Mathematical Statistics 


= 97.2 + 10-21 = 107-41 and 86-99 
Hence the required 95% confidence interval is [86-99, 107-41]. 


Remark. Aliter for computing x and S?. Here we see that x comes 
in fractions and as such the computation of (x —x )* is quite laborious and time 


consuming. In this case we use the method of step deviations to compute x and 
$2, as given below. 


Here d =X —A, where A = 90 


= =A +1342 9042972 
‘i 92 =| aa. 5 | 2352- CA) = 293-73 


Example 14-5. The heights of 10 males of a given locality are found to 
be 70,67, 62, 68, 61, 68, 70, 64, 64, 66 inches. Is it reasonable td believe that 
the average height is greater than 64 inches ? Test at 5% significance level, 
assuming that for 9 degrees of freedom P (t > 1-83) = 0-05. 


Solution. Null Hypothesis, Ho: p = 64 inches. 
Alternative Hypothesis, H, : 1 > 64 inches. 


CALCULATIONS FOR SAMPLE MEAN AND SD. 


Aan 62 68 61 68 70 64 164. 66 | Total 
660 


60, 
10 


De: 
n 
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1 =, - 90 
§2 = a4 d (x -x) 9 10 
Test Statistic. Under Ho, the test statistic is 


t= bb _ 66-64 _ / 
= S2/n ~ Vio/10 10/10 
which follows Student’s ¢-distribution with 10 -1=9 df. 

Tabulated value of ¢ for 9 df. at 5% level of significance for single (right) 
tail-test is 1-833. (This is the value ¢p.19 for 9 df. in the two-tailed Table given 
in the Appendix.) 

Conclusion. Since calculated value of ¢ is greater than the tabulated value, 
it is significant. Hence /7, is reiected at 5% level of significance and we conclude 

iat ine average height is greater than 60 inches. 

Example 14-6. A random sample of 16 values from a normal population 
showed a mean of 41-5 inches and the sum of squares of deviations from this 
mean eqial to 135 square inches. Show that the assumption of a mean of 43-5 
inches for the population is not reasonable. Obtain 95 per cent and 99 per cent 
fiducia. limits for the same. 


Yu may use the following information from statis:ical tables : 
5 \ = 0-05, t = 2-131 
VE) |p = 0-01, t = 2-947 


Solution. We are given n = 16, x = 41-5 inches and 
‘Sx - x)? = 135 ” inches. 


92 =- = 


7 Ux - xP~ets =9 => S=3 


Null Hypothesis, Hy: \ = 43-5 inches, i.e., the data are consistent with 
the assumption that the mean height in the population is 43-5 inches. 
Alternative Hypothesis, H, : 1 # 43-5 inches. : 


Test Statistic. Under Ho, the test statistic is : 


x-w tase 
SINn 


141-5 — 43-5 1_8_ 
Now l¢l= ry” i 3 = 2-667 
Here number of degrees of freedom is (16 — 1) = 15. 


We are given : 
to.os for 15 d.f. = 2-131 and to; for 15 d.f..= 2-947 
Conclusion. Since calculated | t ! is greater than 2-131, null hypothesis is 
rejected at 5% level of. significance and we conclude that the assumption of mean 
of 43-5 inches for the population 1s not reasonable. 
Remark. Since calculated | ¢ | is less than 2-947, null hypothesis 
(ut = 43-5) may be accepted at 1% level of significance. 
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95% fiducial limits for j : (d.f. = 15) 
Et toos Xe = 415 $2131 x2 = 415 + 1598 


Vn 
39-902 < p < 43-098 


99% fiducial limits for pp : df. = 15) 
Et too, X DE = 41-5 + 2947 x >= 43-71 and 39-29 


aps 4 
39-29 <p < 43-71 


EXERCISE 14(b) 


1. (a) Write a short note on: Student’s ¢-distribution and point out.its uses. 

(6) Show how the ¢-distribution has been found useful in testing whether 
the mean of small sample is significantly different from a hypothetical value. 

(c) It is desired to test the hypothesis that the mean of a normal population 
iS [1 = Uo against the alternative that 11 #[lo. Explaining the assumptions 
involved, develop the statistic suitable for testing this hypothesis if the size of 
the ample is small. What modification do you suggest when the sample size is 
large ? 

2. What is a test of significance ? 

To test the hypothesis that the mean of a normal distribution is zero, two 
independent observations x, and x2 are taken from the distribution. Show that 
the hypothesis is rejected at 10% level of significance, using ¢ test with equal 
tail ends, if 

Ix, + Xo I> lx, ~X2 l tan 81° 

3. It is required to test that the mean of a normal population is zero. A 
random sample drawn from the population -gives the values x), X2, ..., X,. 
Show that the t-test for acceptance of the hypothesis reduces to 


n. to te 
(,2, =) s <asG-D (2,7) 


wliere ¢, is-the value of Student’s ¢ at the desired level of significance @ for 
(n-l) df. 

4, (a) Find the Student’s ¢ for following variate values in a sample of 
eight : —4, -2, -2, 0, 2, 2, 3, 3, taking the mean of the universe to be zero. 
How would you proceed further ? 

(b) Ten individuals are chosen at random from a normal population and their 
heights are found to be 63, 63, 66, 67, 68, 69 70, 70,71, 71 inchés. Test if the 
sample belongs to the population whose medn heights is 66” 

[Given to.95 = 2-62 for 9 df.) 

(c) A random sample of 9 experimental animals under a certain diet gave the 
following increase in weight : Lx; = 45 Ibs, 2x7 = 279 Ibs., where x; denotes 
the increase in weight of the ith animal. Assuming that the increase in weight is 
rormally distributed as N (pu, 0) variate, test Ho : 1 = 1 against H, : pp #1 at 
5% level. Given P (\¢ | > 2-306) = 0-05 for 8 degrees of freedom. 

(Calcutta Univ. R.Sc. (Maths.Hons.), 1991) 
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5. A manufacturer of gunpowder has developed a new powder which is 
designed to produce a muzzle velocity equal to 3000 ft/sec. Seven shells are 
loaded with the charge and the muzzle velocities measured. 

The resulting veiocities are as follows : 3,005; 2,935; 2,965; 2,995; 3,905, 
2,935; and 2,905. Do these data present sufficient evidence to indicate that the 
average velocity differs from 3,000 ft./sec. 

6. The average length of time for students to register for summer Classes at 
a certain college has been 50 minutes with a standard deviation of 10 minutes. A 
new registration procedure using modem computing machines is being tried. Ifa 
random sample of 12 students had an average registration time of 42 minutes 
with s.d. of 11-9 minutes under the new system, test the hypothesis that the 
population mean has not changed, using -05 as level of significance. 

7. The nine items of a sample had the following values : 45, 47, 50, 52, 
48, 47, 49, 53 and 51. 

Does the mean of the nine items differ significantly from the assumed 
population mean of 47-5 ? Given that 

re 0-945 for «= 1-8 
"=" |P = 0-953 for t = 1-9 

8. A time study engineer developed a new sequence of operation elements 
that he hopes will reduce the mean cycle time of a certain production process. 
The results of a time study of 20 cycles are given below : 

cycle time in minutes 
12-25 11-97 12. 15 12: O08 12-31 12-28 11:94 11-89 12-16 12-04 
12:09 12-15 12-14 12-47 11-98 12-04 12-11 12-25 12-15 12-34 

If the present mean cycle time is 12-5 minutes, should he adopt the new 
sequence ? 

9. (a) The average breaking strength of steel rods is specified to be 18-5 
thousand pounds. To test this a sample of 14 rods was tested. The mean and 
standard deviations Obtained were 17-85 and 1-955 thousand pounds respectively. 
Is the result of the experiment significant ? Also obtain the 95 per cent fiducial 
limits from the sample for the average breaking strength of stee} rods. 

(6) A sample of 9 shafts is inspected from a production line. The following 
measurements are the diameters (in mm.) of shafts : 45-010, 45-020, 45-021, 
45-015, 45-019, 45-018, 45-020, 45-023 and 45-005. If the production line 
meets the specifications laid by the I.S.L., with S.D. 0-006 mm, estimate the 
95% confidence interval within which the true diameter of the shaft lies. 

(Madras Univ. B.E., 1989] 

10. a random sample of 8 envelopes is taken from letter box of a post 
office and their weights in grams are found to be 12-1, 11-9, 12-4, 12-3, 11-9, 
12-1, 12-4, 12-1. 

(a) Find 99% confidence limits for the mean weight of the envelopes, | 
received at that post office. ; 

(b) Using the result of part (a), does this sample indicate at 1% level that 
the average weight of envelopes received at that post office is 12-35 gms. 

11. A random sample of nine from men of a large city gave a mean height 
68 inches and the unbiased estimaté of the population variance found from the 
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sample was 4-5 inches. Proceed as far as you can to test for a mean height of 
68-5 inches for the men of the city. Also state how you would proceed further. 

14-2-10. t-Test for Difference of Means. Suppose we wait to test 
if two independent samples x; (= 1, 2, ...,1;) and y, (= 1, 2, ..., m2) of sizes 
n, and nz have:been drawn from two normal populations with means [ly ane [ly 
respectively, 

Under the null hypcthésis (Ho) that the samples have been drawn oe the 
normal populations with means [ly and [ty and under the assumption that the 
population variance are equal, i.e., Oy” = Oy” = 5? (say), the statistic 


_ (x - y )- (x - Hy) 


..(14-7) 
dg ke 
mn nM, 
me bee a ee 
where a Xi Ue 2) 
1 = - 
ee —_ 2 = 2 ; 
and S ie hk (xj; -x )* + XY; y | ..[14-7(a)] 


is an unbiased estimate of the common population variance 07, follows 
Student's t-distribution with (n, + n> — 2) Lf. 


Proof. Distribution of t defined in (14-7). 
& = _(x-y )-E(x-y) _ ~N (0, 1) 
VVC -y) 
But E(x-y) =E(x)-E(yY) =x -Hy 


ad V(x -y) =V(x)+V(y) 
[The.covariance term vanishes since samples are indepéndent.] 


_ Ox’ oy? _ 2f1,1 
= a + a o fs + ) (By assumption) 
_ —-: 
E = - {2-7 )—Wx-Hy) _ )= x =H) (0, 1) se) 


| % 
let = 2, (4-2? + Ey 01-79) H o 
t= = 
2 


= = NSy? 7 
-[s (x; - x v0? | + ls (yj-y VI 0? | = + ee 
8 J. 
sas(**) 
_ Since nsx2/o? and nosy7/o? are independent y2-variates with (m, — 1) and 
(n> — 1) d.f. respectively, by the additive property of chi-square distribution, x? 
defined in (**) is a y2-variate with (mn, — 1) + (np - 1), Le., my tng -2 df. 


- 
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Further, since sample mean and sample variance are independently distributed, ¢ 
and x? are independent random variables. 
Hence Fisher’s ¢ statistic is given by 


WY) 
Peres EGF +E (yj-¥ )? 0] 


ta 


[Ber-# 24 B0;-F)2| 
i j 


o nS eae 


and it follows Student’s ¢-distribution with (1, + nz — 2) d.f. (cf. Remark ° 
§ 14-2-3, page 14-4). 


Remarks 1. S?, defined in 14-7(a) is an unbiased estimate of tt 
common population variance 67, since 


E(S2) “Tampa de | Mei F 24 Bo, - FF | 
: / te 
wre n= E[(m - 1) Sx? + (m2 - Sy] 
= ape Le - DESPA) + (2 ~ DEY?) 
=a lim - Dot + (- No? = 0? 


2. An important deduction which is of much practical utility is discuss 
below : 

Suppose we want to test if : (a) two independent samples x; (i = 1, 2,. 
n;), and y; j = 1, 2, ..., m2), have been drawn from the populations with sat 
means or (b) the two sample means x and y differ significantly or not. 

Under the null hypothesis Ho that (a) samples have been drawn from ti, 
populations with the same means,i.e., Uy = Uy or (b) the sample means x ai 
y do not differ significantly, [From (14-7)] the siatistic : 
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(tm) 
—+ 
mh ™& 
where symbols are defined in (14-7a),follows Student’s ¢-distribution with 
(ny + Nz — 2) df. 

3. On the assumption of t-tést for difference of means. Here we make the 
follewing three fundamental assumptions : 

(i) Parent populations, from which the samples have been drawn are 
normally distributed. 


(ii) The population variances are equal and unknown, i.e., Oy? = dy” = 07, 
(say), where o2 is unknown. 


(iii) The two samples are random and independent of each other. 


Thus before applying ¢-test for testing the equality of means it is 
theoretically desirable to test the equality of population variances by applying F- 
test. If the variances do not come out to be-equal then ¢-test becomes invalid and 
in that case Behren’s ‘d’-test based on fiducial intervals is used. For practical 
problems, however, the assumptions (A) and (ii) are taken for granted. 


4. Paired t-test For Difference of Means. Let us now consider the case 
when (i) the sample sizes are equal, i.é., ny = nz =n (say), and (ii) the two 
samples are not independent but the sample observations are paired together, i.e., 
the pair of observations (x;, y;), (i = 1, 2, .... 2) corresponds to the same (ith) 
sample unit. The problem is to test if the sample means differ significantly or 
not. 

For example,suppose we want to test the efficacy of a particular drug, say, 
for inducing sleep. Let x; and y;(i = 1, 2, ..., n) be the readings, in hours of 
sleep, on the ith individual, before and after the drug is given respectively. Here 
instead of applying the difference of the means test discussed in § 14-2-10, we 
apply the paired /-test given below. 

Here we consider the increments, d; = x; -— y;, (i= 1,2, ..., n)- 


Under the null hypothesis, Ho that increments are ae to fluctuations of 
Sampling, i.e., the drug is not responsible set these increments, the statistic. 


[-.- Hx = My, under Ho) (14-8) 


t= (14-9 

SIN n Sled 

where ee > d; and S*= i ¥ @-_dy .--[14-9@)] 
Nia) n-1;0) 


follows Student’s ¢-distribution with (n — 1) d.f. 
_ Example 14-7. Below are given the gain in weights (in lbs.) of pigs fed 
on two diets Aan B. 
Gain in weight 
Diet A : 25, 32, 30, 34, 24, 14, 32, 24, 30, 31, 35, 25 
Diet B : 44, 34, 22, 10, 47, 31, 40, 30, 32, 35, 18, 21, 35, 29, 22 
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Test, if the two diets differ significantly as regards their effect on increase in 
there is no significant 


weight. 
Solution. Null hypothesis, Ho : Wx= Uy, fe 
difference between the mean increase in weight due to diets A and B 
Diet B 


Alternative hypothesis, H,: Uy # Wy (two-tailed) 
Diet A 


Total 


Total 


—2 


} 
Ly = 450 > 
J 


Here 
Lx = 336 and 
X“y-y)7= 2410 


L(x -x )2 = 380 
=_ 336 49 ~_450_ 


[sor -¥)? + DEV - 7 )?| =71.6 


a ereey 
e 28 — 30 
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v 10-74 
Tabulate 9.95 for (12 +15 — 2) = 25 d.f. is 2-06. 
Conclusion. Since calculated | ¢ 1 is less than tabulated t, Hy may be 


accepted at 5% level of significance and we may conclude that the two diets do 
not differ significantly as regards their effect on increase in weight. 


= — 0-609 


‘Remark. Here x and y come out to be integral values and hence the direct 
method of computing ©(x-x )? and L(y + y )?2 is used. In case x and (or) y 
comes out to be fractional, then the step deviation method is recommended for 
computation of (x -—x )? and X(y —y )2. 

Example 14-8. Samples of two types of electric light bulbs were tested 
JSor icngth of life and following data were obtained : 


Type | Type Il 
Sample No. ny =8 no =7 
Sample Means X, = 1,234 hrs. X2 = 1,036 hrs. 
' Sample S.D.'s 5, = 36 hrs. Sq =40 hrs. 
Is tne difference in the means sufficient to warrant that type 1 is superior to 
type Il regarding length of life ? ‘ 


Solution. Null Hypothesis, Ho: ty = Wy, i.e., the two types I and II of 
electric bulbs are indentical. 

Alternative Hypothesis, H, : ty > Wy, i.e., type 1 is superior to type II, 

Test Statistic. Under Ho, the test Statistic is : 


ees o> ee : 
c= ‘ay tag 2 = h3, 
§2 (i =) 
mn 
where S?= ee ree [xc - Xi)? + L(xe ~ ¥2)? 


= 2 tne | — — 2 yd ; 
=F aD [ny512 + 2522] = 13 + [8 x (36) + 7X (40)?] = 1659.08 
___ 1234 ~ 1036 _ 198 


t= = 
V 1659-08 x 0-2679 


Pe ree 
1659-C3s & ) 


Tabulated value of ¢ for 13 df. at 5% level of significance for right (single) 
tailed test is 1-77. [This is the value of fo; for 13 df. from two-tail tables 


given in Appendix]. 

Conclusion. Since ‘calculated ‘t’ is much gréater than tabulated ‘t’, it is 
highly significant and Ho is rejected. Hence the two types of electric bulbs differ 
significantly. Further since x, is much greatér than x,, we conclude that type I is 
definitely superior to type II. 


= 9.39 
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Example 14-9. The heights of six randomly chosen sailors are in 
inches : 63, 65, 68, 69, 71, and 72. Those of 10 randomly chosen soldiers are 
61, 62, 65, 66, 69, 69, 70, 71, 72 and 73. Discuss, the light that these data 
throw on the suggestion that sailors are on the:average taller than soldiers. 

Solution. If the heights of sailors and Soldiers be represented by the 
variables X and Y respectively then the Null Hypothesis is, Ho : [xy = fy, i.e., 
the sailors are not on the average taller than the soldiers. 

Alternative [lypothesis, H, : Uy > ty (Right-tailed). 

Under //o, the test statistic is : 


Z ‘ene? =b4 


= 68 +:0 = 68 eet ae 
em ee “247 GA and Z(y -y)? = yp2_G2Y : 


= 60-0 =66 = 186 - 24 - 153.6 


1 [S(x- 8)? + EQ - WF] =4 Gos 1536) = 152571 


y+ Ny - 
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Az 68 -— 67-8 7 0-2 = 
1 1 \2 V'15-2571 x 0-2667 
V 15-2571 é + fl 


Tabulated tos for 14 df. for single-tail test is 1-76. 

Conclusion. Since calculated ¢ is much less than 1-76, it is not at all 
significant at 5% levels of significance. Hence null hypothesis may be retained 
at 5% level of significance and we conclude that the data are inconsistent with 
the suggestion that the sailors are on the average taller than soldiers. 

Example 14:10. A certain stimulus administered to each of the 12 
patients resulted i in the following increase of blood pressure : 

5,2, 8,-1, 3, 0, -2,1, 5,0, 4 and 6 
Can it be concluded that the stimulus will, in general, be accompanied by 


an increase in blood pressure ? [Delhi Univ. B.Sc. 1989] 
Solution. Here we are given the increments in blood pressure i.e., 
d; (= x;- yj). . 


Null Hypothesis, Ho : Uy = Wy, i.e., there is no significant difference in the 
blood pressure readings of the patients before and after the drug. In other words, 
the given increments are just by chance (fluctuations of sampling) and not due to 
the stimulus. 

Alternative Hypothesis, Hy : Wy < Wy, i.e., the stimulus results is an 
increase in blood pressure. 

Test Statistic. Under Ho, the test statistic is : 


te 


t 
i als 


d_ _2:58xV12_ 258 x 3-464 _ 444 
SNn V9-5382° 3-09 


Tabulated fp.95 for 11 d.f. for right-tail test is 1-80. [This is the value of 
to.190 for 11 df in the Table for two-tailed test given in the Appendix]. 


f= 
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Conclusion. Since calculated ¢ > to.95, Ho IS rejected at 5% level of 
significance. Hence we conclude that the stimulus will, in general, be 
accompanied by.an increase in blood pressure. 

Example 14-11. In a certain experiment to compare two types of pig 
foods A and B, the following results of increase in weights were observed in 
<a 


Increase in [_Pie ne —_ 499 $3 S1 $2 47 SO $2 = 353 407 


weight in 
lb 52 SS S$2 S53 S$0 S4 S4 53 423 


(i) Assuming that the two samples of pigs are independent, can we conclude 
that food B is better than food A ? 


(ii) Also examine the ‘case when the same set of eight pigs were used in 
both the foods. 


Solution. Null Hypothesis, Ho. If the increase in weights due. to foods A 
and B are denoted by X and Y respectively then Ho : pty = py, i.e., there is no 
significant difference in increase in weights due to diets A and B. 


Alternative Hypothesis, H, : ty < Wy (Left-tailed). 


(If the two samples of pigs be assumed to be independent, then, we will 
apply ¢-test for difference of means to test Ho. 


Test Statistic. Under Hg : pty = py, the test criterion is 


p 
0 
9 
0 
1 
4 
4 
4 
1 


5 = 5245552875 
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and E(e- zy = re OE L(y - yp = LD? - = 
=37- a = 23 - 7 
= 30-875 = 16-875 
$2 = aaa) [su x)*#Xiy-y 7] 
= (30-875 + 16-875) = 3-41 
50-875 - 52875, na 


eS { =—_"_—___. 
' — 4 
h) Ete ) oa 


Tabulated t.95 for (8 + 8 — 2) = ) d.f. for one-tail test is ) -76. 

Conclusion. The critical region for the left-tail test is ¢ < —1-76. Since 
calculated ¢ is less than —1-76, Ho is rejected at 5% level of significance. Hence 
we conclude that the foods A and B differ significantly as regards their effect 6n 
increase in weight. Further, since y > x, food B is superior-to food A. 

(ii) If the same set of pigs is used in both the cases, then the readings X and 
Y are not independent but they are paired together and we apply the paired ¢-test 
for testing Hp. 

Under Ho : jtx = py, the test statistic is 


|44- es = 1-714 


Ia, ot 


“Ysim  Vieniaaye | 04629 


Tabulated t.95 for (8 — 1) =7 d.f. for one-tail test is 1-90. 
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Conclusion. Here also the observed value of ‘’ is significant.at 5% level 
of significance and we conclude that food B is superior to food A. 


EXERCISE 14 (c) 


1. Explain, stating clearly the assumptions involved, the r-test for testing 
the significance of the difference between the two sample means. 

‘2. Two independent samples of 8 and 7 items respectively had the 
following values 

Sample I... 9 11 13 11 15 9 12 14 

Sample II... 10 12 10 14 9 8 10 

Is the difference betweén. the means of samples significant ? 

3. (a) Two horses A and B were tested according. to the time (in seconds) to 
run a particular track with the following results : 

Horse A: 28 30 32 33 33 29 34 

Horse B_—: 29 - 30 30 24 27 29 

Test whether the two horses have the same running capacity. [S per cent 
values of ¢ for 11 and 12.degrees of freedom respectively are 2:20 and 2-18]. 

Ans. Calculated ¢ = 2-5 (approx.) 

(b) The gain in weight of two random samples of rats fed on two different 
diets A and B are given below. Examine whether the difference in mean increases 
in weight is significant. 


Diet A : 13. 14 10 11k 12 16 10 8 

DietB : 7 10 12 8 10 11 9 10 11 

4. (a) Show how you would use Student’s ¢-test to decide whether the two 
sets of observations \ 


[17, 27, 18, 25,27, 29, 27, 23, 17) and [16,16, 20, 16, 20, 17, 15, 21) 
indicate samples drawn from the same universe. 
(b) A reading test is given to an elementary school class that consists of 12 


Anglo-American children and 10 Mexican-American children. The results of the 
test are : 


Anglo-American Mexican-American 
x, = 74 x2 = 70 
$s, 38 82 = 10 


Is the difference’ between the means of the two groups significant at the 0-05 
level ? Given to9 = 2-086, t22 = 2074 at 5% level. 


[Dethi Univ. M.C.A., 1986] 


5. (a) For a random sample of 10 pigs, fed on a diet A,the increases in 
weight in a certain period were 10, 6, 16,17, 13, 12, 8, 14, 15, 9 Ibs. 

For another random sample of 12 pigs fed on diet B, the increases in the 
same pericd were 7, 13, 22, 15, 12,14, 18, 8, 21, 23,-10, 17 Ibs. 

Find if the two samples are significantly different regarding the effect of 
diet, given that for d.f. v = 20, 21, 22, the five per cent values of ¢ are 
respectively 2-09, 2-07, 2-06. 

Ans, ¢ = 1-51; Sample means do not differ significantly. 
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(b) Two independent samples of rats chosen among both the series had the 
following increase in weights when -fed on-a diet. Can you say that the.mean 
increase in weight differs significantly with sex ?° 

Male : 96, 88, 97. 89, 92, 95 and 90 

Female: 112, 80, “98, 100, 84, 82, 89, 95, 100 and 96. 

6. (a) Ten soldiers visit a riffle range for two consecutive weeks. For the 
first week their scores are 

67, 24, 57, 55, 63, 54, 56, 68, 33, 43 
and during the second week they score in the same order— 
70, 38, 58, 58, 56, 67, 68, 72, 42, 38 

Examine if there is any significant difference in their performance. 

(b) Two independent groups of 10 children wére tested to find how many 
digits they could repeat from memory after hearing them. The results are as 
follows : 

GroupA_: 8 6 5 7 68 7 4 5 6 

GoupB : 1067 8 6 9 7 67 =7 

Is the difference between the mean scores.of the two groups significant ? 

(c) Measurements of the fat content of two kinds of ice cream, Brand A and 
Brand B, yielded the following sample data : 

‘Brand A: 13-5 14-0 13-6 12-9 13-0 

Brand BS: 12-9 13-0 12-4 13-5 12-7 

Test the null hypothesis 1, = 2, (where 1, and 1, are the respective true 
average fat contents of the two kinds of ice cream), against the alternative 
hypothesis 1, # [, at the level of significance a = 0-05. 

[Madras Univ. B.E;, 1990) 


7. (a) A random sample of 16 values from a normal population has a mean 
of 41-5 inches and sum of squares of deviations from the mean is equal to 135 
inches. Another sample of. 20 values form an unknown population has a mean 
of 43-0 inches and sum of squares of deviations from their mean is equal to 171 
inches. Show. that the two samples may be-regarded as coming from the same 
normal population. 

(b) A company is interested in knowing if there is a difference in the average 
salary received by foremen in two divisions. Accordingly samples of 12 foremen 
in the first division and 10 foremen in the second division are selected at random. 
Based upon experience, foremen’s salaries are known to bé approximately 
normally distributed, and the standard deviations are about the same. 


First Division Second division 
Sample size 12 10 
Average monthly salary 
of foremen (Rs.) 1,050 980 . 
Standard deviation of 
salaries (Rs.) 68 74 


The table value of ¢ for 20 d.f. at 5% level of significance is 2-086. 
Ans, ft = 2-2. Rejéct Ho : by = Uv 
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(c) The average number of articles produced by two machines per day are 
900 and 250 with standard deviations 20 and 25 respectively on the basis of 
records of 25 days production. Can you regard both the machines equally 
efficient at 1% level of significance ? 

Ans. t= -7-65. Hint. Here nj =n, = 25. 


8. Eleven school boys were given a test in Statistics. They were given a 
month’s tuition and a second test was held at the end of it. Do the marks give 
evidence that the students have benefited by the extra coaching ? 

Boys 1 2 3 4 5 6 7 8 9 10 15 
Marks in Isttest' 23 20 19 21 #18 20 48 #4417 =23 #16 «#19 
Marks in 2ndtes- 24 19 22 18 20 22 20 20 23 #20 J8 

Ans. Ho: ty = Ho; Hy : Wy< 2 . Paired It! = 1-483. Not significant. 
Hence, students have not benefited from-extra coaching. 

9..(a) The following table gives the additional hours of slecp gained by 10 
patients in an experiment to test the effect of a drug. Do these data give evidence 
that the drug produces additional hours of sleep ? 

Patients b 2 3 4 5 6 7 8 9 10 

Hours gained: 0-7 O-1 0-2 1-2 O-310-4 3-7 O-8 3-8 2-0 

(b) A drug was administered to 10 patients, and the increments in.their 
blood pressure were recorded te be 6, 3, -2; 4, -3, 4, 6,0, 3, 2. Is it reasonable 
to believe that the drug has no effect on change of blood pressure ? Use 
5% significance Icvel, and assume that for 9 degrees of freedom, 
P(t > 2-26) = 0-025. [Calcutta Univ. B.Sc.(Maths. Hons.), 1986] 

(c) The scores of 10 candidates prior and after training are given below : 

Prior: 84 48 36 37 34 69 83 96 90 65 

After: 90 38 56 49 62 8] 84 86 84 75 

Is the training effective ? [Calicut Univ. B.Sc., Oct. 1992] 

10. The following table gives measurements of blood pressure on subjects 
by two investigators : 

Subject No. | 2 3 4 5 6 7 8 9 10 

Investigator | : 70 68 S56 75 80 90 68 75 56 58 

Investigator Il : 68 70 52 73 #75 78 67 #70 54 = 55 

No other details of the experiment were given. 


(i) If a valid inference has to be drawn about the difference between the 
investigators, mention the precautions that should have been taken in 
conducting the experiment with respect to the time of measurement, interval 
between the first and second measurements, the order in which the investigators 
measure, etc. 


(ii) After the experiment was conducted it was discovered that all the 
Subjects were unrelated except that No. 10 was the father 6f No. 9. Assuming 
that all the precautions you mention in (a) are satisfied, analyse-the data to draw 
an inference on the difference between the investigators. 5 per cent values of the 
t-statistic corresponding to various degrces of freedom are as follows : 

5 per cent valucs of tf... 2-40 2-31 2-26 2:23 2-10 2-09 

Degrees of freedom... 7 8 9 10 18 19 
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11. The following are the values of the cephalic index found in two 
samples of skulls, one consisting of 15 and the other of 13 individuals. 

SampleI: 74-1 77-7 74:0 74-4 73-8 79:3 75-8 82-8 

72-2 75-2 78:2 77-1 78-4 76-3° 76:8 

Sample IT: 70-8 74-9 74-2 70-4 69-2 72:2 76:8 72-4 

77-4 78-1 72-8 «74:3 74-7 

(i) Test the hypothesis that the means of population 1 and population II 
could be equal. 

(ii) Is it possible that the sample II has come from a population of mean 
72:0 ? 

(iii) Obtain confidence limits for the mean of population I and for the mean 
of population II. 

(Assume that the distribution of cephallic indices for a homogeneous 
population .is normal.) 

12. (a) The following table gives the gain in weight in decagrams in a 
feeding experiment with pigs on the relative value of limestone and bone meal 
for bone development. 

Limestone 49-2 53-3 50-6 52:0 46-8 50-5 52-1 53-0 

Bone meal 51:5 54:9 52:2 53:3 51-6 54-1 54-2 53:3 

Test for the significance of difference between the means in two ways : 

(i) by assuming that the values are paired. 

(ii) by assuming that the values are. not paired. 

(b) The following table shows the mean number of bacterial colonies per 
plate obtainable by four slightly different methods from soil samples taken at 
4 P.M. and 8 P.M. respectively. 


Method A B C .D 
4 P.M. 29-75 27-50 30-25 27-80 
8 P.M. 39-20 40-60 36-20 42-40 


Are there significantly more bacteria at 8 P.M. than at 4 P.M. ? 

[Given los (3)= 3-18 and fg9; (3) = 5-84] 

13. (a) It is believed that glucose treatment will extend the sleep time of 
mice. In an experiment to test this hypothesis ten mice selected at random are 
given glucose treatment and are found to have a mean hexabarbital sleep time of 
47-2 min with a standard deviation of 9-3 min. A further sample of ten untreated 
mice are found to have a mean hexabarbital sleep time of 28-5 min. with a 
standard deviation of 7-2 min. Are these results significant evidence in favour of 
the hypothesis ? 

Find 95% confidence limits for the population mean difference in, sleep 
time. State any assumptions made concerning the data in carrying out the test 
and finding the limits. (Bangalore Univ. B.E., Oct. 1992] 


(b) An expeiiment was performed to compare the abrasive wear 0: two 
different laminated materials. Twelve pieces of material I were tested, by 
exposing each piece to a machine measuring wear. Ten pieces of material II were 
similarly tested. In each case the depth of wear was observed. The sample of 
material I gave an average (coded) wear 8-5 units with a standard deviation of 0-4 
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while the sampic of material! II gave an average of 8-1 and a standard deviation of 
0-5. Test the hypothesis that the two types of material exhibit the same mean 
abrasive wear at the 0-10 Icvcl of significance. Assume the populations to be 
approximatcly normal with cqual variances. 

If the level of significance is 0-01, what will be your conclusion ? 

[Delhi Univ. M.E., 1992] 

14-2:11. t-test For Testing Significance of an Observed 
sample Correlation Coefficient. If r is the observed correlation 
coefficient in a sample of n pairs of observations from a bivariate normal 
population, then Prof. Fisher proved that under the null hypothesis Ho: p = 0, 
Le. population correlation coefficient is zero, the statistic : 


—"— V(n - 14.9 
ny (n — 2) »-(14-9) 


follows Studcnt’s ¢-distribution with (# — 2) d.f. (c.f. Revise to § 14-3 page 
14-41). 

If the value of ¢ comcs out to be significant, we reject Ho at the level of 
significance adopted and conclude that p #0, i.e.,'‘r’ 1s significant of correlation 
in the population. 

If comes out to be non-significant then Hg may be accepted and we 
conclude that variables may be regarded as uncorrelated in the population. 

Example 14-12. A random sample of 27 pairs of observations from a 
normal population gave a correlation coefficient of 0-6. Is this significant of 
correlation in the population ? 

Solution. We set up the null hypothesis, Ho : p = 0, i.e., the observed 
sample correlation coefficient is not Significant of any correlation in the 


| 


population. 
; at v. no ~ 
Under Hp: (t= Va-r) l(n-2) 
Here _0-6V27-2 | 3 = 3-75 


V(1-0-36) 0-64 
Tabulated f.95 for (27 — 2) = 25 d.f. is 2-06. 
Conclusion. Since calculated ¢ is much greater than the. tabulated 4, it is 
significant .and hence Ho is discredited at 5% level of significance. Thus we 
conclude that the variables are correlated in the population. 


Example 14-13. Find the least value of r in a sample of 18 pairs of 
observations from a bi-variate normal population, significant at 5% level of 
significance. 


Solution. Here n = 18. From the tables to.95 for (18 — 2) = 16 df. is 2-12 
r \ n-2 
Va- r?) 

In order that the calculated value of tis significant at 5% level of . 
significance, we should have 


Under Ho: p = 0, t= 


~ bn-2) 
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rv n—2 3 rV16 > 212 
0-05 : 
\ (1- r?) V (1-7?) 
= 16r2 > (2-12P(1—r2) => 20-4937? > 4-493 
4-493 
2 = ‘ 
=> r+ > 70-493 7 0-2192 
Hence Irl > 04682 


Example 14-14. A coeffi cient of correlation of 0-2 is derived from a 
random sample of 625 pairs of observations. (i) Is this value of r significant ? 
(ii) What are the 95% and 99% confidence limits to the correlation coefficient in 
the population ? 


Solution. Under the null hypothesis Ho: p = 0, i.e., the value of r = 0-2 
is not significant; the test Statistics is : 


_rNn- 2 - 
V1-—r2 


Now , Ox VO? = 5:09 
\ (1 — 0-04) 
Since df. = 625 — 2 = 623, the significant values of ¢ are same as in the 
case of normal distribution, viz., to.95 = 1-96 and fo.9, = 2-58. Since calculated ¢ 
is much greater than these values; ‘it is highly significant. Hence Hy: p = 0 is 


rejected and we conclude that the sample correlation is significant of correlation 
in the population. 


95% Confidence Limits for p (population correlation coefficient) are 
r+196 S.E.(r) =r+196(1-r)Nn [Since n.large] 
= 0-2 + (1-96 x 0-96/V 625 ) 
= Q-2 + 0-075 = (0-125,.0-275) 
99% Confidence Limits for p are: 
0-2 + 2-58 x 0-0384 = 0-2 + 0-099 = (0-101, 0-299) 


EXERCISE 14 (d) 


1. A restaurant owner ranked his 17 waiters in terms of their speed and 
efficiency on thé job. He correlated these ranks with the total amount of tips 
each of these waiters received for a one-week period. The obtained value of 
correlation coefficient is 0-438. What do you conclude ? 

Given : t)5 (0-05) = 2-131, t¢-(0-05) = 2-120 for two-tailed tést. 

[Delhi Univ. M.C.A., 1990] 

2. Test the significance of the values of correlation coefficient ‘r’ obtained 
from samples of size n pairs from a bivariate normal population. 

(i) r=0-6,n = 38 (ii) r=0-5,n=11 
Ans. (i) t= 4-5; Significant at 5% level; Ho : p = 0 rejected. 
(i) t = 1-73; Not significant at 5% level. 
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(i) Consistent Statistic 
(ii) Unbiased Statistic 
(iii) Sufficient Statistic 
(iv) Efficiency. [Dethi Univ. B.Sc. (Stat. Hons.), 1987, 1982) 
2. What do you understand by Point Estimation ? When would you say 
that estimate of a parameter is good ? In particular, discuss the requirements of 
consistency and unbiasedness of an estimate. Give an example to show that a 
consistent estimate need not be unbiased. 


[Delhi Univ. B.Sc. (Stat. Hons.), 1992, 1986] 


3. Discuss the terms (2) estimate, (ii) consistent estimate, (iii) unbiased 
estimate, of a parameter and. show that sample mean is both consistent and 
unbiased estimate of the population mean. 

[Calcutta Univ. B.Sc. (Maths. Hons.), 1986] 

4. (a) If 5,2, 592, ..., 5,2 are r sample variances based on random samples of 
SIZES Nj, No, ..., nN, respectively, and if T is some statistic given by 


r S17 + N2S2? +... + N,S,2 
aa a) 


a 
for estimating o? as an unbiased estimator, find the value, of a, supposing 
population is very large and for every sample 


224 ¥(x;-FP 
Ans, a ="(n, +m) +... #2,)—- TF. 


(b) If X), Xs. Xa; ..., X, are the sample means based on samples of sizes 
N,, No, Ns, ..., M, respectively, an unbiased estimator, 


has been defined to estimate ut. Find the value-of.k. 
Ans. k=n, +o +.:. + M,. 
5. (a) For the geometric distribution, 
fix, 0) =8 (1 -6)*-!, @=1,2,...),0<6<T, 


Obtain an unbiased estimator of 1/6. [Ans. E(X) = 1/0.] 


(b) The random variable X takes the values 1 and O with respective 
probabilities 6 and 1 —6. Independent observations X,, X>, ...,X, on X are 
available. Write © =X, +X,+...+X,. 


Show that & (n — &)/n(n — 1) is an unbiased estimate of 0(1 — 6). 

6. Show that if T is an unbiased estimator of a parameter 0, then XT + A, 
is an unbiased estimator of 1,6 + Az, where A, and X, are known constants, but 
T? is a biased estimator of 67, 


7. For the following cases determine if the given estimator is unbiased for 
the parametric function. When it is biased, derive an unbiased estimator from it. 


x is the sample mean. 
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Proof. Let (,, y;), @ = 1, 2, ..., 2) be a random sample of size n 
drawn from an uncorrelated bivariate normal population (p = 0) in which 
E(X) = E(Y) = 0 and V(X) = oy”, V(Y) = oy”. Let the variable Y be transformed 
to the variable Z by means of a linear orthogonal transformation, viz., 

Z=CyY 
where Dine 1= (21, 225 cory Zn), Ynx l= (y1, V2» eves Ya) and Coe = (cj), C is an 
orthogonal matrix. Let us, in particular, take , 


Cy, =H CyQ=...=Cy,= uNn ’ 
so that HG 4 one Nay 
Van 
Now proceeding as in (Theorem 13-5), we get 
a a 
Eats E Oi-F)*= ns 
§ = 


t= 2 


Since in a bivariate normal distribution, the marginal distributions of X and 
Y are also normal, we have Y ~N (0, oy*). Hence by Fisher’s Lemma 
(Theorem 13-4) z;, (i= 1,2, ...,n) are independent N (0, oy’). 


2 (x;- x) Oi-Y) 


Now p - Lov %, Y) is 
SySy nm Sy Sy 
A A 
eee eet Se ie ee n . 
a) 
7 n Sy S¥ ~ nSx Sy 
Vnsyr = i= *) Yi, (say), : »o6(**) 
Vn sy 


{since the ‘sum of the squares of coefficients of y,, y2, ...,Y,_ in (**) is unity.} 
From (*) and (**), we get 


a a a 
nsy= DL z2= YL 224+2,%= LF 22 +4nrsyY 
i=a2 ia 3 i= 3 


= (l=r)asfe= . z z2 warn) 


Since z;, (i = 1,2, ..., m) are independent N (0, oy”); 
(z;/oy), i = 1,2, ..., 2) are independent NW (0, 1). 
Hence from (**), 


being the square of a standard normal variate is a y?-variate with 1 d.f. and from 
(¥**), 
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= z z?/oy" = E (2/67)? = farynsr , 


peing the sum of squares-of (m ~ 2) independent standard normal ae is an 
independent x-variate with (n — 2) d.f. Se 

Further, since z2 and (z3, 24, ..., Z,) are independent r.y.’s, U and V are 
independent chi-square variates with 1 and (n — 2) d.f. respectively. 


Uo nr sy lo ~ g, (1, 2=2 
U+V [nresy?+(1—r2)nsyoy (2° 2 
[c.f. Theorem 13-2] 


7 on -2 
Hence the shin function of r? is given by 
a-2 
dF(2) =——_——-. (729-1! 12) 2 a2), OS 22 <1 
B 1 a-2 
7 2 
=> dF) s=———"\ —_[1-y- 92 gp, 1 ere) 


1 a-~2 
a(5 a 
the factor 2 disappearing from the fact that total probability in ad range 
-1 <r <1 must be unity. 


Remark. Jf p = 0, then t = oe 
t with (n~ 2) d. f. 


— ia 2) is distributed as Student’ s 


male - 2) ** 
Proof. = tac suc") 
(2 eel: aoe a l 
> n-2 (l-r?) 1-r? 
2 -1 
= (1 -r?) -|1+ sna sa) 
From (*), 
= V(n -2) d[riV( - r?) ] 


| ae dr r\ _(@2rdr 

dt = | (1 — Fr) : - 4 (1 Mts, 
dr re 

4 NO-8 FB Toa 


dr . e ' l 
=> dt =Vin -2) X zona Le., dr = = (1 - 1)? at 
(1 — r2)3/2 V(n — 2) 
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As r ranges from —1 to 1, from (*), ¢ ranges from — ~ to 9, 
When p = 0, the p.d.f. of ‘r’ is given by (14-12) and it transforms to 


1 (n-4)2___1 3 
dG(t) =———— [1 - 71° - °°? ——— (1 - 1)? at 
pi 2=2 V (i -'2) 
> 2 
ee) ee eee See See eens r 
1 n-2 ° (2 (a - 1)/2 
Vin 2) 2(5. 2 )[- a) 
' [From (**)] 
ee, eee ee eae: RE ae 
l _2 pz (n-2+41)/2 , 
ame 58 [ioe 
\ — oS < { <oo 


which is the p.d.f. of ¢-distribution with (n — 2)d.f. 


Hence t rk V (n — 2) ~ lu 2) 


Example 14:15. (a) If (x;, y;) is a@ random sample drawn from an 
uncorrelated bivariate normal population, derive the distribution of 


bee L(x4;- x) Oi-Y ) 


» VEG, - 2) L0,-7)? 


(6) Further, when n = 5 and if P (/r/2C) = a, show that C is a root of 
the equation, 


CVT = C2) + sinc + MED 9 
Solution. (a) cf. § 14-3. 
(6) P(lrlec) =1-P(lrlse@=1-PCCeErs<C) 
=1-2P@srsc=i-2 f° f(r) dr 
[-.- f(r) is seta about r = 0] 


1 
Whenn=5, f() =—~>—.(1- 2dr sc f. Equation (14-12)] 


_ 2) . (° 
Pirl2C) =1-25a rap : (1 —r2)12 dr 


1 fi is . 
1-2x>7—-[or(l-7r?)'? + 5sin' +r 
a psn] 


1-2 [5c - C2)!" + Ssin'C =a, Given) 
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1 
1 -2\ca — C2)? + sin-! c| =a 


=> C(1 - C2)!2 + sin! C+ (a- 1) 50 


14-4. Non-central t-distribution. The non-central ¢-distribution -is 
the distribution of the ratio of a normal variate with possibly non-zero mean and 
yariance unity, to the square root of an independent ?-variate divided by its 
degrees of freedom. If X ~ N (1, 1) and Y is an independent x?-variate with 
n af., then 

; X 


= (14-13 
V Yin 
is said to have a non-central ¢-distribution with n dj. and non-centrality 
parameter |. Non-central /-distribution is required for the power functions of 
certain tests concerning normal population. 
p.d.f. of ‘t°’. Since X~ N (i, 1), its p.df. f(.) is 


fia) == exp [- 3(x - H)? ] 


1 ! ~ (tix)! . 
= exp | — 5 (2? + x?) w= 00 <X < 00 
Von [ 2 ] %, i! 
Since Y ~ x?(,), its p.df. g(.) is 
1 
= ————- ey y (w2)-1 
gy) 22T(nf2) ° y ,0<y<oo 

Since X and Y are independent, their joint p.d.f. becomes 


er: eee eae 2 2 (nf2) - 1. a (ux 
am V 2m 22 F(n/2) exp [ ae +y]y ie e! 


Let.us transform to new variables ‘and z by the substitution : 


, a Vax 
(= = » Z=+ Vy 
Vyin Vy 
= x =2ttvn, y=2 
Jacobian of transformation J is 


Ox ox Zz. it 
ay ay : 
ot’ = az 0 22 


The joint p.d.f. of «and z becomes 


, exp (— 12/2) 37! = (uzt NV n)' 22? 
A(t',z) = z ; . 
V2n 2"2 (n/2) @) fog. ot! Vn 
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42 
x exp [-3 (1+ —)22] ; 00 <f’< 00, DN< zc 


exp (- 2/2 ~ |. (ur)! . 1, ica eee 
= ecu [Ft ee FEC + Spe 
Vin 20-DP P(n/2) i= 
Integrating w.r.t. z in the range 0 to ce, we get the p.d.f. of t’ 


In(t’) = __exp (— /2) 
Vic 2@-D?2 F(n/2) 


an 5 | ten” exp | b(t Spee ae 


exp (— ae 


~ Vn200-1V2 P(n/2) 


oo ri (= orien 
x Zz era exp {- (1 + ON os | 


pia p(2+it l | 
_ £xp (— p?/2) - hess 2 t? 
~ Ve (n/2) i=0 ELAO INS (14° n+i+1)/2 
n 
.--L14-13@)] 


which is the p.d.f. of non-central ¢-distribution with n d.f. and non-centrality 
element L. 
Remark. If 1 = 0, get from (14-13 (a)] 


hy(t) = a Ti+ )/2) ee eee 
VnT(n/2) Vn | oe ci 1/2 
n 


1 E ” “| —(n+1)/2 ete 
VabG, peo "dt 


which is the p.d.f. of central ¢-distribution with n d.f. 
14-5, F-statistic. Definition. If X and Y are two independent chi- 
square variates with v, and v2 df. respectively, then F-statistic is defined by 


F=>— (14-14) 


In other words, F is defined as the ratio of two independent chi-square 
variatcs divided by their respective degrees of freedom and it follows Snedecor’s 
F-distribution with (v;, v2) d.f. with probability function given by 
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(2) F 
f(F)= ol 0S Fc co... (14-14) 
-p(m V2 ed 
2° 2 V2 


Remarks 1, The sampling distribution of F-statistic does not involve any 
population parameters and depends only on the degrees of freedom v, and vy. 

2. A Statistic F following Snedecor’s F-distribution with (v;, v2) df. will 
be denoted by F ~ F (v,, v2). 

145-1 Derivation of Snedecor’s F-distribution. Since X and Y 
gre independent chi-square variates with v, and v, d.f. respectively, their joint 
probability differential is given by 


dF(x, y) as exp (-2/2) xP ay 
1 
—— nn iy pny ylva/2?-1 
x ia T(z) (-y/2) y ay| 


1 
he + v,)/2 T'(v,/2) T(v,/2) 
x 21/2) -~1 y (v,/2) -1 dx dy, 4) < (x, y) < 00 
Let us make the following transformation of variables : 
x/v, 


F =— and u=y, so that0< F <0, 0 <u<oco 
yiv2 


exp {- (x + y)/2} 


x ww) Pye Fy and y=u 
V2 V2 
Jacobian of transformation:J is given by 


al 
—u 0 
ye 9G _ | _ “iu 
~ OF P u) i V1 ~ ‘Vo 
—F tI 
V2 


Thus the distribution of the transformed variable is 
1 u Vy 
F,4)) =—_———_———_—_—_ st ee as 
GG(P t= eva Tl) Pg) { 2 (1 +h, } 


(v,/2)—1 
52 (Fe) yl Vir du dF 
2 ‘ 


aati eb 87) 
299? Tey f2)P(v2/2) 


x yl * %2/2) -1 POr)~1 du:dF;-0 < u< 00,0 <F <co 
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Integrating out u over the range 0 to ©, the distribution of F becomes 


BF) P= 0+? Fy 12) Fond) 


x exp = ees yl radi2V-1 
2 v 
0 2 


_ vy? Pi I + 2/2] 
~ 91 + ¥2)/2 T'(v,/2) T'(v,/2) F ' v1 (v, + v,)/2 
et) 


(v,/v.)" po 


B(3, 2 E 4 
2 2 va 


which is the required probability function of F-distribution with (v,, v2) d.f. 
~ YIve 
= ; . being the ratio of two independent chi-square variates with 


0<F <0 


a(F) = 


Aliter 

V1 

v2 

v,; and v2 d.f. respectively is a Bz (3 i 4 variate. Hence the probability 
function of F is given by 


ue, (v,/2)-1 
[> | 
dPF) =——- ,—" aa | 
“ v2) [14 dialed 
e € ; 2) V2 
Sl 
v2 Fold) -! : 
= =. —__— ar, 0S F << 
p(w vw [e+ the [om 
| v2 


‘14-5:2. Constants of F-distribution. 


ww’, (about origin) = E(F”) = f- F’ f(F) dF 
yyw? x0 (vyf2)-1 
on ila) f pa — dF 
p( %2)°° fis tele 
a “A2* 2 V2 

To evaluate the integral, put 

YM pe _ 2 2 

7 F=y ,so that dF = dy 


...(*) 
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V2 + (v,/2) -1 
% v/v)" ae de V2 
Lt, = — | dy 
0 [l+y)@*? (vy 
pi “2 
2 2 
v2 | 4 
Vy an r+(v,/2) -1 


= v +r+{(v -?r dy 


e 


-(PY a4 (r+ Bor | vor (14-15) 
ia) a(; 2) 2 e522 


Aliter for (14-15). (14-15) could also be obtained by substituting 
ot F = tan?‘6 in (*) and using the Beta integral : 
2 


rw /2 P 
2" sin? 8 cos’? 6 dd=B eae 


2 
, _(V2V Ulr+ (v1/2)) Piv2/2) - 7] V2 
“c} PT S27 GTO) 
In particular 
, 2 FUL + (1/2) PU(vz/2) - 1) 
Pry P42) T/2) 


ager? E-TO=C-DTG-D) ...1416(0) 
Thus the mean of F-distribution is independent of v,. 


, _ (¥2% Tl(vi/2) + 2) Ti(v2/2) - 2) 
xa a T'(v;/2) T'(v2/2) 


_ (v2 _ [(v,/2) + 1) (v,/2) 

- ) * ((v2/2) - 1) [(v2/2) - 2] 

_ v22(V, +2). 

~ Wi (¥2 - 2) (v2 - 4) 

= LL’ ___¥2(v, +2) v2? ~ 
He = Ha — Wi = v1(¥2 - 2)(v2 -"4) (v= 2)? 


2v.7 (v2 + Vv; — 2) 
= Vlvy - DAvy— 4)” v2 >4 ..-[14-16(6)] 


»V¥2 > 4. 


Similarly, on putting r = 3 and 4 in 1,’, we get 1,’ and 1,’respectively, 
from which the central moments jt, and [4 can be obtained. 
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Remark. It has been proved that for large degrees of freedom, v, and vp, 


F tends to N[1, 2 {(1/,) + (1/v2)}] variate. 


14-5-3. Mode and Points of Inflexion of F-distribution. We 
have 


log f(F)= C + ((v,/2) - 1) log F ~ cm log (1 + (v;/v2) F) 


where C 1s a constant independent of F. 


ce (Ms 1 @itv) 1 
5p [los AF)) -(3- 1).5 - ) ‘ ve Ss 
E + 2 P| 


_ 3 _ v)-2 Vv, (Vy + V2) | 
fF) = 55 /F) = 0 => oF - 20, +¥,F) 7" 


_ 42 (1 ~ 2) 
Hence =v, (vp + 2) »+(14-17) 
It can be easily verified that at this point f* (F) < 0. Hence 

_ V2 (vy = 2) 


~ V4 (v2 + 2) 
Remarks 1. Since F > 0, mode exists if and only if'v, > Z. 


2. Mode = Cex) : cw 
v2 + 2 Vy 
Hence mode of F-distribution is always less than unity. 


3. The points of inflexion of F-distribution exist for v; > 4 and are 
. equ. -listant from mode. 


a oe 
Proof. We have wf *¥ Bo Cl, m), (*) 


where [= v,/2 and m = v2/2. We now find. the points-of inflexion of Beta 
distribution of second kind with parameters / and m. 
If X ~ Bo (I, m), its p.df. is . 
oe ee ** 
fix) =B, m) (1+ aire OS*S% ---(**) 
Points of inflexion are the solution of 
f*(%) =0 and f°’) #0 
From (**), 
log f(x) = — log BCI, m) + (I - 1) log x - (1 + m) log (1 + x) 
Differentiating twice w.r.t x, we get 
£@ -f-1 l+m (***) 
Sf) x l¢x — 
f(x) f" (x) - [fF (454) l+ m 
(%)P TOU? IT (+x)? 


€ 
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If f” (x) = 0, then we get 
-| 42 --(5 1\ l+m 


ff) xt J (1 +x)? 
eos ea ; eri aes (Het) ees. (On using ("**)] 
= ‘'5ta-1 ~1) - 2S tax (4 ms 1)=0 
= (I~ 1Xl-2)(1 + x)? — 2x(1 + x) — 1) (1 + m) + x21 + m)(1 + m + 1) =0 


oe (tee) 


which is a quadratic in x. It can be easily verified that at these values of x, 
f” @) #0, if f> 2. 
« The roots of (****) give the points of inflexion of B2 (J, m) distribution. 
The sum of the points of inflexion is equal to the sum of roots of (****) and is 
given by : 
Coefficient of x in (****) 
7 | Suclcien of x? in | 
2(1 — 1)(L- 2) - 2(1- 1)(1 + m | 
“LU - 1) - 2) —-2 (- 12) + m) + (+ m)(L + m + 1) 
a 2(1 - 1){( + m) - (I - 2)] 
~Ud- 1) -2) -(- 1) + m) - (Ul - 1) + m)+ (lL + m)(l +m +1) 


7 2(l-— 1) (m +2 
~“d-1I)(d-2-l-m) + (+ m)(l+ m+1-1 4 1) 
2(1 —- 1)\(m + 2 


= (1 l(t 2) + (1+ m) (m + 2) 
- 2d-1) 2-1) 


l+m~-I1+1 (m+ 1) 


Sum of points of inflexion of js F ) distribution 


2/1 
_2d-1) 2 J 2 = 2). 
* Ge +1)", = (yo + 2) 

(Z+ 1) 


=> Sum of points of inflexion of F(v,,v2) distribution 


— BV) ha Vi. 
(+2) , provided l=5 >2 


_ o (v; — 2) 
~ wi(vg + 2) 
= 2 Mode, provided y>4 
Hence the points of inflexion of F(v,, v2) distribution, when they exist, 
(i.e., when v, > 4), are equidistant from the mode. 
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4; Karl Pearson’s coefficient of skewness is given by 
S; = Mean— Mode - 0, 

since mean > 1 and mode < 1. Hence F-distribution is highly positively skewed. 

5. The probability p(F) increases steadily at first until it reaches its peak 
(corresponding to the modal value which is less than 1) and then decrzases 
stowly so as to become tangential at F = ©, i.e., F-axis is an asymptote to the 
right tail. 

Example 14:16. When v,= 2, show that thé singificance level of F 
corresponding to a significant probability p is 


:_ Y2 (_-QH2) 
Fez (p - 1) 
where v, an v2 have their usual meanings. 


Solution. When v, = 2, 
1 2Z aF 


= va cy Be eee 
af) 2) *. E + 2 Ff ae 
v2 
+1) 2 
a Se 
(DI W/2) 7 2 Ww ia 
& ) F+ ,) 


v2 


v2 (%2/2) +1 
2 


> v> [(v,/2) +1 


(c.f. §14-14a) 


2 


Hence p -[- AF) dF 


f v5 a +1 f eo dF 
-[epr'x ff —«__ 
2 F | pie 7 +1 


: 2 
| [F + 7" 
v5 ' +1 2 
=| + x | —————— 
fa 
i 2 F 


V2 \ v2 
2 1 


F + a E 2 F[* 
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=> po 14e => F=2[p _ i] 


Example 14-17. If F(n,, nz) represent an F-variate with n, and nod f., 
prove that F(n2, n) is distributed as I/F (n,, nz) variate. Deduce that 


P[F(n,,n2) 2c] =P | Fen, m) < | 


Or 
Show how the probability points of F(n2, n,) can be obtained from those of 
F (m1, nN»). 
Solution. Let X and Y be independent chi-square variates with n, and n2 
d.f. respectively. Then by definition, we have 


_X/n) | 
F (Y/n>) F(n,, Nn») 
1 _ (Wn) | 
F = (Xin) F(n2, n)) (*) 
Hence the result. 


We have: 
¢ — a i 1 
PLF(m, n2)2 ¢] =Pl Fan § r| 


=P [F (n3,n,) < 74 [From (*)] 


Remark. P[F(m,n,)=c] =P |F (n2, 1) = 4] 


Let P[F(n,n.)2c] =a 
i.e., let c be the upper o-significant point of F(m,, m2) distribution. 
1 1 
1-a =1-P[F(n,n.) 2c] ='1-P ors Pa < | 

1 1 

=> a = P| Firm) < t]=1-P| Fimam) 2 Al 
1 

=> P| Fim, m) > t]=1-0 


Thus (1 — &) significant points of F(n, n,) distribution are the reciprocal 
of a-significant points of F(n,, m2) distribution, e.g., 
Fe4 (0-05) = 604 = Fa2(0-95)= oo, 


Example 14-18. Prove that if n, =n, the median of F-distribution is at 
F-= 1 and that the quartiles Q, and Q; satisfy the condition Q,Q, = 1. 
[Dethi Univ. B.Sc. (Stat.Hons.), 1989] 
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Solution. Since n, = nz = n, (say), the median (M) of F(n,,n2) = F(n, n) 
distribution is given: by : 


P[F(n,n) <M] =055 +0(*) 
1 
= Pla Fane ul = 0-5 
= P| Fin n)2 i = 0-5 |: — = F(n m)| 
M - F(m, n) ; 
= P| Fo, n)s i =1-A Fn, n)2 a4 
21-05 
= 0-5 ‘oC 
From (*) and (**), we get 
=o => M*=+1 => Me=1 


the negative value M = ~1, is discarded since F >0. _ 
Hence the median of F (n, n) distribution is at F = 1. 
Similarly, by definition of Q, and Q3, we have : 
P[F(n, n)<sQi]  =.0-25, -o(499*) 
ad  P[F(n, n)2Q,] =0-25 


1 1 
nn hr, ao 2, = 0-25 


= P| Fen n)s a = 025 | Fam ay Fm w) | (A) 


From (***) and (****), we get 
1 . 
2=5, => -2,03;=1 
O% 
Example 14-19. Let X; X2, ...,X, be a random sample from N(0, 1). 


Define Rolex, and Zoate S x, 

Find the distribution of:: 

@ $(% + ¥._2) (b) kX 2+ (n- bX, 
() XK,2, @) Xi, 


[Dethi Univ. B.A, (Stat. Hons. Spl. Course), 1989] 
Solution. (a) Since X,, X2, ....X, is a random sample from N (0, 1), 


a 1 = , 1 
Z-n(0, i) and Xy-a-N (0, 3 su) 
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Further, since (X,, Xo, ..., Xs) and (X41. Xee2----» X,) are independent, X, and 


xX a+ are independent, Hence, 


ly [te ivf) 1,1 
(Xt Xo = aXe +5Xn-e N(0.3¢+ Wb) 


1 — —_— 
=> Oa +Ke-a)~N (0, rey 
(b) From (*), we get 


Ant __ (0,1) 


Vif(n - k) 


= kXP2~ x20) and (n-b X2,, ~ x74) 


Since X , and X a- are independent, by additive property of chi-square 
distribution, 
k XP + (n -k) Xa ~ Vasey =X0 
(c) ae X, ~ N(O, . ag X,~N 0, 1) are eee 
Pm yay and Xo? ~ 77a), 
are also independent. ea by definition of F-statistic, 
XP 
X2/1 
(d) X,/X2, being the ratio of two independent standard normal variates is a 
standard Cauchy variate. [See Example 8-43]. 


| . EXERCISE 14(e) 


1. (a) Derive the distribution of F = S,2/ S,?, whére S,2 and S,2 are two 
independent unbiased estimates of the common si ae variance Go, defined 
by 


| x, 
~Fay > 5 ea Fur, 1) 


1 
S= 7? E (x1; - 4) 7; a ea : (x2; - X2)? 


(6) Find the limiting —_ when the degrees of ce of the x? in the 
denominator tend to infinity and give an intuitive justification of the result. 


Z(G) TEX (Xoo tie has A we ie sees » Xmen are independent normal 
variates with zero mean and ake deviation o, obtain the distribution of 
mtr 
r X7 = Xx? 
i= 1 ixcm-+i 
Ans. F(m, n). 


(b) If X has an F distribution with n, and n, d.f., find the distribution of 
1/X and give one use of this result. 
(c) If X is t-distributed, show that X? is F-distributed. 
(Dethi Univ. B.Sc. (Maths. Hons.), 1990] 
Hint. See § 14-5-6. 
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3. (a) Derive the distribution of the F-statistic on (n,, m2) degrees of 


Bs | 
freedom and show that the statistic ( + Ps F) has a Beta distribution. 


(b) Show that the probability curve of the distribution of F is positively 
skewed. 
4. Prove the following : 


1 
® Fame Fas, my 


. Nn x Pe nck ‘ 
(ii) Fain, = hs Ee where x has Beta-distribution. 


5. If X and Y are independent chi-square variates with v, and v2 d.f. 


respectively, show that U=X +YandV= is are independently distributed. 


vil 
Find the distribution of V. 
6. Prove that if X has the F-distribution with (m, n) d.f. and Y has the 
F-distribution with (n, m) d.f., then for every a > 0, 


Px <a)+PI¥ < fet 


7, Show that the mode of the F-distribution with v, (2 2), v2 d.f. is given 


v2 (v,; — 2) 
vi(v2 + 2) 


8. X is F-variate with 2 and n (n 2 2) degrees of freedom. Show that 


PF zk)=(1 + ey 


and is-always less than unity. 


(Gujarat Univ. B.Se., 1992] 

Deduce the significance level of F corresponding to the significance level of 
probability P. 

9. Let X,, X2 be independent random variables following the density law 
Fix) = e*, 0 < x < 00, Show that 

Z = X,/X2, has an F-distribution, 

10. (a) If X ~ F (n,, n2), show that its mean is independent: of nj. 

(b) Obtain the mode of F-distribution w‘ih (71, 72) d.f. and show that it lies 
between 0 and 1. 

(c) Show that for F-distribution with n, and n> d.f., the points of inflexion 
exist if n, > 4 and are equidistant from the mode. 

11. X is a binomial variate with parameters n and p and Fy, ,, is an 
F-staustic with v, and v, d.f. Prove that 

POX sk-1) = P| Fe, 2(n-k+1)> a a eri 
[Delhi Univ. B.Sc. (Stat. Hons.), 1985) 
Hint. If X ~ B (n, p), then we have [c.f. Example 7-23] 
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n q 
P(X <k-1) =(n-k+)). m-k (1 —pk-1 at 
k-1 5 
ee (eee : k k~-1 
~ Blk, n—k +1) ; ers (1p at 


-~-k+1 = 
Picts -~k+1) a fa) | PLF x, n-b+ 1] AF 


1-p n-k+1 p 
k °@q 
_ 1 = [k/(n —k + 1)]*. F*-'dF 
~ Bk, n-—k + 1) n+ 
nats E kF 1 
q n-k+ ] 


1 q ; 
“Bk n-k+ 1) | yr(-y)*" dy 
i 0 


kF 1 
where 1 oe i7y: 
12. (a) If X ~ F (1), nz) distribution, show that 
_ ny X ~B m7 = 
~ Ne + ny X ; 


(Delhi a ae (Maths. Hons.), 1992] 
Hence obtain the distribution function of X. 
Hint. The distribution function of X ~ F (n,, nz) is given by 


i = . fy x 
| fUF) dF = | h(u)du, ly = ard 
0 


Norn, xX 
0 2 1 


Gy(x) 


2° 2 


=1,[2, 2], 
2° 2 


where I,(—.Q= 


y ay 21 
---— | u” (1—u)’ du 
B(} 3) 0 


is the incomplete Beta function. Henge the distribution function of F 
distribution can be obtained from the tables of incomplete Beta function. 


(6) X ~ F (m, n), show that _ 
we—inXln (im, in) 


Deduce the variance of X from p.d.f. of W. 
[Delhi Univ. B.A. (Stat..Hons. Spl. Course), 1989] 
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13, Let X, and X, be a random. sample of size 2 form N (0, 1) and Y, and 
Y, be a random sample of size 2 from N (1,'1), and let the Y,’s be independent 
of the X;’s. Find the distribution of the following : 
() (Xi -X2V2 i) (X1 + X2)*K2-X1)? 
Wi) X+Y (iv) (Y, + Y2- 2)*/(X2-X,)? 


v) (X1 + XV KX. — Xi)? + (Yo - ¥1)71/2 
4) (1 — Yo)? + XK, — Xo)? + (X, + X2)71/2 
[Dethi Univ. B.Sc. (Maths. Hons.), 1988, 1987} 
Ans. ( N(O, 1), (ii) F (1, 1), (iii) N (1, 1) 
(iv) F (1, 1), (v) t); (vi) X73). 

14, Let X;~N (i, ),i = 1, 2,3 be independent random variabies. Using 
only the three random variables X,, X2, and X3, give an example of a statistic 
that has : 

(@) Achi-square distribution with 3 d.f. 
(ii) An F-distribution with (1, 2) d.f. 
(iz) A t-distribution with 2 df. 
[Dethi. Univ. B.A. (Stat. Hons. Spl. Course), 1986) 
Ans. Hint. Z; = (X¥;-)/i, i= 1, 2,3, are iid. N (0, 1). 


3 
: 2 Z12 ae Zi 
2 me Yon. ° — ° —————}____——— -, 
(i) : 2 Z; xX (3) » (ii) Z? + Z32 F(1, 2); (iit) ((Z2? + Z42)f2\'2 bey 
15. Let X,, X2, ..., X,, be a random sample from N (1, 67). Define : 


= ed oy 
X= UXis Xnwe yee Xi, XSL EX: 
2 L_« y.\2 ¢2 l Y 2 
EL er er te poke ae ee oe 
Aa 
and §* = l Dy (XK; -Xy. 
n-1y 


Answer the following questions : 
() What is the distribution of 
o 2 ((k-— 1) 8% + (n-—k-1) S*,_,]? 

(ii) What is the distribution of S?,/S?, _,? 

(iii) What is the distribution of (¥ - p) ¥n/S? 
[Delni Univ. B.Sc. (Maths Hons.), 1989] 

(iv) What is the distribution of 5X, + X,_2) ? 

(v) What is the distribution of (X; - 1)?/o? 2 

Ans. (i) Xe-1) + (a-k-1) = Xn -2) > Gi) Ft, a-k-1) 


ee f° no* 
(tdi) tqiiys GV).N (u, ropes | (¥) X71) 
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16. If X ~ F(1, n), show that 


(n - ,) log [1 + (X/n)] ~ x2), 


for large n. 
17. If X;, X2,X3 and Xq are independent observations from N (0, 1) 
population, state giving reasons, the sampling distributions of. 


V2 x3 , 3X2 
(i) Ux 4x,? and G)V= Yay X,2 4 ks? 


Ans. (i) U ~ 2) > (ii) V ~ Fd, 3). 
18. Let (X,, X2) be a random sample from N(0, 1). Answer the following, 
giving reasons ; 
(i) What is the distribution of (X — X,)?/2? 
(ii) What is the distribution of (X, + X)?/(X2— X,)*? 


(iit) What is the distribution of (X2 +X i V (X, —X)2? 
(iv) What is the distribution of 1/Z, if Z = X,2/X,?? 
[Delhi Univ. B.Sc. (Maths. Hons.), 1992) 
Ans, (i) ¥2q); Gi) FCI, 1) ; (iii) Standard Cauchy ; (iv) F(1, 1) 
14-5-4. Applications of F-distribution. F-distribution has the 
following applications in Statistical theory. 
14-5-5. F-test for Equality of Population Variances. Suppose 
we want to test (¢) whether two independent samples x;, (f= 1, 2, ..., m,) and y,, 
(j= 1,2, ..., m2) have been drawn from the normal populations with the same 
variance 67, (say), or (ii) whether the two independent estimates of the 
population variance are homogeneous or not. 
Under the null hypothesis (Ho) that (i) oy? = Gy? = 0?, i.e., the population 
variances are equal or (ii) Two independent estimates of the population variance 
are homogeneous, the statistic F is given by 


Sy? : 
=o (14-18) 
2 ] < i 2a 2 l Ke 7 2 
where Sy = a | 2 ee) and Sy* = ae yi w- yd) ...(14-18a) 
t= “juei 


are unbiased estimates of the common population variance 6? obtained from two 
independent samples and it follows Snedecor's F-distribution with (v,, v2) d.f. 
(where v, =n, - J and v2 =n - 1). 


Sx "1 2 Le) 2 
Proof. F =s2 Lae |/ |; sr 


-|e5 e ta [sr 73 5 | 


(Ox? = Gy = o? under //:) 
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Since mse and oo are independent chi-square variates with (n, — 1) and 

, (nz ~ 1) Gf. respectively, F follows Snedecor’s F-distribution with (n, —1, n2- 1) 

> df. (c.f. § 14-5). 

_* Remarks 1. In (14-18), greater of the two variances Sy? and Sy” is to be 
: taken in the numerator and n, corresponds to the greater: variance. 

By comparing the calculated value of F obtained by using (14-18) for the 
two Piven samples with the tabulated value of F for (n,, nz) d.f. at certain leve| 
of significance (5% or 1%), Ho is either rejected or accepted. 

2. Critical values of F-distribution. The available F-tables (given in the 
Appendix at the end of the book) give the critical values of F for the right-taileg 
test, i:e., the critical region is determined by the right-tail areas. Thus the 
significant value Fg (nm), nz) at level of significance a and (nm, n2) df. is 
determined by 

P[F > Fg (ny, n2)] =, sa(®) 
as shown in the following diagram. 
CRITICAL VALUES OF F-DISTRIBUTION 
P(F) 


Critical value 


Rejection 
Acceptance y region(a) 


region(1-«) Ym yyy 
F.(M4 bd Ng) 
From Remark to Example 14-17, we have the following’ reciprocal relation 
between the upper and Jower o-significant points of F-distribution : 
1 - 
fi: hh) => 
a ( 2) Fi_a (no, n}) 

=> Fe (ny ’ N>) x F, -a (n2, n)= l oss(**) 

The critical values of F for left tail test Ho: 0,7 =, against 
H, : 0,2 aon are given by F< Fai ny s(1+0), and for the 'two tailed test, 
Ho : 6\7= G2” against H, : 6? # 6? are given by F > Fy, -1, n,-1(a/2) and 
F <F, _1.,-:(1,- a/2) [For details, see § 16-7- 5}. 

Example 14:20. Pumpkins were grown under two experimental 
conditions. Two random samples of 11 and 9 pumpkins show the sample 
standard deviations of their weights as 0-8 and 0-5 respectively. Assuming that 
the weight distributions are normal, test the hypothesis that the true variances 


care equal, against the alternative that they are not, at the 10% level. [Assume 
that P (F io, 3 2 3*35) = 0-05 and P (Fs. 10 = 3-()7) = 0-05]. 


Solution. - We want to test Null Hypothesis, Ho : 0x? = cy’. 
against the Alternative Hypothesis, H, : ox” # Oy (Two-railed). 


F 


Exact Sampling Distribution (¢, F and Z Distributions) 14-59 


We are given : 
n,=11,12=9,sy =0-8 and sy = 0-5. 
Under the null hypothesis, Ho: Gy =y, the statistic: 


a 
= 
follows F-distribution with (n, - 1, n2- 1) df. 
Now ny Sy* = (n, - 1) Sy? 
= ew! iin ie 
$2 = fear tL 1) -( 7 }x@ 8)? = 0-704 
Similarly, Sy? = _ i}? = -(3 )> (0-5)? = 0-28125 
0-704 | 
F= 028125 => 


The significant values of F for two tailed test at level of significance 
a = 0-10 are ; 
F > Fi9,3 (G/2) = Fio,s a (*) 
and F < F108 (1 - a/2) = F 10,8 (0-95) - 
We are given the tabulated (significant) values : 


P (Fio.323-:35)=005 = Figg (0-05) =3-35 ual) 
| Se a er 
Also P(Fs, 10 2307)=005 => "Fas s x07] = 0-05 


=> P [F io, 3 3 0-326) =005 = PIF 10,82 0-326] = 0-95 wae **) 

Hence from (*), (**) and (***), the critical values for testing Ho : 
Oy” = Gy’, against H, : dy* # Gy’ at level of significance a = 0-10 are given 
by: 

F’> 3-35 and F < 0-326 =0-33 

Since, the calculated value of F (=2-5) lies between 0-33 and 3-35, it is not 
significant and hence null hypothesis of equality of population variances may be 
accepted at level of significance a = 0-10. 

Example 14-21. Jn one sample of 8 observations, the sum of the Squares 
of deviations of the sample values from the sample mean was 84-4 and in the 
other sample of 10 observations it was 102.6. Test whether this difference is 
= cant at 5 per cent level, given that the 5 per cent point of F for n, =7 and 

= 9 degrees of freedom is 3-29, [Delhi Univ. B.Sc. (Maths Hons.), 1986) 

Solution. Here n, =8,n,.=10 


and T(x-¥P =844, Yy- y= 1026 
$2 =— Sa -3 p= - 12.057 
ai 7 
S yn eS. 11-4 
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Under Ho : 6x” = Gy” = 0”, i.e., the estimates of 6? given by the samples 
are homogeneous, the test statistic is 


Tabulated F'o.95 for (7, 9) d.f. is 3-29. 
Since calculated F < Foo5, Hy may be accepted at 5% level of significance. 


Example 14-22. Two random samples gave the following results : 


Sample Size Sample mean Sum of squares of 
deviations from the mean 
I 10 15 96 
2 12 14 108 


Test whether the samples come from the same normal population at 5% 
level of significance. 


[Given : Fo.o5 (9, 11) = 2-90, Foo5(11,9) = 3-10 (approx.) 
and f9.95(20) = 2-086, t.95(22) = 2-07] 
[Dethi Univ. MCA, 1987] 


Solution. A normal population has two parameters, viz., mean pi and 
varizuce o*. To test if two independent samples have been drawn from the same 
normal population we have to test (Z) the equality of population means, and 
(ii) the equality of population variances. 


Null Hypothesis . The two samples have been drawn from the same normal 
population, i.e., Ho : Wy, = 2 and 6)? = 0,2. 


Equality of means will be tested by applying ¢-test and equality of variances 
will be tested by applying F-test. Since t-test assumes 6,2 = G,*, we shall first 
apply F-test and then ¢-test. 

We are given . n, = 10, n. = 12; %, = 15, X22 a 

(x = X,)? = 90, X(x2 = X>)* = 108 


F-test 
; ee | = 90 
2) 2 cen De Se ees 
Here S; Phage jy 2G x}) 9 10 
1 = 108 
2. ek ee 
Ay PE y Ue X2) 11 9-82 
Since S,? > S,?, under Ho : 6,2 = 6,2, the test statistic is 


S,? 
F =p MeL eee 1) = FQ, 11) 
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Now F =a0 21018 


Tabulated F, 0-05 (9, 11) = 2-90 


Since calculated F is less than tabulated F it is not significant. Hence null 
‘hypothesis of equality of population variances may be accepted. 


Since 6,2 = 6,2, we can now apply ¢ test for testing Hp : L =. Ye. 


t-test. Under H/o’ : 14, = [2,, against alternative hypothesis, H’ : py, # Yo, 
the test statistic is 


X1—-%2 
2 a Ry + Aq -2 = by 
S2 (i+ 4 
ny 17) 
1 = = 
where S? nF np a2 be HY + L(x2 - x2] 
1 
= 50 [90 + 108] = 9-9 
pe 5-14 _ 1 
| 11 
ro. 9-9 x — 
ae @ . A ° 
=——.~ 0.742 


Now fo.95 for 20 df. = 2-086 


Since | ¢ 1 < ts, it is not significant. Hence the hypothesis Ho’ : p24, = 2 
may be accepted. Since both the hypotheses, i.e., Ho’ : 11 = [2 and 
Hy : 6,2 = 0" are accepted, we may regard that the given samples have been 
drawn from the same normal population. 


EXERCISE 14(f) 


1. (@) If x,2 and x2” are indeperident chi-square variates with n, and nz d.f., 
obtain the probability density function of F-statistic defined by 


Fe (417/m1) 
(X27/n2) 
Mention the types of hypotheses which are tested with the help of this statistic. 
(5) Explain why the larger variance is placed in the numerator of the 
Statistic. F. Discuss.the, application of F-test in testing if two variances are 
homogeneous. 
2. An investigator, newly appointed, was madé to take ten independent 
measurements on the maximum internal diameter of a pot at specified equal 
intervals of time and the standard deviation of these ten observations was found 
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to be 0-0345 mm. After he had been some time on similar jobs, he was asked to 
repeat this experiment an equal.number of times and the standard deviation of the 
new set of ten observations was found to be 0-0285 mm. Can it be concluded 
that the investigator has become more consistent (i.e. less variable) with 
ptactice ? ° 
3. (a) Two independent samples of 8 and 7 items respectively had the 
following values of the variables. 
SampleI — : 9 11 13 11 15 9 12 14 
Sample IT: 10 12 10 14 9 8 10 
Do the estimates of population variance differ significantly ? 
(Delhi Univ. B.Sc., 1992) 
(b) Five measurements of the output.of two. units have given the following 
results (in kilograms of material per one hour of operation). 
Unit A: 14-1 10-1 14-7 13-7 14-0 
Unit B : 14-0 14-5 13-7 12-7 14-1 
Assuming that both samples haye been obtained from normal populations, 
test at 10% significance level if the two populations have the same variance, it 
being given that Fo95 (4, 4) = 6-39 
[Calcutta Univ. B.Sc. (Maths. Hons.), 1991) 
(c) In one sample of 10 observations from a normal population, the sum of 
the squares of the deviations of the sample values from the sample mean is 
102-4 and in another sample of 12 observations from another normal 
" population, the sum of the squares of the deviations of the sample values from 
the sample mean is 120-5. Examine whether the two normal populations have 
the same variance. 
4. (a) Two random samples of sizes 8 and 11, drawn from‘ two normal 
populations, are characterised as follows ; 


- Population from which the Size of Sum of Sum of squares of 
sample is drawn sample observations observations 
I 8 9-6 61-52 
I 11 16:5 73-26 


You are to decide if the two populations can be taken to have the same 
variance. What test function would you use ? How is it distributed-and what 
value it has in this sampling experiment ? 

(b) The following are the values in thousands of an inch obtained by two 
engineers in’ 10 successive measurements with the same micrometer. Is one 
engineer steuiicantly more consistent than the other ? 

Engineer A: 503, 505, 497, 505, 495, 502, 499, 493, 510, 501 

Engineer B : 502, 497, 492, 498, 499, 495, 497, 496, 498, 

Ans. Ho : 0,2 = 6,” (both engineers are equally consistent). F = 2-4. Not 
significant. 

(c) The nicotine content (in miligrams) of two samples of tobacco were 
found to be as follows : 

Sample A : 24 27 26 21 25 

Sample B ; 27 30 28 31 22 36 
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Can it be said that the two samples come from the same normal population ? 
Ans. Ho: y= oe :¢= 1-9, Not significant. 
Ho : 6,2 = 6,7 , F= 4-08< 6-26 [Fo.os(5, 4)}. Not significant. 
Hence the two ia Pee an come from the same normal population. 
5. (a) Two random samples drawn from two normal populations are : 
SampleI : 20, 16, 26, 27, 23, 22, 18, 24, 25, 19 
Sample II : 27, 33, 42, 35, 32, 34, 38, 28, 41, 43, 30, 37 


Obtain estimates of the variances of the populations and test whether the 
populations have same variances. 


[Given Fo95 =.3-11 for 11 and 9 degrees of freedom.] 
(b) Test Ho: 0,7 = 6,2 against H,: 6,74 0,7 


given n=25, ¥ (xy -x)* = 164x 44, 


n3 = 721, X(y; -—y )? = 190: 21. 
Make necessary assumptions, Stating them. 
[Caleutia Univ. B.Sc. (Maths. Hons.), 1987) 

(c) The diameters of two random samples, each of size 10, of bullets 
produced by two machines have standard deviations s, = 0-01 and s, = 0-015. 
Assuming that the diameters have independent distributions N(j1,, 0,2) and 
N(12,027), test the hypothesis that, the two machines are equally good by 
testing : 

Ho : GO; = G2 against H, : 0; # Oo. 

6. The following table shows the yield of corn in bushels per plot in 20 
plots, half of which are treated with phosphate as fertiliser. 

Treated a 0 8 3 6 1 0 3 3 1 

Untreated : 1 4 1 #2 3 2 5 0 2 40 

Test whether the treatment by phosphate. has 

(i) changed the variability of the plot yields, 

(ii) improved the average yield of corh. 

7. (a) The following figures give the prices in rupees of @ certain 
commodity in a sample of 15 shops selected at random from a city A and those 
ina sample of 13 shops from another city B. 


City A’: 7-41 7:77 7-44 7-40 4-38 7-93 7-58 
8-28 7-23 7-52° 7-82 7-71 7-84 7-63 7-68 
City B 7-08 7:49 7-42 7-04 . 6-92 17:22 7-68 


7:24 7:74 7:81 7-28 7-43 7-49 


Assuming that the distribution of prices in the two cities is normal, answer 
the following : 


(i) Is it possible that the average price of city B is Rs. 7-20 ? 
(ii) Is the observed variance in the first sample consistent with the 
hypothesis that the standard deviation of prices in city A is Rs. 0-30 ? 


(iii) Is it reasonable to say that the variability of: prices in the two cities is: 
the same ? 
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(iv) Is it reasonable to say that the average prices are the same in two 
Cities ? 


14-5-6. Relation between t and F distributions. In F-distributioy 
with (v,, v2) d.f. [c.f. 14-5 (@)], take vy) = 1, v. =vand ? =F,ie., dF = 2t dr, 
Thus the probability differential of F — to 


Le ay 
dG(t) = foe 12 ae 2t dt, Ost? <0 


i: ° “rem 

-_- = 1+ 
a(t.) v 
ae! a eres Sere 

a a 
Walt.) [ ; 


the factor -2 disappearing since the total probability in the range (— 09, °°) is 
unity. This is the probability function of Student’s ¢-distribution with v d.f. 
Hence we have the following relation between ¢ and F distributions. 


‘If a statistic t follows Student's t distribution with n df., then follows 
Snedecor’ s F-distribution with (1, n) d.f. Symbolically, 


if b~ lr) 
then (2~ Fy, n) 419 


Aliter Proof of (14-19). If & ~ N (0, 1) and X ~ "X% are independent 
r.v.’s then : 


U =~ x24) [Square of a S.N.V,] 
and t oe ~ ly) 
VXin 
2 2 
_ oe @) 


(Xin) —  (X/n)’ 
being the ratio of two independent chi-square variates divided by their respective 
degrees of freedom is F(), n) variate. 
Hence (2 ~ F(1,n) 

With the help of relation (14-19), all the uses of t-distribution can be 
regarded as the applications of F-distribution also, e.g., for test for a single 
mean, instead of computing 


fe xX — pb 
SNn 
we may compute 
F=?2= nl zu 


and then apply F-test with (1, n) d.f. and so on. 

Similarly,we can write the test statistic F fromm § 14-2-9, § 14-2-10 and 
§ 14-2-11 for testing the- significance of an observed sample correlation 
coefficient, regression coefficient and partial correlation coefficient respectively. 
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Example 14-23. Given : P[F(10, 12) > 2-753] = 0-05 
= P[F(U, 12) > 4-747] 
find P[F(12, 10) > (2-753)"'], and P[- V 4-747 <tio< 4.747 ] 
Solution. 


P[F(12, 10) > (2:753)] =P Fa. 7) 


< 2753] 
= P[F(10, 12) < 2-753] 
= 1 ~P[F(10, 12) > 2-753] 
= 1-005 = 0-95 
P[-V4-747 < ty) < V4-747 ] = P(t?19 < 4-747) 
= P[F(1, 12) < 4-747)] 
= 1~—P[F(1, 12) > 4-747] 
= 1-005 = 0-95 
14-5:7. Relution between F and 2. In F (n,, n2) distribution if we 
let nz —> 0°, then x? = nF follows %2-distribution with n, d.f. 
Proof. We have 


_ (lng? FO-) (ny + m)/2) 
P=" Tens) Tee) i im —o 
mM 


In the limit as np — e,we have 
T(r + ma)/2] (nef) 
ny” T(n2/2) ng? ane 


E ae — n*asn- o.(c/. Remark teow] 


( 1/2 
Also lim E + ee = lim ( + ar)" | 
Ro — oo n2 Ro —> co 47 i 
x lim (1 + +F " 
=exp(n,F/2)=exp(y22) "27° \ (mF 4x?) 
Hence in the limit,the p.d.f. of 72 = nF becomes i 


f2 2 Pe 
_(mfy" e=* (42 \um=1 fy? 
POD = Tn) (  ) a( e 


=——! Ry ry) 2 Qe y2 <0 
212 rn 12)“ (x) dy”, | xX 


which is the p.d.f. of chi-square distribution with n, d.f. 
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Remark, lim Tan+h) lim , (for large n) 
n —> oo (n) n— co (n-1)! 


Vim et *-P (ng enter 
Vine (10 


(On using Stirling’s aporczimattcn for n ! as n —> ©0,) 


= lim, 


n+k-5 Pa ttl | +k 5 
lim n 
—g-—k 
Ber he | at 
= 2 
G5 
lim 1 k-1 lim 1 k-1¥-2 
n— 00 n n — co 
—~k _k 


(k-1) y 
= ev * pt [<4 ape 


e~'.1 
lim T(n+k). + 
n— co Tn =n 


14-5-8. F-test for Testing the Significance of an Observed 
Multiple Correlation Coefficient. If R is the observed multiple 
correlation coefficient of a variate with k other variatés in a random sample of 
size n from a (kK + 1) variate population, then Prof. R.A. Fisher proyed that 
under the null hypothesis (Ho) that the multiple correlation coefficient in the 
population is zero, the statistic 


__R*_ n-k-1 
1-R?2° k : 
conforms to F-distribution with (k, n —k — 1) d.f. 

14:5-9. F-test for significance of an observed sample 
correlation Ratio nyy, Under the null hypothesis that population correlation 
ratio is zero, the test statistic is 


Foo, V=4 Bm Fh 1,N-h) (14-21) 


...(14-20) 


where N is the size of the sample (from a bi-variate normal population) arranged 
in h arrays. 

14-5-10, F-test for Testing the Linearity of ‘Regression. For 
a sample of size N arranged in h arrays, from a bi-variate normal population, the 
test statistic for testing the hypothesis of linearity Of regression is . 


— 2 
Fetas 4 =f. F(h-2,N-h) (14-22) 
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14-5-11. F-test for Equality of Several Means. This test is 
carried out by the technique of Analysis of Variance, which plays a very 
important and fundamental role in Design of Experiments in Agricultural 
Statistics. 

14-5. Non-Central F-distribution. The ratio of two independent x? 
variates each divided by the corresponding df. has a non-central F-distribution if 
the numerator has a non-central y?-distribution and the denominator has a central 
?-distribution. Thus, if X has a non-central x?-distribution with n, d.f. and non- 
centrality parameter i, i.e., if X ~ xn, and Y is an independent (central) 
y2-variate witn no d.f. ize., if Y ~ Xn» then the non-central F-statistic is 


defined as : 


F’= Xin, ~ n,X 
= Y/n2 = nyY 
p.d.f. of F’. Since X and Y are independent, their joint p.d.f. is given by - 
ae a ef x (n{/2)+i-1 
p(x, y¥) = pi). poly) = J 7 i! ° 2imy+22 P[(n, + 2i)/2) 
e-yi2 y 22) =i bees) 
x > OS G, y) <0. 
2 T(nf2) ’ 


_ Let us transform to the new set of r.v.’s F’ and U defined by the 
transformation ; 


F’="2= and u=y => vee 
my Mm 
| my ., 
- O(x, ae a ny 


The joint p.d.f. of F’ and U is given by 


| n uF’ | = re i-1 
° exp ca me e =a 
oo ent HE 2n No 


g(F’, 4) = = i! oi ts? P(nof2) Tim + 21/2) 


x EW? y6A2/2)~1 (rn 
no 
(n,/2) $2 
| ML pe +38 : 
1 5 yeh 2 
mimo l EL oie? ray) Pin, + 21/2) 


gt a | 
xX exp ~4#(i4%Rr)) 4 2 * 
2 no 


OS F’<0,0<u<o 
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Integrating it w.r.t. u between the limits 0 to ce and using Gamma Integral, 
we obtain the marginal p.d.f. of F’ as 
ia Fe Y2) +i -1 
ia ol EL 22 Tey) Pliny + 29/21 


ra ae 


x 
E ( "p i (ny + n2)/2 
2 Nn 
i n,[2)+i-1 


0 ; mM . h2 
B (Pei, 2) 


“(ouryrermt OS F’<oo .,.(14-23) 


n i+ (n, + A,)/2 
(i+ me) — 
no 


Remarks. 1. For A = 0, we get - 


7p’ (a,/2) - 1 
2 
g(F’) =“ —1 — acs [0 <F’< 0, 


> nfm m2) (yy perm 
2 2 n2 


since for A = 0, we get the contribution from the sum only when [ = 0 and all 
other terms vanish. Thus for A = 0, g(F’) reduces to the p.d.f. of central F- 
distribution with (n, , n2) d.f. 

2. The hyper-geometric: function of first kind is defined by 


Tia+i)Tb yt 
iF; G, b, y)= x Ta I'(b + i)” i! .-(14-23b) 
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ona nh a,/2 Pe 
n ‘ 

g(F’)= A. A, + 
mm mp) 2 
ae (t+ Re’) 

xF ny + nz ny An, F’ 

1 2 a i 

m(1+ EF’) 
rn 


3 It can be easily proved that the mean of F " ‘ is given by 


E(F’) = | F’e(F’) dF” 


- |e Xr n, +2 
= ere 7 amar y np > 2. ..(14-23¢) 
is 0 t: ny (n, =< 2) 
If A = 0 (in which case we get contribution from the sum only when i = 0), 
weget E(F’) -25 , (14-234) 


which is the mean of central F-distribution with (z,, n2) df. 


14-7. Fisher’s z-distribution. In G.W. Snedecor’s F-distribution 
with (v,, v2) d.f., if we put 


F=exp(2Z) = Z=SlogeF 14-24) 
The distribution of Z becomes- 
dF 
a2) = pir). | | 


_ (v, Iv ye ( e222) - 1] e7 


° /2 
B yy Y2 fi + al 
2° 2) v2 


v,/2 Viz 
Ses) an La ee <z<oo ,,,(14-25) 
(2.8) [teh 
® v2 


which is the probability function of Fisher’s z-distribution with (v,, v2) d.f. The 
lables of significant valucs zo of z which will be exceeded in random sampling 
with probabilities 0-05 and 0-Gi, i.e., P(z > zo) = 0-05 and P(z > z9) = 9-C1 
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corresponding to various d.f. (v,, v2) were published by Fisher (cf. Statistica] 
Methods for Research Workers) in 1925. From these tablés, Snedecor_(1934-38) 
by using (14-24) deduced the tables of significant values of the variance ratio 
which he denoted by F in honour of Prof: R.A. Fisher. 

Remark. With the help of relation (14-24), all the applications of 
F-distribution may be regarded as the applications of z-distribution also. 

14-7-1, Moment Generating Function of z-distribution. 


Mz7(!) =E(et2)= [ e!® 9(z) dz 
7 fF AF) dF [-.- e* = F) 


Since 1,’ (about origin) for F-distribution is I F'f(F) dF, we can find 
0 
m.g.f. of the z-distribution by putting r = ¢/2 in the expression for 1,’ for 
F-distribution. 


Hence Mad) = (A) aT) Bavtion (1415) 


=> Kz(t) =log Mz(t) 
=5 [log v2 — log v,] + log I'{(v, + 2/2) 


+ log P'{(v2 — )/2) — log P'(v,/2) — log P'(v2/2) 
Using Stirling’s approximation for n !, when n is large, viz., 


1 
lim T(n+1)= lim n!=N2me*n"*2 


n- co n—> oo 


=> log T(n +1) =(n +3) log n—n + log Vn, we get 


1 
a ed Pe ee 
Ks =Ho= 212 ~ v.27 7 7 v2 


whence B; and B» can be‘found. 
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Remark. z-distribution tends to normal distribution with mean 


z fe - 1) and variance + ( + s): as v; and v2 become large. 
2 \ Vo Vv} y Z2\ vy V2 

14-8. Fisher’s z-transformation. To test the significance of an 
observed sample correlation coefficient from an uncorrelated bivariate normal 
population, f-test (cf. § 14-2-10) is used. But in random sample of size n; from a 
bivariate normal population in which p #0, Prof. R.A. Fisher proved that the 
distribution of ‘r’ is by no means normal and in the neighbourhood of p = 
its probability curve is extremely skewed even for large n. If p # 0, Fisher 
suggested the following transformation - 


Z== log. |, = tanh * r ...(14-26) 


and proved that even for small samples, the distribution of Z is approximately 
norma! with mean 


& =) log, 7 P= tanh“ p ...[14-26(a)) 
and variance 1/(n — 3) and for large values of n, say > 50, the approximation is 
fairly good. 

z-transformation has the following applications in Statistics. 

(1) To test if an observed value of ‘r’ differs significantly 
from a hypothetical value p of the population correlation 
coefficient. 

Ho: There is no significant difference between r and p. In other words, the 
given sample has been drawn from a bivariate normal population with 


correlation coefficient p. 
If we take 
= jlog, ((1+r)M(1-1)) and &=3 log, (1 + p)(1—p)), 
then under Ho, 
Fe 
z~n(b, ) => 2-5 __ nq 
ae Vi/(n - 3) 


Thus if (Z - €) V (n —3) > 1-96, Ho is rejected at 5% level of significance 
and if it is greater than 2-58, Ho is rejected at 1% level of significance, where Z 
and E are defined in (14-26) and (14-264). 

Remark. Z defined in equation (14-26) should not be confused with the Z 
used in Fisher’s z-distribution (c.f. § 14-7). 

Example 14-24. A correlation coefficient of 0-72 is obtained from a 
sample of 29 pairs of observations. 


(i) Can the sample be regarded as drawn from a bivariate normal population 
in which true correlation coéfficient is 0-8 ? 


(ti) Obtain 95% confidence limits for p in the light of the information 
provided by the sample. 


Solution. (i) Ho : There is no, significant difference between r = 0-72; and 
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p = 0-80, i.e., the sample can be regarded as drawn from: the bivariate normal 
population with p = 0-8. 


Here Z =} log, te + r= 1-1513 logio ; 3 


= 1-1513 logio 6:14 = 0-907 


ll Ap \ oo (1 + 0-8) 
E =5 log, ( = Js 1-1513 logio (1 - 0-8) 
iis“ watanaa ea 


- 29.196 


S.E. (Z) = = 
FE - 3 126 
Under Hp, the test statistic is 


U _2-§ ~ N(0, 1) 


1/V n—3 
_ (0-907 — 1-100) _ 
Now | 0-196 — 0-985 
Since 1U |< as it is not significant at 5% level of significance and Ho 
may be accepted. Hence the sample may be regarded as coming from a bivariate 
normal population with p = 0-8. 
(ti) 95% confidence limits for p on the basis of the information supplied by 
the sample, are given by 


IU < 1-96 
I 
IZ -€1<1-96 x = 1.96 x 0-196 
Vn -3 
=  10-907-&1 < 0.384 
=> 0.907 — 0.384 <& < 0-907 + 0-384 
= 0-523 $<& $1291 
= 0-523 <1 tog, tery 1-291 
‘ 
= 0-523 < 0-1513 log,o tere < 1.291 
0-523 149). 1291, 
= 11513 * 10810 by . 11513 
ss 0.4543 < logio (72 4 < 1-1213 (4) 
Now logio fers = 0.4543 and logyo Gers: 1.1213 
an 7 = Antilog (0:4543) = 2-846 | => Toe = Antilog (1-1213) = 13-22 
_ 2846-1 1846 | sao, _ 13-22-11 _ 12-92 
> P=) 9464172946 799799 |= P=93>59 4517 14.22 — 786 
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Hence, substituting in (*) we get 0-48 < p <0-86 

(2) To test the significance of the difference between two 
independent sample correlation coefficients. Let r, and r. be the 
sample correlation coefficients observed in two independent samples of sizes 7, 
and nz respectively then 


1 l+r 1 l+r 2 
2, = 3 lop. (7; 1) ana Z=5 toge(1= 4s) 
Under the null hypothesis Ho : that sample correlation coefficients dq not 
differ significantly, i.e., the samples are drawn from the same bivariate normal 


population or from different populations with same correlation coefficient p, 
(say), the statistic 


(Z; - Z2) ~ E(Z; - 22) 
Z = S.E.(Z, - Z) ~ N(O, 1) 


Now E(Z, - Z,) = EZ) - EQ) = §1 - “a = 
81 = b= blog, {+2 (under Ho) 
and S.E. (Z;-2Z2) = VV(Z;) + V(Z2) 


[Covariance term vanishes since samples are independent) 


2 co r 1 
_ n; -3 n,- 3 


Under Hp, the test statistic ts 


Z pee 4 a > ~ N(O, 1) 


moat m3} 
‘In, -— 3 na- 3 


By comparing this value with 1-96 or 2-58, Hy may be accepted or rejected 
at 5% and’ 1% level of significance respectively. 

(3) To obtain pooled estimate of 9. Let r;, r2, ..., 7, be observed 
correlation Coefficients in k-independent samples of sizes 7, 72, ..:, mg from a 


bivariate normal population. The problem is to combine these estimates of p to 
get a poled estimate for the parameter. 


If we take Z;= 5 log, (j = =) Le Li 2h secnk 


1 - v; 
then Z;; i= 1,2, ..., are independent normal variates with variances a : 
t=1,2,...,& and common mean 
1 l+p 
5 = los. (; z 3 
The weighted mean (say Z ) of these Z,’s is given by 


k 
= LwZ/ 2w;, 
jot fet 
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where w; is the weight of Z,. 


Now Z is also an uabiaed estimate of &, since 
1 
E(Z)= alt 2 3 wiz il-3 wi [2m E(Z)| = mn [Ew E|= E 
and V(Z ) = Eu V[<Xw;,Z;) = un [Lw? V(Z))] 


The weights w,’s, (i = 1, 2, ..., m) are so chosen that Z has minimum 
variance, 


-In order that V,(Z ) is minimum for variations in w;, we should have 
0 1,5 : 
a OO i=1,2,...,k 
(Xw;)?2w,V(Z,) - LLw72V(Z)] 2(CLw) 
ee a a 


ms Cow 
=> w,V(Z;) = het , aconstant. 
1 : 
Wi YZ) = Mi -3)sE= 1, 25 ice sae(*) 


~ Hence the minimum variance estimate of & is given by 


k 
= Ww; iZi 2 (n; = 3)Z; 


oe [On using (*)] 
rw, 2% (ni - 3) 
and the best estimate of p is then given by”  =—* 


> I 1 + i- 3 Zi 
Z =5 logeg 5 => p= tanh | MI (ef. § 13-9-1) 


Remark. Minimum variance of Z is given by 


' 2 Ps if 
eg ee) yey 1 


(ViZ)inin = (d(n; - 3)}? = {X(n; — 3))}? 


os (n; - 3) 


Statistical Inference-I 
(Theory of Estimation) 


15-1. Introduction. The theory of estimation was founded by Prof. 
R.A. Fisher in a seriés of fundamental papers round about 1930. . 

Parameter Space. Let us consider a random variable X with p.d_f. 
fix, 9). In most common applications, though not always, the functional form 
of the population distribution is assumed to be known except for the value of 
some unknown parameter(s) 8 which may take any value on a set ©. This is 
expressed by writing the p.d.f. in the form f(x, 6), 8 € ©. The set @, which is 
the set of all possible values of 6 is called the parameter space. Such a situation 
gives rise not to one probability distribution but a family of probability 
distributions which we write as (f (x, 6), 0 € @}. For example if X ~ N (1, 0”), 
then the parameter space 

@ = ((U, 6?) : - wo <p < 00 3 0< 6 <0} 
In particular, for o? = 1, the family of probability distributions is givea by 
{N(u, 1); € @), where @ = {pL : — 00 <p < 00} 

In the following discussion we shall consider a general family of 

distributions 
(f(x.; 0,, Oo, ..., ®) :0;€ 8,i=1, 2,...., k}. 

Let us consider a random sample x), X32, ..., X, Of size n from a population, 
with probability function f(x ; 6), G2, ..., 9,4), where 6,, 9, ..., 6, are the 
unknown population parameters. There will then always be:an infinite number 
of functions of sample values, called statistics, which may be proposed as 
estimates of one or more of the parameters. 

Evidently, the best estimate would be one that falls nearest to the true value 
of the parameter to be estimated. In other words, the statistic whose distribution 
concentrates as closely as possible near the true value of the parameter may be 
regarded the best estimate. Hence the basic problem of the estimation in the 
above case, can.be formulated as follows : 


“We wish to determine the functions of the sample observations : 


N A A 
T; = 0, (x1, X2> eoog Xn) 9 T, = 6, (X41, X2, oeeg Xn)> eooeg T; = 0, (x1, X2, ooeg Xn), 


such that their distribution is concentrated as closely as possible near the true 
value of the parameter. 


The estimating functions are then referred to as estimators. 


15-2. Characteristics of Estimators. The following are some of the 
criteria that should be satisfied by a good estimator. 


(i) Consistency 
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(it) Unbiasedness 
(iit) Efficiency and 
(iv) Sufficiency 
We.shall now, briefly, explain these. terms one by one. 


15-3. Consistency. An estimator T, = T(x, x2, .-., Xn), based on a 
random sample of size n, is said to be consistent estimator of y (8), 8 € ©, the 
parameter space, if T, converges to y (9) in probability. 


P 
i.e., if T, —— Y(9) as n - 00 »-(15-1) 
In other words, T, is a consistent estimator of (0) if for every € > 0, 
1 > O, there exists a positive integer n = m (€, N) such that 
P [IT, -y@N<e] > lasn © :3.(15-2) 


= P [IT, -y@)t<e] >1-7;Vn2m ..(15-2a) 
where mm is some very large value of n. 


Remark. If X,, Xo, ..., Xa iS a random sample from a population with 
finite mean EX; =p <oo, then by Khinchine’s weak law of large numbers 
(W.L.L.N), we have 


i n —p 
Kea 2 X; —» E(X) =p, asn— ©, 


Hence sample mean (X ») iS always a consistent estimator of the population 
mean (1). 


15-4, Unbiasedness. Obviously, consistency is a property concerning 
the behaviour of an estimator for indefinitely large values of the sample size n, 
i.é., aS n —-ce, Nothing is regarded of its behaviour for finite n. 


Morcover, if there exists a consistent estimator, say, 7, of y (8), then 
infinitely many such estimators can; be constructed, e.g., 


m= ( 4 \z =| (a fale 


P 
n—b tale. |, 9 7, 9 10), as n — oo 


and hence, for different values of a and b, T,,’ is also.consistent for y(6). 

Unbiascdness is a property associated with finite n. A statistic 

T,, = T(%1, X2, ..-, Xp), 1S Said to be an unbiased estimator of (8) if 

E(T,) = ¥(8), for all 9 E © .. (15-3) 
We have seen (c.f. ‘§ 12-12) that in sampling from a population with mean 
and variance.o?, 
EQ) = and E(s?) #0? but E(S?) = 07. 
Hence there is a reason to prefer 


I e = : : [2 ae 
= (4%; -x )*, to the sample variance s?:= . > -x). 
=! i=l 


“ 
$= 
‘. — 


n-1,; 
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Remark. If E(T,) > 9, T,, is said to be positively biased and if E(T,) < 9, 
i, is said to be negatively biased, the amount of bias b(0) being given by 


b(8) = E(T,) — y(8), 8€ @ ...(15-3a) 
15:4-1. Invariance Property of Consistent Estimators. 


Theorem 15-1. /f T,, is a consistent estimator of y(@) and w(y(@)) isa 
continuous function of y(9), then. y(T,) is a consistent estimator of y (y(8)). 


Proof. Since 7, is a consistent estimator of y(9), 7, _P_, (8) as n — © 
je.. for every € > 0, n > 0, J a positive integer n 2 m (e, N) such that 
P[IT, -y@)l<e] >1-n,Va2m ..A*) 


since w(-) is a continuous function, for every € > 0, however small, 4 a 
positive number e€, such.that-I w (7,) — w(y(8)) |< €,, whenever IT, - (8) |<e 


i.€., IT,-yO)l<e => dw (7T,)- vy (8) !<e a) 
For two events A and B, 
if A => B,thnA ©B = P(A)<P(B) => P(B)2P(A) ...(***) 
From (**) and (***), we get 


P[hy(T,) ~ w(y()! < &,] =P [IT, - (0) |<e] 
2 Ply) -wo@)l<e]>1-n;Vn2m — (Using (*)] 


P 
y (T,) —> ¥ (1(8)), as n > 0 
=> w(T,,) 1S a consistent estimator of y (8). 
15-4:2. Sufficient Conditions for Consistency. 


Theorem 15-2. Let {7,} be a sequence of estimators such that for all 
ge 8, ; 


\ 


(t) Eg(T,) > 7(8), n — 0 
and (it) Varg(T,) > 0, as n > ©, 
Then T,, ts a consistent estimator of y(@). 
Proof. We have to prove that 7, ‘is a consistent estimator of (8) 


L.eé., T,, ats (8), as n — © 


ie., P [IT,-y)l<e] >1-n;Va2me€,n) (15-4) 


where € and 7) are arbitrarily small positive numbers and m is some large value 
of n: 


Applying Chebychev’s inequality to the statistic T,, we get 


P[IT, - Eo (T,) |< §] > 1 - ete (15-5) 


We have 
17,, — y(0) |=17, -— E(7,) + ECT,) — y (8) | 
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$17, —E(T,)!+)E9(T,) —¥ (8) ! .- (15-6) 
Now 
IT, -E9 (7,155 => IT, —y(0)1 55 +1£Q(7,) — (8)! —...(15-7) 
Hence, on using (***) of Theorem 15-1, we get 
P [IT, + (6) ($8 41 Eg (T,) - y(0)!] 2 P [IT,, - Eo (T,) |S 8] 
21- Wate Co) [From (15-5)]} ...(15-8) 
We are given ; 
E, (7,) > (8) V8 E Sasn> -~., - . 
Hence, for every 5, > 0, 3 a positive integer n 2 ng (8,) such that 


| E,(T,) -8) I< 51, Vn2 No (8) »-(15-9) 
Also Var,(7,,) — 0 as n — 09,-(Given). 
Vago) <n »>Vn2ng (N) »-6(15-10) 


where 1 is arbitrarily small positive number. 
Substituting from (15-9) and (15-10) in (15-8), we get 

P (iT, (0) 158 +5,] 21 -qn2m.,.0) 
=> P[IT, ~y(0) Ise] 21-n nem 
where m = max (no, mo’) and € = § + 8, > 0. 


14 ' 

=> T, — ¥(8), aS n — oo [Using (15-4)] 

=> T,, is a‘consistent estimator of y (0). 

Example 15-1. 21,2, ... X, is a random sample from a normal 
population N(, 1). Show that t = - = x/, is an unbiased estimator of p22 + 1. 

i=l 
Solution. (a) We aré given 
E(x) = 2, Va) = 1 Viz=T,2,...,2 
Now E(x?) = Vix) + (E(x))}? = 1+ p? 


B= 5 > ae: EGA=1 E (+p =1+p 
Nijal Niwl Niwot 


Hence ¢ is an unbiased estimator of 1 + 22. . 
Example 15-4, If T is an unbiased estimator for 9, show that T? is 0 
biased estimator for 67. 
Solution. Since T is an unbiased estimator for 0, we have 
E(T) =0 
Also Var (T) = E(T?) - [E(1)}* = E(T?) - 
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=> E(T?) = 62 + Var(T), (Var T > 0). 
Since E(T*) 62, T? is a biased estimator for 6. 
Example 15:3. Show that (oe fos) 
&, for the sample x), X2, ..., X, drawn on X which takes the values I or O with 
respective probabilities 8 and (I — 9). 

Solution. Since ‘x,, x2, ...,% iS a random sample from Bernoulli 
population with parameter 0, 


is an unbiased estimate of 


Ts 2 x; ~ B(n, 6) 


an E(T)=n0 and Var(T) =n 0 (1-8) 
e| pa) |. e{ BED | 
n(n — 1) = n(n — 1) 


a 
=n TEI -ED)] 
=a Tay vat + (2=)?- EM) 


=a Tp lnea- ~ 0) +n262-n0] 


= n 0? (n - 1) = 92 
~ n(n-1) — 
=> [Xx; Cx;- 1)] / [n(a - 1)] is an unbiased estimator of 67. 
Example 15-4.Let X be distributed in the Poisson form with parameter 
9. Show that the only unbiased estimator of exp [- (k +1 Jo], k > 0, is 
1(X) = (-k)* so that 
T(x) > O if x is even 
and T(x) < 0 if x is odd. 
[Delhi Univ. B.Sc. (Stat. Hons.), 1993, 1988] 


x 
Solution. E(T(X)}=E [(-k*],k>0= > (- pe : 
=e" y. tO] ete +k)6 
seql. 2 
=> T(X)= (—k)* is an unbiased estimator for exp [-- (1+ & 6],&>0. 
Example 15-5. (a) Prove that in sampling from a N(p, 0°) population: 
the sample mean is a consistent estimator of ps: 


(b) Prove that for Cauchy's distribution not sample mean bit sample 
median is a consistent estimator of the population mean. 
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Solution. In sampling from a N(u, 6”) population, the sample mean x is 
‘algo.normally distributed as N (1, o7/n). 
=> E(x)=p and V(x)=07/n 
Thus as n > 9, 
E(Z)=p and V(x)=0 


Hence by Theorem 15-2, x is a consistent estimator for {. 
(6) The Cauchy’s sar ae is given by the probability function 
1 _ de 
dF(x) = 2 1 + (x -p)?’ 
The mean of the distribution, if we conventionally agree to assume that it 
exists, iS at x = LL. 


—0 <xX <00 


If x, the sample mean is taken as an estimator of t, then the sampling 
distribution of x is given by 
> 1 
dF(%) =+ , —B— ;_w<x¥eu Oy 
wo1+.(x -) ‘ 
because in Cauchy’s distribution, the distribution of x is same as the 
distribution of x. 


Since in this case, the distribution of x is same as the distribution of any 
single sample observation, it does not increase in accuracy with increasing n. 
Hence we have 


E(x) = but V(x ) = V(x) #0, asn > © 


Hence by Theorem 15-2, x is, not a consistent estimator of [4 1n unis case. 
Consideration of symmetry of (i) is enough to show that the sample median 
Md is an unbia.2d estimate of the population mean, which of course is same as 
the pope median. 
E(Md) =\ -»- (ii) 
For large n, the sampling distribution of median is asymptotically sail 
and is given by 
dF o exp [-2n f,? (x - w]e 
where f, is the médian ordinate of the parent population. 


. = _@an 2 i 
i.e., dF « exp i2n ay .. (iil) 
But fi = Median ordinate of (i) 

= Modal ordinate of (7) [Because of symmetry] 


1 
= 63) pt = rT 
Hence, from (iii), the variance of the sampling distribution of median is : 
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V(Md) = inf? Il an + Oasn—> ...(iv) 


Hence from (ii) and (iv), using Theorem 15-2, we conclude that for 
Cauchy’s distribution, median is a consistent estimator for . 

Example 15-6. If X,, X2, ...,X, are random observations on a Bernoulli 
variate X taking the value 1 with probability p and the value 0 with probability 
(I — p), show that : 

ne (1 - Dei rai is a consistent estimator of p(1 — p). 
[Dethi Univ. B.Sc. (Stat. Hons.), 1988] 

Solution. Since X,, X2, ..., X, are i.i.d Bernoulli variates with parameter 
Ps 


T 5 x;~ B (n, p) 


=> E(T) =np and Var(T) = npq 
lL. -e T 
x a? ea 
E(X) =+£ (1 =+ p =p 
n n° 
Var (X) -var( 7) 5. Var (1) =? Oas n> @. 
Since E(X) > p and Var (X) > 0, as n -. 0; X is a consistent estimator 
of p. 
Also 2s (i - ale X (I -X), being a polynomial in X,is a 


continuous function of X. 
Since X ‘is consistent estimator of p, by the invariance property of 


consistent estimators (Theorem 15-1), X (1 =) 4 ) is 4 Consistent estimator of 
p (1 -p). 

15-5. Efficient ‘Estimators. Efficiency. Even if we confine ourselves 
to unbiased estimates, there will, in general, exist more than one consistent 
estimator of a parameter. For example, in sampling from a normal population 


N (u, 0”), when o? is known, sample inean xX is an unbiased and consistent 
estimator of tt [c.f Example 15-5a]. 

From symmetry it follows immediately that sample median (Md) is an 
unbiased estimate of f1, which is the same as the population median. Also for 
large n, 


V(Md) = (c.f, Example 15-5(8)) 


_1 
An f,2 
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Here f; = Median ordinate of the parent distribution. 
= Modal ordinate.of the parent distribution. 


_|—! ee *7o | a= 
| or | @-wrne") | =e 


ae ae en 
V(Ma) =4,° 200 = 5, 
Since E(Ma) = 
and pr tea oon 


median is also an unbiased‘and consistent estimator of 1. 


Thus, there is a necessity of some further criterion which will enable us to 
choose between the estimators with the common property of consistency. Such 
a criterion which is based on the variances of the sampling distribution of 
estimators is usually known as efficiency. 


If, of the two consistent estimators 7: T, of a certain parameter 6, we have 
V(T\) < V(T2), for all n (15-11) 
then 7, is more efficient than T, for all samples sizes. 
We have seen above : 


For all n, Vz) == 
df v(M 1.57& 
and for large n, (Md == 21. 7 


Since V(X) <V-(Md), we conclude that for nornial distribution, sample 
mean is more efficient estimator for j1 than the sample median, for large samples 
at least. 

15-5-1. Most Efficient Estimator. /f in a class of consistent 
estimators for a parameter, there exists one whose sampling variance ts less than 
that of any such estimator, it is called the most efficient estimator. Whenever 
such an estimator exists, it provides a criterion for measurement of efficiency of 
the other estimators. 

Efficiency (Def.) If T, is the most efficient estimator with variance V, 
and T, is any other estimator with variance V>, then the efficiency E of T> is 
defined as: 

* V. 


ie ook : 
E= V> .» (15-12) 


Obviously, E cannot exceed unity. 
If T, T,, Tz, ..., T, are all estimators of y(@) and Var(T).is -minimum, then 
the efficiency E; of T;, @ = 1, 2, ...,.n) is defined as : 
Var T 
Var T;’ 


E;= sped. 2.. ...(15-12a) 
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Obviously E;< 1,i=1,2,... 2. 


For example, in the normal samples, since sample mean x is the most 
effic’ent estimator of 1 [c.f. Remark to Example 15-31], the efficiency E of Md 
for such samples, (for large n), is : 


_V(x)_ .0jn 2 
am V(Mda) no*/(2n) 0-637 
Example 15:7. A random sample (X,, Xz, X3, X4, Xs) of size 5 is drawn 


from a normal population with unknown ‘mean ys. Consider the following 
estimators to estimate pl. : 


"OX, + Xo +X34+Xq4+KX 
() t) = pee + 4+ As 
X,+X 2X », ¢ AX 
(i) y= b+ Xs, Gi y= a 
where 4 is such that ty is an unbiased estimator of p. 


Find A. Are t, and t, unbiased ? State giving reasons, the estimator which is 
best among t;, tb and t3. 


Solution. \We are given 
EX) =, (X;) = 07, (say) ; Cov (X;, X;) = 0, @ #j = 1,2, ..., ) 
s<(*) 


12 i= 1 
(?) Eq)=c L EX)=5 2 pac. Spay 
Spat Diet 5 


=> ‘, is an unbiased estimator of pL. 
(i) El) =} E(X, + X2) + E(Xs) 


=s(utp)tn [Using (*)] 
= 2 

~> tis not an unbiased estimator of 1. 

(iit) E(t) =p 

= yE(2X, +X2+AX3) =p 

= 2E(X,) + E(X2) + XE(X3) = 3p 

=> Qw+twt Ap = 3p 

=> AU=O0 => A=0 

Using (*), we get, 


Vit) = 3g [VO%1) + VOK2) + VX) + VX4) + VX5)] = 5 07 


V(t.) = 4 [V(X1) + V(X2)] + V(X3) = $02 + 0? = 3? 
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V(ts) = 5 [4V(X) + V(X) & £ (40? + 0%) = F0? (.:% = 0) 


Since V(¢,) is the least, ¢; is the best estimator (in the sense of least 
variance) of pL. 

Example 15-8. X;, X2, and X3 is a random sample of size 3 from a 
population with mean value 1 and variance 07, T;, Tz, T3 are the estimators 
used to estimate mean value tt, where 

T, =X; + X.—X3, Tz = 2X; + 3X3 -4X,, and T3 = (AX, +X2 +.X3)/3 
(i) Are T, and T, unbiased estimators ? 

(ii) Find the value of A such that T; is unbiased estimator'for pL. 

(iii) With this value of A is T; a consistent estimator ? 

(iv) Which is the best estimator ? 

Solution. Since X,, X2,X3 is a random sample from a population with 
mean 1 and variance 02, 

E(X;) =, Var (X;) = o? and Cov (X;, X;) =0, @ #j ="1, 2,...,2) ...(*) 

@) E(T)) = E(X)) + EX) - Es) =W+H-NEy 


=> T, is an unbiased estimator of 
E(T>) = 2E(X;) + 3E(Xs) — 4E(Xq) = 20 + 3-4 =p 
=> T> is an unbiased estimator of pL. 

(ii) We are given: E(T3) =u 


=> —- S [AE(X,) + EX) + E(Xs) = 
=> SAM ++ Ws > AptMpe3p => ASaL. 


(iii) With A= 1, T3= 4 (X1 + Xp 4X3) = X 
Since sample mean is a consistent estimator of population mean py, by 
Weak Law of Large Numbers, 7; is a consistent estimator of . 
(iv) We have [on using (*)] : 
Var(T,) = Var(X,) + Var(X2) + Var(X3) = 302 
Var(T2) = 4 Var(X;) + 9 Var(X3) + 16 Var (X,) = 29 o* 
Var(T3) = 5 [Var (X1) + Var(Xq) + Var (X3)] = +0? (=) 
Since Var(T3) is minimum, T, is the best estimator in the sense of 
minimum variance. 
15-5:2. Minimum Variance Unbiased (M.V.U.) Estimators. 
If a statistic T = T(x;, x2, ....X,), based on sample of size nis such that : 
(i) Tis unbiased for 4), for all@e @ and 


(it) It has the smallest variance among the class of all unbiased:estimators 
of ¥(8), 
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then T is called the minimum variance unbiased estimator (MVUE) of y(@). 
More precisely, T is MVUE of ¥ (6) if 
E,(T) =¥ (0) forallO ec © ..(15+13) 
and Var,(7) < Var,(T’) for all@e © ...(15-14) 
where T’ is any other unbiased estimator of y (8). 
We give below some important Theorems concerning MVU estimators. 


Theorem 15-3. An M.V.U. is unique in the sense that if T, and T2 are 
M.V.U. estimators for y(@), then T; =T>, almost surely. 


Proof. We are given: that 


E 9(T,) = Eg (T2) = y (8), for all 6 € © 
and Varg(T,) * Vare(T>) for all 0 € @ -+(15-15) 
Consider a new estimator 
T =3(T,+T;) 


which is also unbiased since 
E(I) => {E() + E(7)) = 6 


Var (T) = Var [5 (T, +T2)] = + Var (T, +72) [-- Var (CX) = C2 Var (X)] 
= 4 [Var (7,) + Var (Tz) + 2 Cov (T;, T2)] 
= 4 [Var (T;) + Var (T2) + 2p VWar (T,) Var (T2) ] 
=5 Var (T;) (1 +p), ...[From (15-15)] 


where p is Karl Pearson's co-efficient of correlation betweeri T, and T4. 


Since T, is the MUV estimator, 
Var(T) 2 Var (T;) 
=>  5Var(T,)[1+p] >= Var (T,) 
=>, 5 (1+) 21,ie, p21 
Since | p | <1, we must have p = 1, i.c., T, and T, must have a linear 
relation of the form : 
T; =a+BT>, (15-16) 
where a and’B are constants independent of x, x2, ....X, hut may depend on 8, 
i.e., we may have a = (8) and B = B(6). 
Taking expectation of both sides in (15-16) and using (15-15), we get 
, 6=a+ BO ..(15-17) 
Also from (15-16), we get 
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Var (T;) = Var (a + B Tz) = B? Var (7) 
=> 1=—8? => fB=+1 ...[From (15-15)] 
But since pT, T>) = +1, the coetficient of regression of T, and T, must be 
positive. 
x B =1 > a=) [From 15-17)) 
Substituting in (15-16), we get T; = Tz as desired, 
Theorem 15-4. Let T, and T, be unbiased estimators of “y(@) with 


efficienciés e, and é2 respectively and p = pe be the correlation coefficient 
between them. Then 


Vee, - Vl = €1) (1-2) Sp $ Vey en + V(I — 1) (1 - 2) 


Proof. Let T be the minimum variance unbiased estimator of (0). Then 
we are given : 


ET;) = y(0)=E,(T2), V8 E€ 9 . (15-18) 
VAT) Ve V 
: V =— (15-19 
and 4 = VT.) = : (say) = Vi= a ; ( ) 
_ Volt) _ Vv 
€, = Vo(Ts) “¥, , (say) => V2= } . (15-20) 
Let us consider another estimator ; 
T; = AT + WT, »-(15-21) 
which is also unbiased estimator of (6), 
L.e., E(T3) = (A +) (0) = (0) (Using (15-18)) 
= A+p =1 » (15-22) 


Vo(T3) =V AT, + T2) 
= 2 V(T,) + 2? V(T2) + 2A Cov (71, T2) 


V x ms oe (Using (15-19) and (15-20] 
= —-+ + SING (15-17) an : 
E N rnd 

But Vo (T3) 2V, since V is the minimum variance. 


Sd wv 2pAu 
— fe >1=(A+ nu) [Using (15-22)] 


1 1 a p 
=> —— 1/X274)/—- 1 |p24+ 20 -1)20 
e - Ja : e €1€2. } 


= (a NG ye el Gece 1] >0 ...(15-23) 


which is. quadratic expression in (A/j1). 
Note that : 


e<1l > 1 => & 1)>Qi=1,2 
é; e; 


8 8 
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We know that 
AX? +BX +C 20 Vx,A>0,C > 0; 
if and only if 
Discriminant = B? - 4AC < 0 .. (15-24) 
Using (15-24), we get from (15-23) : 


Gis JB Gee 


=> (p -Vee2 )?-(1-e1) (1-e2) < 0 
=> 2-2 Ve; e2 P+ (e;t+e2-1)< 0 


This implies that p lies between the roots of the equation 
92-2 Ve, e> p +(e, te,-1) =0 
which are given by 


1[2 Vee, + 2 V e1e.-(e + é€,-1) 
= Ve, e, +V(e - 1) (e2- 1) 


Hence we have : 
=> Vere, -V(1-e:) (1 — 2) Sp $ Ve, ey + V(1 - e;) (1 - €2) 
| ..(15-25) 
Corollary. If we take e, = 1 and e2 =e in (15-25), we get 
Ve<p cVe => p=ve 


This leads to the following important.result, which we state in the form of 
a theorem. 


Theorem 15:5. [f T,; is an MVU estimator of y(@), 8 € © and T> is any 
other unbiased estimator of y (8) with efficiency e = €9, then the correlation 
coefficient between T, and T> is given by 


p=Ve ie, po=Veg,VOE@. 


For an alternate proof, see Examples 15-9 and 15-10. 


Theorem 15:6. If T; is an MVUE of y(@) and T> is any other unbiased 
estimator of y(8) with efficiency e < 1, then no unbiased linear combination of 
T, and T, can be an MVUE of y(9). 


Proof. A linear combination. 
T = LT, + loT 2 .» (15-27) 
will be unbiased estimator of (6) if 
EQ) =1,E(7,) + LE(T2) = (6), forall@e © 
=> ; +l, =1 .- (15:27) 
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since we are given E(T,). = E(T2) = y(@). 


We have 

_ Var (T)) _ Var T, 

= Var (T>) Var T, = ; .. (15-28) 
and 9 =p(7), T2)= Ve {c.f. Theorem 15-5] 


From (15-27), on using (15-28), we get 
Var T = 1,2 Var (T;) + lo? Var (T>) + 21, Cov (T;, T>) 


= [,? Var (T;) + lo? Var (T>) + 2b p VVar (T;) Var (T2) 
2 
= Var (T;) 1 + 24 2b] 


Ve 


2 
= Var (7)) 2 + Zyl, + Ld (..- p Ve) 
2 Var T; (1,2 + 21, l5 + [7] (..-0 <esl> m2 1 
= Var T, (F + 1,)? 
= Var(T}) [From (15-27a)] 
=> T cannot be an MVU estimator. 


Example 15-9. If T, and T, be two unbiased estimators of y(@) with 
variances O,*,0 2 and correlation p, what is the best unbiased linear 
combination of T, and T,.and what is the variance of such a combination ? 

[Delhi Univ.B.Sc. (Stat. Hons.), 1990] 


Solution. I.et 7, and 7, be two unbiased estimators of (9). 


E(T,) = E(T2) = (8) (1) 
Let T be a linear combination of 7, ‘and T, given by 
T=hT, + LT? (*) 
where /,, l, are arbitrary constants. 
E(T) = lE(T,) + 1 E(T2) = (lr-+ 12) Y) [From (1)] 
.. T is also an unbiased estimator of (8).if and caly if 
Fi + lo =] (2) 


Now VT) = VAT; + bT,) 

= 1,2V(7;) + 122V(T>).+ 21,12 Cov-(T,; T2) 

= 1;26,2 + 1,2 do? + 2h;12p G02 .- (3) 

We want the minimum value of (3) for variations in /, and 1, subject to the 
condition (2). 


= V(T) =0 =1,0,2 + 2p 0,02 
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r 
oo = 1, 62? + |, 9.5402 


Substracting, we get 
1(G,? ~ 0162) = 1, (Oy? - 00,02) 
shad l sted lo * I+ ls 
O2?-90,02 6);7-p0102 ~ 6,7+ 62*- 296,02 
1, ; 
6,2 + 0,7 ~ 206,60, [From (2)] 
ye eee and p= pS PS gy 


6,2 + 0% — 290,02 6)? + 62? — 290102 

With these values of |; and 2, T given by (*) is the best unuiased linear 
combination of T, and T2 and its variance is given by (3). 

Example 15-10. Suppose T; in the above example is an unbiased 
minimum variance estimate and T> is any other unbiased estimate with variance 
ole. Then prove that the correlation between T, and T> is Ve ‘ 

Solution. The coefficients of the best linear unbiased combination of 7, 
and Tz, given by (*) in Example 15-9 are given. by (4). 

We are given that 6,2 = V(T;) = 02 


o2 
and FNS = V(T2)=6,7=07/e 


V(T2) V(T>) 
Substituting in (4) of Example 15-9, we get 
= 1-pvVe 
’ , Ve 
, whereeD=1+e-2p Ve _...(5) 
e-p Ve 
ly = D : 


Hence from (*), the unbiased statistic is 
a [a= Ve) 7 + (e- pve) 7 
and from (3) the minimum variance is : 


VT) = AL a- pVeyra? +(e~pVer= +2(1-pVe) (e-p Ve.p.0.0Ne | 


= Fl (1+ p%- apVe)+4 (e2 + pe—2peVe) +2 (1- pie) (Ve-poe | 


D? 
S| 1 +p%- 2p Ve +e+p- 20 Ve + 2 (pVe-p%e- p+ 0 | 
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-$|1- pe +e-p?- 2p Ve + 2p2Ve | 
=$| (1 +e -2pVe)- pre + 1-29 Ve) | 


_ G1 = p?)(1 + e ~ 2p Ve) _ 671 - p?) 
(1+e-2p ey 1+e-2p Ve 
o? (1 - p?) , 
"(1 =p?) + (Ve ~p) 
a i 1 oe 
o “(1 ~p2) + (Ve-p/ . 


Since T, is the most efficient estomator, 


WT) te = ae AD) 
From (6) and (7), we get 
V(T) =1. ie. eee rae 
ee 92) 4 Ve = py 
= (Ve-p=0 = p=ve 


Aliter. From (5) onwards. Since T, is given to be the most efficient 
estimator, it cannot be improved upon (cf. Theorem 15-6). Hence, in order that 
T defined in (*) is minimum variance unbiased estimator we must have 


Ah >| => p=Ve ..-[From (5)] 
Remark. This problem leads to the following very important result : 
“The correlation coefficient between a most efficient estimator and any other 

estimator with efficiency e is Ve.” 


Example 15-11. (a) Show that if a most efficient estimator A and a less 
efficient estimator B with efficiency e tend to joint normality for large samples, 
B -A tends to zero correlation with A. 

[Dethi Univ. B.Sc. (Stat. Hons.), 1988] 

(6) Show that the error in B may be regarded as composed (for large 
samples) of two parts which are independent, the error in A and the error in 
(B — A). 

(c) Show further that 


V(A- By = (2- 1) VA) 


Solution. (a) We have to prove that 
rfA.(B -Ayl=0 = Cov (A, B-—A)=0 
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Cov [A, (B — A)] = Cov (A, B) - V(A) = po,dx - 42, 
where p is the correlation coefficient between A and B. 


If we take O, = O, then Op = ir and p= Ve (cf. Theorem 15-5) 
e 


eo. 
Ve 
Hence (B — A) has zero correlation with A. 
(b) We have B = A + (B — A) 
V(B) =V[A + (B -A)] = V(A) + V(B -—A) + 2 Cov (A, B - A) 
= V(A) + V(B - A) [Using part (a)] 
=> Error in B = Error in A + Error in (B — A) 
and since A and (B — A) are independent, [c.f. part (a) viz., r(A, B — A) = 0 and 
A and B tend to joint normality], the result, follows. 
() V(A-B) =V(A) + V(B)-2 Cov (A, B) 
= O42 + Gp? -2 P.d, Op 
=o+2_2 Veo. 


Ve 


2 
teases |e? 
€ € 


Example 15-12. /f T, and Tz are two unbiased estimators of y(@), 
having the same variance and p is the correlation between them, then show that 
p22e —1, where e is the efficiency of each estimator. 

Solution. Let T be MVUE of (6). Then, since V(T,) = V(T2), the 
efficiency e of each estimator is given by : 

jew VO). A) 
VT) V(T2) 
Consider another unbiased estimator of (6) viz., 


1 
Tz =35 (7, + T2) 


Cov (4, B- A) =Ve.0.-4 - o2=0 


=> WT) =4(V(T}) + VT) + 2 Cov (T;, T)] 
=1/¥O, YO, 95 ./ YO VO 
é é 4 


=4|— 
rat +14 2p] ot OU@ 


Since V(T) is the minimum variance, 


(i+p).V(T) 
V(T;) = 7 YO) 57} 
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=> 1+p 2 2e => p 2(2e - 1). 

Aliter. Deduction From (15-25). If T, and T> have same 
variances/efficiencies i.e., €, = e2 = e, (say) then (15-25) gives 

e-(l-e)spse+(l-e) => p 22e-1. 

15-6. Sufficiency. An estimator is said to be sufficient for a parameter, 
if: it contains all the information in the sample regarding the parameter. More 
precisely, if T = ¢(x,, X5, ..., X,) is an estimator of a parameter 0, based on a 
sample x, X2, ...,X, Of size n from the population with density f(x, 8) such that 
the conditional distribution of x, x2, ..., x, given T, is independent of 8, then T 
is sufficient estimator for 9. 

Illustration: Let XX, ... X, be a random sample om a Bernoulli 
population with parameter ‘p’,0 < p< 1, i.e., 

_ 1, with probability p 
= : with probability g = (1 - p) 
Then T =1 (4, Xo, ..-, Xq) =X +XQ+... +X, ~ Bn, p) 


pore k)=( 7, \ph-p ye 


The conditional distribution of (x;, X2, ..., X,) given T is 
P[x1Nx2N...0X,0T = k] 


Plxynxn...ax, |T =k] = P(T =k) 
_ pk(i-pyrt 
n k _ pn\a-k ‘ 
G i _— ¢ 
0, if , Xx; # k 


iw) 
Since this does not depend on ‘p’, T = S x;, ts sufficient for ‘p’. 
i=} 


Theorem 15-7. Factorization Theorem (Neyman). The necessary 
and sufficient condition for a distribution to admit sufficient statistic is provided 
by the “factorization theorem’ due to Neyman. 

Statement T ='t(x) is sufficient for 9 if and only if the joint density 
function L (say), of the. sample values can be expressed in the form 

L = go ([t(x)].h(x) .. (15-29) 
where (as indicated) go [t(x)] depends on 6 and x only through the value of t(x) 
and h(x) is independent of @. 

Remarks 1. It should be clearly understood that by ‘a function 
independent of 8’ we not only mean that ‘it does not involve 9 but also that its 
domain does not contain : For a the function 


fo=z, a-O0<x<a+0;-~2<0<c 


depends on 6, 
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2. It should be noted that the original sample X = (X,, Xo, ..., X,), is 
always a sufficient statistic. 
3. The most general form of the distributions admitting sifficient statistic 
is Koopman’ s form and is given by 
L=L (x, 8) = g(x).h(8). exp {a(0)y(x)} -- (15-30) 
where h(@) and a(Q) are functions of the parameter 9 only and g(x) and w(x) are 
the functions of the sample observations only. 


Equation (15-30) represents the famous exponential family of distributions, 
of which most of the common distributions like the binomial, the Poisson and 
the normal with unknown mean and variance, are the members. 


4. Invariance Property of Sufficient Estimator. 

If T is a sufficient estimator for the parameter 6 and if w(T) is a one to one 
function of T, then w (T) is sufficient for y(8). 

5. Fisher-Neyman Criterion. A statistic t; = t; (X%1,X2, ..., Xn) tS 
sufficient estimator of parameter 0 if and only if the likelihood function (joint 
pdf. of the sample) can be expressed as : 


L = TT Ax 8) 


= 21 (t;, 9). k (x45 x2. uvog Xa) (15-31) 
where g, (t;,8) is the p.df. of statistic t, and k (x), x2, ..., X_ ) is a function of: 
sample observations only independent of 0. 

Note that this method requires the working out of the p.d.f. (p.m.f.) of the 
statistic ¢; = U(X1,Xg, --+r Xn) which is not always easy. 
Example 15-13. Let x, x2, ...,X, be a random sample from a uniform 
population on [0, ®]. Find a sufficient estimator for 6. 
[Madras Univ. B.Sc., Oct. 1992] 


Solution. We are given 


5, 0S x; <0 
0, otherwise 
l,ifa<b 
0,ifa>b 
K(O, xj) k(xi, 9) 
7) ; 
Mf E 2) BGs 2} 


fox) = 


Let k(a, b) -| 
Then fax) = 
L = Tl feiss 

k(O, min x) & (max x;, 6) 


Lsisn ° lsisn i 
= on 8 [#(x)] A(x) 
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where, go [1(x)] = ALi, S} ,x) = max x; andh(x)=k0, min x) 


an 
1sisa lsisa 


Hence by Factorization Theorem, T= max 4;, is sufficient statistic for 0, 


lsisa 
Aliter. We have L = i f(x;, 9) + an ;0<x; <8 a (3) 
If t = max - X2, «-++Xq) = Xiqy, then p.d.f. of T is given by : 
g(t, 8) =n [F (Xm) }**! -S(X(ny) (ii) 
Wehave F(x) = P(X <x)= f; fx. 0) dx = in 5 dr nd 
) 0 0 
: _ 
g(t,0) =n | & ) [From (ii)} 
n a-1 
= on [x] 
Rewriting (i), we get 
a-l 
, tel  __1 
6" n [xtny)” - 
= 9(t, 8). h (x1, Xo, ..- Xp) ‘ 


Hence by Fisher-Neyman criterion, the statistic ¢ = xj) , 1S sufficient 
estimator for 8. 


Example 15-14, Let x;, x2, ...,X,. be a random sample from N(p, 0°) 
population. Find sufficient estimators for pLand o°. 


Solution. Let us write 
@ =(1, 07); -—wo<p <0, 0< 02 <00 


Then 
L= Ti fla) = =) exp - =a 2, (x; - H| 
(chs) ode (3,27 202+ me) 
1 
= ge [e(x)]. h(x) 
where 


26 [0(x)) ge) exp 5 ach (208) ~ 28 (x) + np? | 


(x) = (t:(x), 2(%)) = (Lx; Lx?) and A(x) =1 
Thus ¢(x) = %y; is sufficient for 1: and t(x) = Ex7, is sufficient for 0”. 


Statistical Inference (Theory of Estimation) 15.21 


Example 15:15, Let X,,X>2,.... X, be a random sample from a 
distribution with p.df. 
Ii, 0) =e 8) Bex<w;-w <8 <0 
Obtain sufficient statistic for 8. 
Solution. Here 


L = TI fe 0)= TT [e~ SF - 7 


= exp | xr x | x exp (n@) .-(*) 
a 
Let Y;, Yo, ..., Y, denote the order statistics of the random sample such 
that Y; < Yo <... < Y,. The p.d.f. of the smallest observation Y, is given by 
. 81 (y, , 8) =nl- FO)" f On, 8) 
where F(-) is the distribution function corresponding to p.d.f. f(-). 
e7 (x,- 8)- 
-1 


=z 
s 


- =] —~e -@-9) 
9° 


z 
Now FO) = [e- dew 
0 


“800 anfee-O]E 1-9 


=ne""-9 Q<y, <0 
= 0, otherwise 
Thus the likelihood function of X,, X2, ..., X, may be expressed as 


n 
L =e" exp (- 2 +) 
i= 


exp es x;) 
=nexp {-n(y, - 8)} prerny 


‘exp (— 2 xj) 
= g, (min x; 6)| ————-+——_ 
81 MIN Xs 97 exp (— n inin x,) 
Hence by Fisher-Neyman criterion , the first order statistic 
Y, = min (X,, Xp, ..., X,) is a Sufficient statistic for 0. 


Example 15-16. Let X,, Xz, ....X, be a random sample from a 
population with p.df. 


fix, ®) =0x°"' -0<x<1, O>0. 
Show that t; = I] X;, is sufficient for 6. 
ial 
[Dethi Univ. B.Sc. (Stat. Hons.), 1988; Agra Univ. B.Sc., 1992] 
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Solution. L (x, @) = Tr Kx, ) = en T (x -') 


° ] 
-o(0 oe 
i=l (11 «:) 


= ath, 8). h (x1, X?, ooes Xn)» (say). 
Hence by Factorisation Theorem, 


a 
t = au X;, is sufficient estimator for 9. 
gm 


Example 15-17. Let X;, X2, .... X, be a random sample from Cauchy 


population : 
“fe @) = = aT) tek @ < co, 


Examine if there exists a sufficient statistic for 0. 


1 


oe eee aren 
Solution. L (x, 6) = TT fe. 0) ==. J, 1+ (x,— 6) 


# g(t, 0) . A(x}, Xo, 06) Xn) 


Hence by Factorisation Theorem, there is no single statistic, which alone, 
is sufficient estumator of 0. 


However, 
L (x, 6) = ky (X1,X2, eoey Xa 6). ko (X1, X, eoey Xn) 
=> The whole set (X;, X2, ..., X,) is jointly sufficient for 6. 
15:7. Cramer-Rao Inequality 


Theorem 15-8. If ¢ is an unbiased estimator for y(8), a function of 
parameter 0, then 


OP re 
we! Yep 


3 2 I) 
E E log L| 


where I(6) is the information on 8, supplied by the sample. 


In other words, Cramer-Rao inequality provides a lower bound [y’ (6)]2/(8), 
to the variance of an-unbiased estimator of (8). 

Proof. In proving this result, we assume that there is only a single 
parameter 6 which is unknown. We also take the case of continuous r.v. The 
case of descrete random variables can be dealt with similarly on réplacing the 
multiple integrals by appropriate multiple sums. 


Var (t) (15:32) 
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We further make the following assumptions, which are known as the 
Regularity conditions for Cramer-Rao Inequality. 

(1) The parameter space © is a non-degenerate open interval on the real line 
R : (- o°, 00), 

(2) For almost all x = (x,, x2, ..., X,), and for all69 e © 


the exceptional set, if any, is independent of 9. 

(3) The range of integration is independent of the parameter 9, so that f(x, 9) 
is differéntiablé under integral sign. 

If range is not independent of 6 and f is zero at the extremes of the range, 
i.e., f(a, 9) =0=f(, 8), then 


b 
2 { fax = I. F ax f(a, 0) + f(b, 0) 2 


: g L(x, 8) exists, 


=> 2 ; fdx= : 2 a, since f (a, 8) = 0 =f (6, 6) 


(4) The conditions of uniform convergence of integrals are satisfied so that 
differentiation under the integral sign is valid. 


2 
(5) (8) = e| {2 log L(x, oo} , exists and is positive for all Oe 9. 


Let X be ar.v. following the p.d.f. f(x, 6) and let L be the likelihood 
function of the random sample (x), x», ..., X,) from this population. Then 


L=L(x,9)= ats f(%in 9) 
Since L is the joint p.d.f. of (21, X2, +..3X_)> 
J Lex, 0) dx = 1, 


where Jdx= ff... f dey dey... de 


Differentiating w.r. to 6 and using regularity conditions given above, we 
get: 


fSca- 0 = (Sloe \eax = 0 
=> E Diop h =0 (15-33) 
. 00 ooe 
Let t = ¢ (x1, X2, ..., X,) be an unbiased estimator of y(@) such that 
E() =y(0) = Je.Ldx= 6) ,.(15-34y 


—, w.T. to 8, we get 


fr.% So dx =(0) = (% log L x= (8) 
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= E( ; SZ tog L}=7'@) ..€15-35) 
Cov | t 216 L el oa lo an E(t).E 2 16 L 
‘90 08 - 99 78 ~ (99 °8 
= (8) [From (15-33) and (15-35)] 
We have : 


[> Y] <1 = [cov(x%,y] <Var(X). Var (Y) 


| cov () $ tog L}F < Vart. Var (Ste L | 
=> [y’(e)}? < var «| E {2108 LP {e is log Ly 
=> [y‘@] s var 1.£| (2108 L}| (Using (15-33). ...(15-36) 


2 
=> Var(t) 2 Wor ..-(15-36a) 
e|(Zue x) 
which is Cramer-Rao Inequality. 


Corollary. If ¢ is an unbiased estimator of saremciee 0 i.e., 
E®”®=6 => 60)=9 => y(O6)=1, 
then from (15-36a), we get 


Var(t) > Te - 7..(15:37) 
(2 log J] | 
where 
a IQ) =E ((2 log }| (15374) 


is called by ‘R.A. Fisher as the amount of information on 0 supplied’ by the 
sample. (x;, X2, ..., X,) and its reciprocal 1//(6), as the information limit to the 
variance of estimator ¢ = t(x;, X2, ..., Xp). 

Remarks, 1. An unbiased estimator ¢ of (9) for which Crames-Rao 


lower bound in (15-32) is attained is called a minimum variance bound (MVB) 
estimator. 


2. We have : 
I) = e|(2 log L i =-E fe log L| .--(15-38) 
avd 1(0) =n| & -_ 
a (8) =n 6 log f(x, 9) a 562 log f .» 15-38) 


Proof. We have proved in (15-33), 
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ce Sp log L ) =0 (*) 
E\39 g a 
Also 


Boers) dle) «]- Gee) 
(Seslog Ls ~2) 76 1O8 L).L 39 108 L 70 
— LO] oe oO 
-2/(3 oe 1) L| (2 log Ly 
Integrating both sides w.r. to X = (x1, X%, ..., X,), We get 


ig 0. r) ’) 
e(Zplos 1) = = 76 - E (St0gt )-£ (Soe 2} 


=—E (S log Ly [Using (*)] 


= 10) =E (Slog L} e+e (= log L), 


a form which is more convenient to use in practice. 


Ly n 2 
Also 18) = e|(2 log L}| =E } z Z log f(x;, 0) | 
| * fa 2 
=E 2 FE log f(x; oy} 


. 0 
+ tere €2 log fi, 2) (s log f(x;, | 


=n.E E log f(x, »] ; [On using (*)] 


since x;’s;i= 1, 2,..., m are i.i.d. r.v.’s. 
15-7-1. Conditions for the Equality Sign in Cramer-Rao (C.R.) 


Inequality. 
In proving (15-32) we used [c,f. (15-36) that 
ty’(6)]? SE [¢-(0)}? .E & log a .. (15-39) 


The sign of equality will hold in C.R. Inequality if and only if the sign of 
equality holds in (15-39). The sign of equality will-hold in (15-39) by Cauchy 


Schwartz Inequality, if and only if the variables [¢ - y (6)] and & log L Jar 


proportional to each other, L.é., 
t- (8) = x = X(0) 
36 log L 
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where X is a constant eae Of (x), Xo, ..., X,) but may depend on 6. 


g log L Ae [+~y(0)] A@) (15-40) 
where A=A(®) = 1/[A(0)] , say. 


Hence a necessary and suffi cient condition for an unbiased estimator t to 
attain the lower bound of its variance is given by (15-40). 


Further, the C-R minimum variance bound is given by : 
Var () = [y'(6)}2 JE (2 too 7.) ? 
a= Or Je E log L 1 (15-41) 
2 
But E & log Lf = E|A(6) ; {1 - (8) } ] [From (15-40)] 


2 2 
= [A)] .E[r-6)] 
| =[A@] . Var 
Substituting in (15-41), we get 


2 
Var() <—V@] _. 
[A(@)] . Var (2) 
=> Var (t) = 7 = | y’(0).A(6) | ...(15-42) 


Hence if the likelihood function L is expressible in the form (15-40) viz., 
o SO) s. 
BY.) log L= VG) — [¢ af 7(6)] . A(8), 


then 
(i) tis an unbiased estimator of y(8), 
(ii) Minimum eso Bound (MVB) estimator (t) for (0) exists, and 


(iti) Var(t) = | ro A(6) - D | bya wel 


The importance of this result lies in that fact that C.R. inequality, in 
addition to find if MVBU estimator for y(8) exists, also gives us the variance of 
such an estimator, which is given by (15-42): 

Remarks 1. If y(6) = 0, i.e., if ¢ is an unbiased estimator of 6, then 
(15-40) can be written as : 


ce _ 1-8 
50 log L = rT »-(15-43) 
Hence if (15-43) holds, then ¢ is an MVB estimator for’ with 
Var (t) =124.(0)!=1/1A(6) Tt ...(15-43a) 
2. We have aici in (15-40) that an MVB estimator ial for 0) if 
4-18) 
Slog L = = [1-0]. 7 A(t) 
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where A =A(6), say. If we write 
Daa 
J x dQ = a(6) , 
then integrating (*) w.r. to 8 (by parts), we get 
log L = [¢-(0)] a (0) + f a(0). y’(0) dO +k (x) 

=> logLl =[t--y(6)] a (0) + B(O) +k (x) ...(15-44) 
where 0(6) and B(6) are arbitrary functions of 6 and k(x) = k(x, x2, ..., Xp), iS 
an arbitrary function of x,’$ iridependent of 6. 

Hence log f(x, 6) = [¢—(6)] A,(6) + B,(6) + k(x) 

=> ix, 9) = g(x). h(0) . exp [a (6). w (x)] ...(15-44a) 
which is the necessary and sufficient condition for the existence of a sufficient 
statistic [cf Koopman’s form, Equation (15:30) in Remark 3 to § 15-6)]. 
Hence an MVB estimator for y(8) exists if and only if there exists a sufficient 
estimator for y(@). 

This suggests that in our search for an MVB estimator for y (8),we need to 
confine ourselves to sufficient estimators of y (8) alone. 
This explains why the method failed in the case of Cauchy population [c//f. 
Example 15-19], where no sufficient estimator exists and its success in the case 


of normal population [cf Example ae where x is sufficient for and 
n 


Example 15-20, Y x;/n is sufficient for 07). 
i= 


Example 15-18. Obtain the MVB estimator for pt in the normal 
population N(, 07), where o? is known. 

Solution. If x;, x2, ...,X, iS a random sample of size n from the normal 
population, then 


_7 : oe iy _ 2/9 e2 
Lo Th fen) = (—=)-2| 2 Gi w) por} 


log L =-—nlog (V2n 9)-555 2 (x; -— p)? 


= Ai . 2 
=k-33,%, (x; -— LL) » 
where k is a onstant independent of 1, (o being known). 


. log L = “368 a (2c; - u)(-1)] 


which is of the form (15-40). 
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A 2 
Hence X is an MVB unbiased estimator for 2 and V (1) = V(x) = = : 


Example 15:19. Find if MVB estimator exists for @ in the Cauchy's 


population : 


1 
FD = 5 TEGO 


—eo<nX<oo | 


Solution. Here 


a 1 7 1 
= Haw oe(& J eacee 


log L =-nlogr- 2 log [ 1 + (x;~-9)7] 


(x; 


sp log L =2 x ligwae? er] 


7 30 


Since this cannot be expressed in form (15-40), MVB estimator does not 
exist for 6, in the Cauchy’s population and so ‘Cramer Rao lower bound is not 
attainable by the variance of any unbiased estimator 9. 


Example 15-20. A random sample x,, Xo, ....X, iS taken from a normal 
AR 
population with mean zero and variance 0”. Examine if ¥ x;/n is an MVB 
i=l 


estimator for 0°. 
Solution. Since X ~ N(0, 07), 


1 x? 
F o2 = 7 (- a) »~- eo <xX <0 
ee oV2n "PT 20? ‘ 


= Th ft, 0 =(—EY exp! - 5 G20? 
L TI fe 07) es exp PF «preo?| 


wi SS 2 2_ 1. 2 
=> logl = 9 log (2) 7 logs tial 


ini 


0 
Ao? 108 L =-55+ ae E x/= 264 


A 
>» xA7/n — 0? 
tal 


~  (204fn) 
which is of the form ? 40). 


x? 
Hence G2 = x —_ ,isan MVB estimator and V(62) = 2 
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a 
Example 15:21. Show that X= > X;/n,in random sampling from 
it 


(1/8) exp (~x/0), 0 < x < 0 
* 
fix, 0) = -{) 0, otherwise eae) 
where 0 < @ < co, is an MVB estimator of @ and has variance 67/n. 
Solution. Let x, x2, ..., X, be a random sample of size n_ from 


population with p.d.f. in (*). Then 
= TT Ae 8) art exp E z x; jo | 


=> log L =~n'log 0-2. L x; 
ie 


Land 


2 
i 
1 =-e| & loge |=-4 +2 @5 r E(x) 


és 1 


In sampling from exponential population (*), we have mS 


E(X) =® and Var (X) = 62 .o(**) 
ee ae errs 
(®) = etg:,2, (-- x,’s afe ii. d) 

=~ Gt GF 


Also 78) =6 => yO)=1. 
Hence Cramer Rao lower bound to the variance of an unbiased estimator of 


@is:' 
‘Or _1 & eas 
“G6 WA" — 
Consider the estimator X =+ r Xj. 
jul 
ee | 1 d 
We have: E(X)=" E B=! 2 (0)=6 
= X is an unbiased estimator of 6. 
3 2 
Also Var (X)= o . = “atk = = (From (**)] 
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Thus we see that Var (X) coincides with the Cramer-Rao lower bound 


obtained in (***), Hence X, the sample mean is an MVB unbiased estimator, for 
0. 


Aliter. A more convenient way of doing this problem is as follows : 
We have 


d n 1 e j a aia 
-x~@_X-@ |, 

="(@yn) ace) °° 7 

which is of the form (15-40). 


Hence X is an MVB unbiased estimator of @ and Var (X) =A (0) = 62/n. 
Example 15:22. Given the probability density function 
f(x: 0) = [n{1 + (& -— 6)?}]"!; — 00 < x < 00, —-00 < 8 <0 woo(*) 
show that the Cramer-Rao lower bound of variance of an unbiased estimator of @ 
is a , where n is the size of the random sample from this distribution... 
[Sri Venkateswara Univ M.Sc., 1992] 
Solution. logf =-— log x— log [1 + (x- 6)?] 


dlogf __—2(x~ 8) 
06 2S ss [1 + (x - 6)?] 


lo _ 4(x=0)? 
EC) - | te (I+ (x oes A ax 


i |" _4@-6) _ 
“we Jie [1+ @-6)?}3 


0 

_2 “2 2 8 2 4 

= 4 sin’ cos*¢ do = (cos*@ — cos"¢) do 
R Jo m Jo 

E m 3.1 x 
; eee 2 42 ° 2 


( Using reduction formula for \ cos’ x dx ). 
0 
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_8|x 31 
~el4 16| 2 


Hence Cramer-Rao lower bound is 


an GET eee ee Gp Gewese ge 


Examp, , 15:23. Prove that under certain general conditions of regularity 


to be stated clearly the mean square deviation E ( 8 — 6)? of an estimator 6 of 

the parameter 0, can never fall below a positive limit depending only on the 

density function f (x, 8), the size of the sample and the bias of the estimate. 
Solution. We have proved Cramer-Rao’s ‘inequality 


2 
V (6) 2 OE where E (6) =v (0): mo 


Now 
E(6 - 0)? =E[6 ~ (0) + w(d) — 67 
= E[6 ~ y(0)}? + [8 — w(6)}2 + 21y(0) - 0} . ELS —y(6y) 
= V(8) + [0 - ye)? 
E (6 -0)? > — [o- yO? (Using)... 
Let 6 be a ‘biased’ estimator of @ with bias given by (6) 


i.e., E(@) =6+b(0)= y(8), (say). 
es y(@) - 8 = (8) 
From (**), we get 
E + $00) 
E(6 —6)? 2 Ea nie | + [b(6)]*> 0, 
1) 


where 10) =n | (Sloe f} fe 8) dx>0 


— 


This proves the result. 


15-8. Complete Family of Distributions. Consider a statistic 
T = (X;, X2, ..-» %q), based on a random sample of size n from the population 
fix, 9), 8 € @. The distribution of the statistic T will, in general, depend on 0. 
Hence corresponding to T, we again have a family of distributions, say, 
(9(t, 8), 8E O}. : . 


Definition. The statistic T =¢ (x), or more precisely the family of 
distributions {g (¢, 0), 6 € @} is said to be complete for 0 if 


Eo [A(T)] =O for all® => Po [h(T)=0]=1 ...(15-45) © 
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i.e., Jj h(t) g(t, 9) dt=0 for allOe @ 


---(15-45a) 
or x h(t) g(t, 8) = 0 for all OE 9 
8 


=>  h(T)=0, forall ® € ©, almost surely (a.s.). ..(15-458) 


The concept of complete sufficient statistic is specially useful in Rao- 
Blackwell Theorem [c/f. § 15-9]. 


Example 15-24, Let X,, X, ....X, be a‘random sample from Bernoulli 
distribution : 
67 (1 —- 6)'-7;x=0, 1 
fix, 8) = 


0 , otherwise 
a 


Show that & X;, is acomplete sufficient statistic for 0. 
j=l 


Solution. The likelihood function of the sample (X;, X2, ..., X,) iS given 


a- Dx; 


A p> Fi 
by : L = fh sou0={6 (1-6) '° x 1 


=28 [¢(x), 6] hk (x1, XQ ove Xn) 
where i(x) = = x; and h (x, 2X, ....X,) = 1 


Hence by Factorisation Theorem, T= + X;, is sufficient estimator of 8. 
jel 
Since X;’s are i.i.d. Bemoulli variates with parameter 0, 
a 
T= > X;~B (n, 6), 
rs | 


with p.m_f. 
*C,6*(1-6)*-*, K=0,1,2,...,2 


P =k)= 
een { 0 , otherwise 


E,[h(T)} = a h(k). P(T=k)= > h(k). "C, OF (1 — 6)*-* 


. = Ak). OF(1-0)*-#; AM HAW. PC, «..*) 


; A(0) (1 — 6)* + A(1) 0(1 — 6)*-! +... + A(n). 6" 
Now i 

E,[h(T)] =0 for all@ Ee O©= {6:0<86< 1) 

=> A(0) (1 - 6)" + A(1) 6 (1 ~ 6)?-1 +... + A(n) O* = 0, VO 

=> A(0) +A, [6/1 -6)] +... +A(m) [0/1 -6)"=O0V Ge [0,1] .,.C**) 
= A(0) = A(1) = A(2) =... = A(@) 0, | 
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nce a polynomial of degree n in x is identically zero (for all x), if all the 
coefficients are zero. 


From (*) and (**), we get 

h(k) = 0,k =0,1,2,...,0 
=> h(t) =0, t= 0,1,2,...,2 
Hence T is a complete (sufficient) statistic for 6. 
Example 15:25. Let X,, X2, .... X, be a random sample of size n°from 

N (9, 1) population. Examine if T = t(x) = X,is complete for 6. 
Solution. We have T = X; ; @ = {8 :-—00 <6 <0} 
Eg [h(T)] =9 


oo te 012 
= | hue“ au =0,forallee @ 


=> | (nu) } e™ . du=0, forall © 


This is a bilateral Laplace transform in 6. Since these are unique : 


h(u).e" ? = 0.4.8. 


=> h(u) = 0, a.s. 
=> P (hk) = 0) =1, VOeE 8 
=> T = Xj, is complete statistic for 8. 


Remark. It can be easily seen that T, = > X;, is a sufficient estimator 
of § and-since T, ~ N (8, 1/n), by sroceedina ae the above problem, we can 
prove that T, = ; z X;, is a complete sufficient statistic for 9 and the family of 
distributions (2, (t;, 6), 6e e} , is complete. | 


Example 15-26. Let X;, Xz, ..., Xx be a random sample from N(0, 8). 
Prove that T = X; is not a complete statistic for @ but T, = X;° is complete for - 
6. 


Solution. Here T = 1(x) =X,;@= {0:0<0<0} 
E, [h(T)] = 0, forall Oe @ 
=> | h(u) exp [- u2/(20)] du = 0, for all 8 € @ 


_ This holds only for all odd functions h(u) of u, for which the integral exists 
4é., for all functions s.t. 
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h(u) =— h(-u); for all u 
=> hu) #0, as. 
=> T =X, is not complete statistic for 0. 
Let us now consider the statistic T; = X,?. . 
E, [A(T,)] =0, for all 0.e @ 


=> i, h(x”) exp (-x?/20) dx = 0, for all 8 € © 


a 


= I AW) exp (— u/20) du =0,V 6€ @ 
wee Nu 


This being a Laplace transform in (1/6), we have 


h(u) 


=0, a.s. 
Vu 
=> h(u) =0, a.s. 
= T, =X,2, is complete statistic for 9. 


Remark. We can easily see that 7, = X,7, is sufficient statistic for 0. 
Hence T; = X;? is a complete sufficient statistic for 0. 


Example 15-27. Let X;, X2, ....X, be a random sample from uniform 


U[0, 8], 8 > O population. Show thatT = max (X;) = Xiq), is a complete 
lstgsn 


sufficient statistic for @. 
Solution. T =Xqy has p.d.f. 


nae. 
a(t, 8) -| 7 eel ces 
0 ” , otherwise 
E, [h(1)] =0, for all 0 € 6 = {6:0<0<-} 
= ae 7 h(u). u*-' du=0, for all Oe @ 
0 


Differentiating w.r. to 8, we get from the fundamental theorem of integral 
calculus : 


h(0).0*-!=0, VOe @ 


=> h(T) = 0, a.s. 

=> T= max (X), Xo, ..., Xq) = X¢m), iS Complete for 9. 

We have also proved in Example 15-13, that 7 = X,,), is sufficient for 0. 
Hence T = Xq , is complete sufficient statistic for 9. 


s 15:9. MVU and Blackwellisation. Cramer-Rao inequality 
(c.f. § 15-7) provides us a technique of finding if the unbiased estimator is also 
an MVU estimator or not. Here, since the regularity conditions are very, strict, 
its applications become quite restrictive. More-over MVB estimator is not the 
same as an MVU estimator since the Cramer-Rao lower bound may not always 
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pe attained. More-over, if the regularity conditions are violated, then the Icast 
attainable variance may be less than the Cramer-Rao bound. [For illustration see 
Example 15-30]. In this section we shall discuss how to obtain MVU estimator 
rom any unbiased estimator through the use of sufficient statistic. This 
technique is called Blackwellisation after D. Blackwell. The result is contained in 
the following Theorem due to C.R. Rao and D. Blackwell. 
Theorem 15:9. (Rao-Blackwell Theorem). Let X and Y be random 
yariables such that 


E(Y) =y and Var (Y) = oy? >0 


Let E(Y\X =x) = (x), then 
(i) E[XX)] =u 
and (ii) Var [¢(X)] < Var (Y) 


Proof. Let fy (x, y) be the joint p.d.f. of random variables X and Y, f; (.) 
and f2 (.) the marginal p.d.f.’s of X and Y respectively and h(y |x) be the 
conditional p.d.f. of Y for given X = x such that 


h(y |x) er | 
E(Y|\X =x) = \- y. h(y | x)dy 
\- y"A@ ® 
“rip f y fix, y) dy = $(x), (say) .-(15-46) 
=> | y flx, y)dy = O(x). fr) p+ (15-46a) 


From (15-46) we observe that the conditional distribution of Y given X = x 
does not depend on the parameter 1. Hence X is sufficient statistic for. 
Now 


E[o(X)] = E[E(Y | X)] = EY) =p, .. (15-47) 
which establishes part (Z) of the Theorem. 
Wehave — 


Var (Y) = ELY - E(¥)}*= ElY -—-p}* 


= E(Y — 9X) + o(X) - p]? 
= E(Y — 9(X))? + E[QX) - p}? 
+ 2EL(Y-KX)} (WX)-p}] —_...(15-48) 
The product term gives 
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EL(Y - 0) (0) -u)] = | | (y - #2) (GSH) flex, y)dedy 


| | (y --1(x) [O(x) — WI Si(x) Ay | x)dxdy 


40) - | | [y — o(x) JAY dy dx 


But { ° Ly — A(x)]h(y | x) dy =0 [-» E(Y 1X = x) = $y) 


E((Y — $(X))(@X) — W)] = 0 
Substituting in (15-48), we get 


Var (Y) = ELY - (X) I? + Var [W(X)] ,..(15-49) 
= Var Y > Var [9(X)] (.- ELY - 9(X)I2 20) 
= Var [¢(X)] < Var Y,. ,..(15-49a) 


which completes the proof of the theorem. 
Remarks. 1. From (15-49), it is obvious that the sign of equality holds 


in (15-49a) iff 
E(y —-9(X)] = Q 
=> Y—-(X) = 0, almost surely. 
i.e., iff P{(x,y) :y - &)=0) =1 (15-50) 


2. Here we have proved the theorem for continuous r.v.’s. The result can 
be similarly proved for discrete case, replacing integration by summation. 

3. Rao-Blackwell theorem enables us to obtain MVU estimators through 
sufficient statistic. If a sufficient-estimator exists for a parameter, then. in our 
search for MVU estimator we may: restrict ourselves to functions of the 
sufficient statistic. The theorem can be stated slightly different as follows : 

Let U = U(x), X2, ..+» Xn) be an unbiased estimator of. parameter y (8) and 
let T = T(X;, x2, ...; Xn) be‘sufficient statistic for y(@). Consider the function 
$(T) of the sufficient statistic defined as 


Wt) =E(UIT =1) (15-51) 
which is independent of 8 (since T is sufficient for y(@)). Then 
ET) = (8) 
and Var WT) sVar (U) ..(15-52) 


This result implies that starting with an unbiased estimator U, we can 
improve upon it by defining a function WT) of the sufficient statistic as given 
in (15-51). This téchnique of obtaining improved estimators is -called 
Blackwellisation. 

If in addition, the sufficient statistic T ts also complete, then the estimator 
QT) discussed above will not only be an improved estimator over U but also the 
best (unique) estimator. We state below the relevant theorem. 

é 
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Theorem 15-10. Let T be a complete sufficient statistic for (0), 9E 8. 
Then WT), the function of T defined in (15-51) is the unique unbiased estimator 
of y(9). . 

Combining the results of the two Theorems 15-9 and 15-10, we have the 
following result. 


Corollary. If T is a complete sufficient statistic for y(@) and if we can 
find some function of T, say 9(T), which is unbiased. estimator of: (0), then 
g(T) is the MVU estimator of y(@). 


Example 15-28. Let X;, X>, ....X, be a random sample from N(@, 1). 
Ol tain MVUE of 8. 


Solution. it can be easily proved (c.f. Example 15-25] that the statiStic 
T =X, +X2+...+X,= Dee 
is complete sufficient statistic for 6. ~ 
Consider X,= . z X; = —= (1), (say) 
Since xX, = 9(T), is unbiased estimator of 6, by corollary to Theorem 15-10, 
X,, is MVUE of 0. 


Example 15°29. Let X;, Xz, ...,X, be a random sample from U[0, 6] 
population. Obtain MVUE for 8. 


Solution. We have seen that in sampling from U (0, 8] population, the 
statistic : 
T=Xw= max. (Xi) 
1sisa 
is sufficient (Example 15-13) and complete (Example 15-27) for 6. Also 
E(T) = E[Xqy] = ( a Je [See Example 15-30] 
oe p|@+UT] 9 


Hence by corollary to Theorem 15-10, [(n +, 1)T/n] = [(n + 1) Xqqy/n] is an 
MVU estimator of 60. 
Example 15°30. Given : 


S(x, 8) =5. 0<x< 8, 8>0 (8) 


= 0, elsewhere, 
compute the reciprocal of: 


; A| 2 tos fe. op 


and compare this with the variance of (n + 1) Y,/n, where Y,, is the largest item 
of a random sample of size n from this distribution. Comment on the result. 


“a 
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Solution. log f(x, 8) =-log 8 =. <3! log f= 5 
- {0 
=> n'#( logs) = nel gs )- z 
Hence reciprocal of n aE: log f(x, @) TF a (**) 


For ‘the rectangular population (*), the p.@f. of nth order statistic (the 
largest sample observation),Y,, is 


2) =n. (FO, OF" fly. ©) 
where: F(x, 8) =PaSs)= [, fuydu= [ sat 


-~1f 
8g) = n(% J are y*-'s0sy<0 


8 n nd” 
ry — r =— r+n-1 = 
Be) = | y . ay)dy ore yr ay ee. 
Taking r = 1 and 2, we get 


_ 70? " 
EY,) = n+ 7 E(x") = n+2 ee) 
Now E|2 7s l ae as ntl EY) = ) [Using ***] 
=> (n L. 1)Y,,/n is an ee estimator of 9. 


“ie LY, | -(2+4 LF. Var (Y,) 


n+ nth [EY,2- (EY,] 


x(ee a) fia a] ag my 


n+2 (n+ 1)? 


_g|(n+1? ,|__ @ 6 
-o| ~ 1|- 85 < 


n+ 1 oO 
=> Var| par r.| S [| (2. log f 
Hence (n + 1)Y,/n is an MVUE. 


Remark. This example illustrates that if the regularity conditions 
underlying Cramer-Rao inequality are violated, then the least attainable variance 
may be less than the Cramer-Rao lower bound. 


EXERCISE 15(a) 


1. What do you understand by Point Estimation ? Define the following 
terms and give one example for each : 
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(i) Consistent Statistic 
(ii) Unbiased Statistic 
(iii) Sufficient Statistic 
(iv) Efficiency. [Dethi Univ. B.Sc. (Stat. Hons.), 1987, 1982) 
2. What do you understand by Point Estimation ? When would you say 
that estimate of a parameter is good ? In particular, discuss the requirements of 
consistency and unbiasedness of an estimate. Give an example to show that a 
consistent estimate need not be unbiased. 
[Delhi Univ. B.Sc. (Stat. Hons.), 1992, 1986] 
3. Discuss the terms (i) estimate, (ii) Consistent estimate, (iii) unbiased 
estimate, of a parameter and. show that sample mean is both consistent and 
unbiased estimate of the population mean. 
[Calcutta Univ. B.Sc. (Maths. Hons.), 1986] 
4. (a) If 5,2, 592, ..., 5,2 are r sample variances based on random samples of 
SIZES NM), No, ..., Np respectively, and if T is some statistic given by 
_ mys 2 + noSo? +... + 1,5,2 
= ——— -, 
for estimating o? as an unbiased estimator, find the value, of a, supposing 
population is very large and for every sample 


2 at ¥(x;-FP 
Ans, a ='(n, + Mo +... +,)- TF. 


(b) If X,, X>, X3, ..., X, are the sample means based on samples of sizes 
N,, Mz, Ny, ..., m, respectively, an unbiased estimator, 


7 mX, + n2X> ye n,X, 
~ k 
has been defined to estimate Lt. Find the value-of k. 
Ans. k=n, +o +.:. + M,. 
5. (a) For the geometric distribution, 
fix, 0) =6 (1-6)*-!, @=1,2,...),0<6<T, 


Obtain an unbiased estimator of 1/0. [Ans. E (x y= 1/0] 

(b) The random variable .X takes the values 1 and 0 with respective 
probabilities © and 1 —6. Independent observations X,, X>, ....X, on X are 
available. Write & =X, +X,+...+X,. 

Show that & (n - &)/n(n — 1) is an unbiased estimate of 6(1 — 6). 

6. Show that if T is an unbiased estimator of a parameter 0, then 4,T + A, 
is an unbiased estimator of’A,6 + Az, where A, and 2, are known constants, but 
T? is a biased estimator of 62. 

7. For the following cases determine if the given estimator is unbiased for 
the parametric function. When it is biased, derive an unbiased estimator from it. 


x is the sample mean. 
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(a) X,, ...,X, is a random sample from a distribution with variance o?; The 
= 2 
estimator n=! [(x, -x)2+...+(%,-x) ] 
is ‘used to estimate 6. 
(b) X,, ....X, iS an independent sample from an exponential distribution 


=1. 
with méan 9, The estimator {1 —- + is used to estimate exp (- 3 when 
nX 
nX > 1 and zero is used when nX < 1. 


(c) r successes are observed in n Bernoulli trials with success probability p. 
(r/n)? is used to estimate p*. 


8. f(x 3B, 0) = b exp|- (z=u)), UW Sx<oo, ~co << oo 
and 0 <6 <0 
Obtain 
() an unbiased estimate of |) when o is known, 
(ii) an unbiased estimate of o when 1 is known, 
(ii) two unbiased estimators of 6” when tt is known. 
Hence obtain an infinity of unbiased estimators’ of o? in this case. 
[Hint. The exponential distribution has mean jt + 6 and variance 67) 


9. Suppose X and:Y are independent random variables with the same 
unknown means ut. Both X and Y have variance as 36. Let T = aX + bY be an 
estimator of 1.- 

({) Show that T is an unbiased estimator of 1 if a + b = 1. 
2 


(ii) Ifa=;andb=2, what is the variance of T ? 


(iii) Ifa= >and b= > what is the variance of T ? 


(iv) What choice of a and b minimizes ‘the variance of J’ subject to the 
requirement that T is an unbiased estimate of 1? 


10. (a) Examine the.unbiasedness of the following estimates : 


@ s= 7 x (x;-x 


| 
(i) s)°= - ; z (x; ~ 1)? (where 1 is known), 


for o”, the population vanance. 
(Dethi Univ. B.Sc. (Stat. Hons.), 1982] 


Ans, E(s,2) = (" - Joe # 02, (i) E(s,*) = 02. 


(b) Let x1, X2, .... X, be a random sample of size n drawn from a population 
with mean }1 and varlance o?. Obtain an unbiased. estimator for 1”. 


1 
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= n — 
Hint. E(x) aE > fon: Var (X) = 02/n 
és} 


E(X2) = Var (X) + [E(X)}? =p? + (07/n) 
Ans. X?2 —(o2/n), if? is — : 


and X2 — (S2/n) = X2— . (x; - %)2, if o? is. unknown. 


am n(n — 1) (2 


11. If X,, X2, ...; X, is a random sample of size n from N (j1, o”), where 
pt is known and if 


1 a 
T=s- >> l X = ul 
Nj-wl 
‘examine if T is unbiased for o. If not, obtain an unbiased estimator of o. 
P [Delhi Univ. B.Sc. (Stat. Hons.), 1987 
a 
Hint. E(T) = + Z EIX;-pl=eVQip.«, 
ial 


since, for N(t, 62), Mean Deviation about mean = 1 (2/x) o 
Ans. No; ‘ (1/2) T. 


12. If x;, Xp, ...,X,_ iS a random sample from the population 


f(x, 9) = 0 + 1) x93 O<x< 1; O>-1 sui) 
show that pee SY 1 | is unbiased estimator of 6. 
DX log x; 


Hint. In-sampling from (*), U =- log X has an exponential distribution 
with parameter (6 + 1) 


iid. 
=> p=~logX; ~ y(0+1,1); i=1,2,. 
= Q@+1 
= =- 2 logx; ~y(@+ tay EWM = 7 


13. Suppose XSpas a truncated Poisson distribution with p.m_f. 


-— 0) . 6% 
fe, o-| [l-exp(-O@)]}x!° ~*~ 1,2,3, 
0 otherwise 


Show that the only unbiased estimator of [1 = exp (— 6)] based on X is the 
Statistic T, defined as: 


0, when x is odd 
2, when x is even 
Note. This is an Example of absurd unbiased estimator. 


T(x) = 
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14, Consider a random sample X;, X>, X3 of size 3 from uniform p.d_f. 


K =| hercine 
ance, , otherwise 


Show that each of the statistics 4X1), 2Xg) and xX (3)» where Xj is the ith 


order statistic is an unbiased estimator for 6. Find the variance and hence the 
efficiency of each. 


15. Obtain an unbiased estimator for (i) 6, and (ii) 6, in case of binomial 
probability distribution : 
f(x, 9) = *C,, O (1 — 0)*-*%3x=0,1,2,...,.2;0<6<1. 
Hint, E{ = )=0; £| 72=) |<, 
n n(n-1) J 
If we write T = x/n, the observed proportion of successes then 


: 2 = 
Ba =0; 27%) =" +(*=+) eve 
This illustrates that we may have : 
t, unbiased for 6 but ¢,27 not unbiased for 6. 
16. Define ‘efficiency of an estimator’. 


X is a uniform random variable with range [0, 9). x,, x2, .... X, are 
independent observations on X. Define 


6, =? (xX + 2X2... + Xe) é, =| 2+ Dl max (1, X2, .--> Xp). 


ry . 


cw 


Show that 6: and & are unbiased for 9. Evaluate their relative efficiency. 


17. (a) The observations x1, x2, ..., %, Tepresent a random sample from a 
uniform distribution over the interval (0, 6), where 9 is an unknown parameter. 
The statistics X, m and M are the mean, the smallest value and the largest value 
respectively for the sample. Find. values for & so that, k¢ is an unbiased 
estimator for 9 where 

(a)t=X 
‘(b) t=M, 
(c)t=M-m, 

Of the three unbiased estimators which is the best ? Give your reasons. 

(b) Let X,, X2, ..., X_ (Nn > 2) be a random sample of size n from the 
distribution having density function. : 

f(x; 9) = Ox®-! ,O<x<1, 6>0 
- 1 


IfZ=—— x log X;, show that a. 7 is an unbiased estimator for 6 and 


its efficiency is (nm —2)/n. 
Hint. See hint to Question 12. 


18. (a) Suppose X;, X2, ....X, are sample values independently drawn 
from population with mean m and variance o?. Consider the estimates :— 
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y XX, +Xot... +X, z= X, + 2X. + 3X3 4+... +X, 
oo n+] Cele n? 


Discuss whether they are unbiased, consistent for m. What is the efficiency 
of Y,, over Z,, ? 

(b) Let X;, X2, X3 and X4 be independent random variables such that 
E(X;) = » and Var (X;) = 0? for i = 1, 2, 3, 4. 


X,+Xo+X44+Xq4 Xy+Xo+Xx+ Xu 


If + Y= 4 Z= 5 
X,+2X5+X3,-X 
and T= u 5 3 + 


examine whether Y, Z and 7 are unbiased estimators of 44 ? What is the 
efficiency of Y relative to Z? 


(c) Let x), x9, X3, x4, be a random sample from a (it, 62) population. Find 
4 


the efficiency of T=3(x, + 3x, + 2x3 + x4) relative to X= t 2. x). Which is 


relatively more efficient ? Why ? 

19. A simple random sample of size 2 is drawn from a population 
containing 3 units, without replacement. Let y,, yo, y3 be the value of a 
characteristic measured on the three units and let 7; be the estimator of the 


population mean Y for the sample that has units i andj; i,j = 1, 2, 3,7 #/. 
If Ty = Oy + Y2/2, Ty3 = (4/2) + (2y3/3), T23 = O'2/2) + 0'3/3), show 


that 7; 1s unbiased for Y. Find the variance of T,, and hence show that 
the variance of 7;; is smaller than that of the sample mean estimator if 
y3 By2 + 3y; — y3) = 0. {Indian Forest Service, 1991] 

20. Let x, the earnings of a commercial bank, be a random variable with 
mean jt and variance 6. A random sample of earnings of n banks is denoted by 
X1, X25 «++; X». However, because of the disclosure laws, individual bank earnings 
are not disclosed and only the following average values are made available to the 
researcher : 
x) "7 x2 , a5 = x3 _ x4 seep A = — Xn ; 
where #7 is an even number and m = n/2. 

(‘) Devise the best linear unbiased estimator of uw, given the available 
information. What is the variance of the proposed estimator ? 

(ii) Devise an unbiased estimator of o2. [Delhi Univ. M.A. (Eco.), 1990] 

21. (a) Define a consistent estimator. 

Let 7,, be an estimator of ® with variance 6,7 and E(7,,) = 9,. Prove that if 
8, — 8 and o,? > 0, as n > © then 7, is a consistent estimator of 98. 

Hence obtain consistent estimators for : 

(‘) Mean of the normal distribution. 


a); = 
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(ii) Variance of the normal distribution when mean is known. 
[Delhi Univ. B.Sc. (Stat. Hons.), 1989] 
(b) Give an example of an estimator : 
(i) which is consistent but not unbiased, 
(ii) which is unbiased but not consistent. 
[Dethi Univ. B.Sc. (Stat. Hons.), 1988] 
22. (a) State and prove a sufficient condition for the consistency of an 
estimator. Define the invariance property of a consistent estimator and establish 
it. [Delhi Univ. B.Sc. (Stat. Hons.), 1985] 
(b) Given a random sample X,, X>, .... X, from a normal (1, 67) 
distribution, examine unbiasedness ae consistency of 


(i) X for p, (ii) = (X; — X)? for o?. 


23. (a) When would you say ft estimate of a peramneise is good ? In 
particular, discuss the requirements of consistency and unbiasedness of an 
estimate.. Give an example to show that a consistent estimate need not be 
unbiased. 

Show that an unbiased estimator whose variance tends to zero as the sample 
size increases to infinity is consistent. 


(b) Define unbiasedness and consistency of estimators. Let X,, X>, ..., X, 
be a random sample from the N (1, o”) distribution. Propose three estimators of 
jt based on this random sample such that the first is unbiased but not consistent, 
the second is consistent but not unbiased and the third is both unbiased and 
consistent. [Punjab Univ. M.A. (Eco.), 1990] 

24, (a) Define an unbiased and consistent estimate of a parameter in a 
population distribution. 

Prove that for a sample of size n from a normal (m, 1) population, the 
arithmetic mean is an unbiased estimate of m and by Chebyshev’s inequality or 
otherwise, show that the estimate is consistent too. 

[Calcutta Univ. B.Sc. (Maths. Hons.), 1991] 

(b) If X,, Xo, ..., X, iS a random sample obtained from the density 
function : 


show that the sample mean X is an unbiased and consistent estimator of 6 +3 : 


25. (a) Define a consistent estimatur. Let T, , and Tz, be consistent 
estimators of g,(0). and g2(0) respectively. Prove that a7,,,+ 572, is a 
consistent estimator of ag;(0) + bg2(9), where a and b are constants independent 
of 8. 

(b) Define consistent estimator. If the estimator ¢, based on a random 
sample of size n is such that 

E (t,) 29 

and V (t,) > 0, 
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as n —> oe, then prove that ¢, is a consistent estimator for 8. Hence prove that 
sample mean is always a consistent estimate for population mean. 
[Delhi Univ. M.Sc. (Maths), 1990] 

(c) If t, is a biased estimate of paramater 9 based on a random sample of size 
n, and E(t,) = 0+ 5, and if 6, — 0 and V(¢t,) — 0 as n — oo, show that #, is 
consistent estimator of 8. 

(d) Define a consistent estimator of parameter 8. If T is a consistent 
estimator of 9 and if @ is any continuous function of its argument, show that 
g(T) is a consistent estimator of (0). 


a a 
26. (a) Show that x = * y x; and s* = - x (x; -xX )*, are joint 
i=l isl 


consistent estimators for j1-and Go? respectively, if. x,, x2, ..., X, iS a random 
sample from a normal population N(U1, 67). 

Also find the efficiency of ns?/(n — 1). 

(b) Show that if ¢ is a consistent estimator of a parameter 6, then e’ is a 
consistent estimator of e°. 

(c) Prove that in case of Binomial distribution with parameter 9, ¢, defined 
as r/n is a consistent unbiased estimator for 9, but ¢, defined as (r/n)? is 
consistent but not unbiased estimator for 67. 

27. Show that in sampling from Cauchy distribution 

1 


FOOD RIT + (x= OP’ SES O70: 


(i) Sample mean X is not a consistent éstimator of 9. 
(ii) Sample median is a consistent estimator of 6 and its asymptotic 
efficiency is 8/n?. 
28. (a) If T, and T, are consistent estimators of y(®), show that 
a,T, + 427, such that a, + a2 = 1, is also consistent for y(6). 


(b) For a Poisson distribution with parameter 0, show that 1/X is 


consistent estimator of 1/8, where X is the mean of a random sample from the 
given population. 


Hint. Prove that X is a consistent estimator of @ and then use Invariance 
Property (Theorem 15-1). 

29. Define MVU estimator. If T, and T2 are two unbiased estimators of a 
parameter 0, with variances 6,7 and 0,2 and correlation coefficient p, then 
obtain the best unbiased linear combination of 7, and T2. Also obtain its 
variance, [Dethi Univ. B.Sc: (Stat. Hons.), 1990} 

30. (a) Let 7, and Tz be two unbiased estimators of y(6) having the same 
variance. Show that their correlation coefficient p_ cannot be smaller than 
(2 @g — 1), where ég is the efficiency of each estimator. 

Further show that if 7, is MVU estimator and T, is any unbiased estimator 
with efficiency e, then ; 
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V(T, -T>) = (2 2 1) V(T;) 
[Delhi Univ. B.Se. (Stat. Hons.), 1989] 
‘(b) If T, isa MVU for 6 and 7> is any other unbiased esumator of 6 with 
efficiency eg then prove that the correlation between 7, and T> is Veo ; 
[Dethi Univ. B.A. (Stat. Hons.), 1987] 
31. (a) Define MVU estimator. Show that an MVU estimator is unique. 
{[Dethi Univ. B.Sc. (Stat. Hons.), 1985) 
(b) If T, and T> are two unbiased statistics having the same variance and p 
is the correlation between them then show that p 2 2e — 1, where e is the ratio 
of the variance of the best estimator to the common variance of T, and 75. 
[Delhi Univ. B.Sc. (Stat. Hons.), 1992] 
32. (a) Let T be an MVU estimate for y(0) and:T,, T2 be two -other 
unbiased estimators of (6) with efficiencies e, and ez respectively. 
If Po is the correlation coefficient between T, and T>, then 
(e,€2)'? — ((1 — e,)(1 — €2)) 7 < po S (€:€2)!” + ((1 - €1)(1 — €2)} 2. 
[Delhi Univ. B.Sc. (Stat. Hons.), 1993, 1988, 1986] 
(b) Let ¢, and t2 be two unbiased estimates of 6 with variances 6;? and 0,2, 
(both known) and correlation p (known). Consider the estimate 


A 
8=at, + (1 —Q) fo. 


A A 
Show that 8 is unbiased. Find o such that 6 has minimum variance. 
[Delhi Univ. M.A. (Eco.), 1986] 

33. Suppose X and Y are independent unbiased estimates of p. It is known 
that the variance of X is 12 and the variance of Y is 4. It is desired to combine 
two estimators in order to obtain a more efficient estimator: Let T = aX + bY, 
be the new estimator. 

(i) In orderthat T be an unbiased estimator of 11, what conditions must be 
imposed on a and b-2 

(ii) Find the values of a and b that minimize the variance of T subject to the 
condition that T be an unbiased estimator. 

34. (a) What is an-efficient estimator ? 


If T,, Tz are both efficient estimators with variance v and if T = 5 (T; + T2), 


show that variance of T is (v/2)(1 + p), where p is the coefficient of correlation 
between 7; and T,. Deduce that p = 1 anti that T is also efficient. 

(b) If T and T’ bé two consistent estimators of which T is the most 
efficient, prove that the correlation coefficient between them is 


A ry , where V(T) and V(T’) are the variance of T and T’ respectively. 


Show also that the correlation coefficient between two most efficient 
estimators is unity. [Allahabad Univ. M.A. (Eco.), 1993] 
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35. Define a sufficient statistic. Explain the method of finding sufficient 
estimator. If (x,, x2, ..., X,) is a random sample from a distribution : 
f(x, p) = p® (1 -p)'-4;x=0,1 andOsp <1, 
find the sufficient estimator of p. [Madras Univ. B.Sc., 1988) 


36. State the factorisation theorem on sufficiency. Obtain a.sufficient 
statistic for the parameter 6 in the following distribution : 


f(x: 9 =4, 0<x<9. 


(b) Define a sufficient statistic. 
If X,, X2, ....X,_ 1S a random sample from a distribution : 
f(x, 9) = 6* (1-6)? ;x=0,1,0<6<1 

= 0, elsewhere. 

Show that Y, =x, +x. +... + Xn , 1s a sufficient statistic for 9. 
[Madras Univ. B.Sc., 1987] 

(c) Let x;, x2, ...,X, denote.a ee sample from a population with p.d.f 

fix, 9) = 0x®-! ,O<x< 1. ) 
Show that Y = x, x2 ... x,, is a sufficient statistic for 6. 


oy, (a) Let X be a random sample of size one from a normal distribution 
NO, o 


(i) Is X a sufficient statistic for o? ? 
(ii) Is |X la sufficient statistic for o? ? 
(iii) Is X* a sufficient statistic for o” ? (Gujarat Univ. B.Sc., 1992) 


(6) Examine which of the following distributions admit sufficient 
estimators for their parameters : 


'@) fix, 0) = 0x°-' ,0<x<1 
(ii) fixy, p) ‘a - Te (x? - 2pxy + ¥2)| 


38. (a) Show that if a sufficient estimator exists, it is also the maximum 
likelihood estimator. Is the converse true ? Explain. 
(b) Do the following distributions admit of sufficient estimators ? 


(i) fl, 8)= 4340 <x<(kK+1)0, where k is an integer. 
" 1480 
(it) fe, 8) =O ey PA 


39. (a). Prove that if an unbiased estimator and a sufficient statistic exist 
for (8) and the density function f(x, 8) satisfies certain regularity conditions (to 
be stated by you), then the best unbiased estimate of (0) is an explicit function 
of the sufficient statistic. 


Examine if the following distribution admits a eulncient Statistic for the . 
parameter 6. 
f@, 9) =(1 + 6) x®; 0<x<1,0>0 3 
(b) Discuss if a sufficient statistic exists for the parameter 6, in n sampling 
from double exponential distribution with p.d.f. 
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f(x, 8 =S exp (-1x- 6!) ,+0 <x < 00, 


Hint. Proceed as in Example 15-17. 

Ans. No sufficient estimator for 6 exists. 

(c) Obtain jointly sufficient estimators for a and B in a random sample 
X1, X2, ..., X, from the uniform ire with p.d.f. 


fix, 0, B) =5~ aa aSxsB 


= 0 _ , otherwise 

Ans. T,; =X) and Tz =X), are jointly sufficient for a and B 
respectively. 

40. (a) Show that a necessary and sufficient condition for a statistic T to be 
sufficient for 6 is that the probability function fg (x) should belong to an 
exponentially family. 

(b) Let x;, x2, ... x, be a random sample from a distribution with p.d.f. 
fx: 9) =e -&-%), x2 6, ~ 00 < 8 < ©, Obtain a sufficient statistic for 9. 

(Dethi Univ. B.Sc. (Stat. Hons.), 1987, 1985] 

41. Define a sufficient statistic. State and prove the Factorisation theorem 


_ on sufficiency. [Delhi Univ. B.Sc. (Stat. Hons.), 1986] 
42. (a) Let (X;, X2, X3) be a random sample from the probability mass 
function : P(X =x) =6* (1) - 6)'>, (x= 0,1;0<86<11). 


If ¢ =X, + X2_ + X3, show that the conditional distribution of the random 
sample given =r, does not depend on 6. Interpret this result in the light of 
sufficiency-concept. 

(b) Let (X,, X2) be a random sample from a Poisson distribution with 
parameter 9. Prove that t = X; + 2 X, is not sufficient for 9.- 

(c) Let (X;, X2) be a random sample from N (6, 1). If T =X, + Xz and 
U = X,-X;, show that the conditional distribution of U given T = t, does not 
depend on @. Interpret this result in the light of sufficiency-concept. 

(ad) For a random sample X; G@ = 1, 2,..., 2), from an exponential 
distribution with p.d.f. 


1 x 2 
ftx, 9) =p! - 6 ,» x>0, 8>0, 
obtain an unbiased and sufficient estimator for 9. 


, [Delhi Univ B.Sc. (Stat. Hons.) 1983, 1988] 


4@. Prove that under certain regularity conditions to be stated by you, the 
variance of an unbiased estimator T for (8), satisfies the inequality 


Var(T) > ——___VOr_ 
pf 21oese log fo Mu Sas pK P 


. [Dethi Univ. - a (Stat. Hons.), 1992, 1986) 


_ 44, (a) If T is an unbiased.estimator of a parameter 9, based on a random 
sample of size n, prove that 
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Var (T) 2 1/{nI(8)], where /(6) is the information function. 


(b) Show that under certain regularity conditions, an unbiased estimate T of 
a parametric function y(@) attains a Cramer-Rao bound for the variance of 
unbiased ee of w(8), if and only if T satisfies the relation 
dlogL _ni(®) _ 
© = we 67 - vO) 
where L is the likelihood function of a sample of n observations and 


nl(@) = ES d log L gry 


What is the variance of T in such a case ? Show that an estimator T satisfying 
the above relation is unique when it exists. Further a parametric function 
admitting such an estimator T is unique except for an additive and multiplicative 
constant. . (Meerut Univ. B.Sc., 1992) 


45. (a) State and Prove Cramer-Rao Inequality. 
(b) Let X,, Xo, .... X, bea random sample from.a population with p.d.f. 
fix, 0) =O-e-®. x¥>0,0>0. 
Find Cramer-Rao lower bound for the variance of the unbiased estimator 
of 8. {Dethi Univ. B.Sc. (Stat. Hons7,1987}) 
46. f(x, 9) is a probability density function and (x, x2, ..., X,) is a random 
sample from it. Prove that if an unbiased minimum variance bound (MVB) 
estimator .T exists, it must be of the form T = 6 + AX 4 log f(xi, 9), in which 


i does not depend on sample values. 
Show that the variance of T is 4 and is given by 
1 |e 
Write a note on the connection between MVB estimators and sufficiency, giving 
example. 


47. (a) Define Minimum Variance unbiased estimator and Minimum 
Variance Bound unbiased estimator and explain clearly the difference between 
them. Prove that minimum variance unbiased estimator is essentially unique. 


(b) Verify that there mags an bee V.B. estimator for the parameter 9 of the 
distribution : fix, 0) =2£—® y= 0, 1,2,.. 


and hence obtain the value of ce (Marathwada Univ. M.Sc., 1993) 
(c) Show that there exists a parameter function (8) in-the case of the 
geometric distribution : 
f(x, 9) = (1 - 8) 6*;x=0,1,2,..:;0<6<1 
such that there exists an M.V.B. unbiased estimator T of (68). 
Obtain (8), T and V(7). {Agra Univ. M.Sc., 1988] 
48. (a) Define minimum variance unbiased estimator (MVUE). How:is © 


Cramer-Rao inequality useful in obtaining such an estimator ? Derive ‘this 
inequality. 
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(b) Obtain minimum variance unbiased estimator of 6 from a sample of n 
independent observations x, x2, ..., X,,drawn from the binomial ,B (WN, 8) 
population having probability function : 

f (x; 8) =NC, 6* (1-6) -*, x =0, I, 2, ..., N. 

Also obtain variance of this estimator of 9. 

49. (a) If b(8) is the bias in the estimator T of 8, then show that (under 
conditions to be stated by you), 

Er - oy 2 UT FO. (oon)? 
_ where J(0) is the information on 6 supplied by a sample of n observations. 

(b) Prove the following result : 


i erreene | PBT enoes( 


a 
where g(x, 8) is the frequency function in x having the first moment w(6) and 
finite second moment. Discuss when the equality sign holds. 

50. For the ies distribution 


fle, = app xP=! exp -x/6); 0<x<00,8>0, p (known), 


find the expectation of X2. Use it to obtain an unbiased estimator T of 67. Find 
V(T) 

Evaluate Fisher’s information function /(0) about 6? and verify the truth of 

the inequality : . 
V(T) > [n (0%) 

51. State and prove Rao-Blackwell theorem and explain its significance in 
the theory of point estimation. 

Let x}, X2, ....X, be a random sample from Poisson distribution with 
parameter A. Obtain Cramer-Rao lower bound to the variance of an unbiased 
estimator for A. Hence find the M.V.U.E. for A. 

[Delhi Univ. M.Sc., (Maths.), 1990) 
$2. State:and prove Rao-Blackwell theorem and explain its significance in 
point estimation. 

Let X,, X>, ...,X, be a random sample from a rectangular distribution with 

p.d.f. 
; f(x, 9) = 1/0, O< x <9. 

Find MVU estimators of 6 and 36 + 5. 

[Delhi Univ. B.Sc. (Stat. Hons.), 1993, 1987] 

53. Define completeness of a statistic T. Let X,, X>, ..., X, be a random 
sample from uniform population U[0, 6]. Obtain sufficient statistic for 6. Show 
that it is complete. Hence obtain MVU estimator for 9. 

[Delhi Univ B.Sc. (Stat. Hons.), 1988) 

54. Definé a complete sufficient statistic. 
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If T is a complete sufficient statistic for y (6), and E [((7T)] = y(6), then 
show that (7) is the unique MVUE of (6). 

Use this property and obtain MVU estimator of © based on a random 
sample X,, X, .... X, from the distribution with p.m.f. 

9 {* (1 -6)!-*,x=0, 1 

fee) = 0 , elsewhere 
[Delhi Univ. B.Sc. (Stat. Hons.), 1990] 

55. Show that the family ( f(x, 6), 8 € (, 1)} with 

fa. 8) = 2C, O (1 a 6)? -* » x= 0, 1, 2, 

is complete. [Delhi Univ. B.Sc. (Stat. Hons.), 1993] 

56. Let X,, X2, ....X, be a random sample from 


fo &) =410<x<0 forall 0¢ e 


Show that X(,) = max (X,, X2, ..., X,) is sufficient for 6 and {nt Dy = 
(n) rn 7) 


is an unbiased estimator for 0. 
Comment on the result. [Agra Univ. M.Sc., 1988] 
57. Let the random variables X and Y have the joint p.d.f. 


f(x, y) =% exp| - ety) |oczcy<m 


) 
and zero elsewhere. 
(a) Show that : E(Y\x)=x+6 
Obtain the expected value of X + 6 and compare the variance of X + 6 with 
that of Y. [Delhi Univ. B.Sc. (Stat. Hons.), 1992, 1986] 


(b) Show that: E(¥)=20, Var (Y) = 562 
[Madras Univ. B.Sc., 1988} 
58. (a) A random sample of size n is drawn from a Poisson population 
with parameters A. Obtain the minimum variance unbiased estimator of, A. 
[Delhi Univ. M.A. (Eco.), 1992) 
(b) Establish a necessary and sufficient condition for an unbiased estimator 
to be an MVU estimator. 
Let X,,X2, ....X, be a random sample from a Poisson distribution with 
parameter 8. Find an MVU estimator for y(6) = e~° 64 /24. 
59. Define sufficiency of an estimator 
Let Y; < Y2 < Y3 < Y,4 < Ys be the order statistics of a random sample of 
size 5 from the uniform distribution with p.d.f. 


0, elsehwhere 


Show that 2Y3 is an unbiased estimator of 6. Find the conditional 
expectation E [2Y; | Ys] = (Ys), say. Compare the variances of 2Y; and (Ys). 
[Dethi Univ. B.Sc. (Stat. Hons.), 1989} 


Ri 0<x<0,0<0<0 
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60. Let X,, X>, ..., X, bé a random sample from the Bernoulli population 
with parameter 6, 0 < 6 < 1. Obtain a sufficient statistic for © and show that it 
is complete. Hence obtain MVU estimator of 9. 

[Delhi Univ. B.Sc. (Stat. Hons.), 1989] 


61. Show that T= % X;, is a complete sufficient statistic for the 
i=l 


parameter 6 in a random sample X,, X>, ....X, drawn from the population with 
p.af. 


(a) f(x, 9) = 6* (1 -6)!-4;x=0, 1 

=0(0, elsewhere 

fe? OF fe 1 2-0, 15 25-485 
©) fix, 8) -{ 0 , elsewhere 


62. If X,,X>2, ...,X, iS a random sample from N (1, 67), show that : 


(@) T =X, is complete sufficient statistic for 1, (- 2 < 1 <e), when o? 
is known. 


(6) T= > (X;-p)?, is complete sufficient statistic for o?, (0 o% < co), 
i=} ? 


when is known. 

15-10. Methods of Estimation. So far we have been discussing the 
requisites Of a good estimator. Now we shall briefly outline some of the 
important methods for obtaining such estimators. Commonly used methods are 

(@) Method of Maximum Likelihood Estimation. 
(ii) Method of Minimum Variance. 
(iii) Method of Moments. 
(iv) Method of Least Squares. 
(v) Method of Minimum Chi-square 

(vi) Method of Inverse Probability. 

In the following sections, we shall discuss briefly the first four methods 
only. 

15-11. Method of Maximum Likelihood Estimation. From 
theoretical point of view, the most general method of estimation known is the 
method of Maximum Likélihood Estimators (M.L.E.) which was initially 
formulated by C.F. Gauss but as a general method of estimation was first 
introduced by Prof. R.A. Fisher and later on developed by him in a series of 
papers. Before introducing the’method we will first define Likelihood Function. 

Likelihood Function. Definition, Let x, x2, .... X, be a random 
sample of size n from a population with density function f(x, 6). Then the 
likelihood function of the sample values x,, x2, ...,X,, usually denoted by 
L = L(®) is their joint density function, given:by 


L = flx,, 0) fla, ©) .:. fq» ®) = qT fx;, 9). ...(15-53) 
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L gives the relative likelihood that the random variables assume a particular set - 
of values x;, X2, ..., X,. For a given sample x,, x2, ..., X,,L becomes a 
function of the variable 0, the parameter. 

The principle of maximum likelihood consists in finding an estimator 
for the unknown parameter 9 = (8), 95, ..., 8,), say, which maximises the 
likelihood function L(@) for variations ia parameter i.e., we wish- to find 


nA A A A 
§ = (8), 82,..., 84) so that 
A 
L(@)>L(®) V0E9 
A 
i.é., L( 8) = Sup L(®) V 6 eE @. 
Thus if there exists a function = ry (x}, Xz, ..., X,) Of the sample values 
which maximises L for variations in 8, then 9 is to be taken as an estimator of 
0. 0 is usually called Maximum Likelihood Estimator (M.L.E.). Thus @ is the 
solution, if any, of 
a 6 and ot <0 (15-54) 
Since L >0, and log L is a non-decreasing function of L ; L and log L 
A 
attain their extreme values (maxima or minima) ‘at the same value of 6. The 
first of the two equations in (15-54) can be rewritten as 
1 OL dlogL _ 
L‘#. =0 => 6 = 0), ...(15-54a) 
a form which is mucH more convenient from practical point of view. 
A 
If 8 is vector valued parameter, then 6 = ( 61, Oo, bie 8»), is given by the 
solution of simultaneous equations : 
O. ce ee 
30, log L = 36, =~ log L (0;, 92, .... 9, =0; i =1,2,...,k 
.--(15-54b) 
Equations (15-54a) and (15-546) are usually referred to as the Likelihood 
Equations for estimating the parameters. 


A 
Remark. For the solution 6 of the likelihood equations, we have to see 
that the second derivative of L w.r. to 9 is negative. If 6 is vector valued, then 
for L to be maximum, the matrix of derivatives 


00; 00 


15-11-1. Properties of Saciaea Likelihood Estimators. 
We make the following assumptions, known as the Regularity Conditions : 


2 
(t) The first and second order derivatives, viz. aia and ope exist 


and are continuous functions of 9 in a range R (including the true value 9p of the 
parameter) for almost all x. For every 6 in R 


92 
(55.35), - : should be negative definite. 


< F(x) 


Q re 
3 log L| < F,(x) and Ee log L 
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where F(x) and F.(x) are integrable functions over (— °°, 00), ~ | 


(ii) The third order derivative x log L exists such that 


x . log L| <M(x) 


where E(M(Q)] < K, a positive quantity. 
(iii) For every 9 in R, 


(Bs) (Bom) 


| =1(8), 

is finite and non-zero. 

(iv) The, range of integration is independent of 6. But if the range of 
integration depends on 9, then f(x, 6) vanishes at the extremes depending on 0, 

This assumption is to make the differentiation under the integral sign valid. 

Under the above assumptions M.L.E. possesses a number of important 
properties, which will be stated in the form of theorems. 

Theorem 15-11. (Cramer-Rao Theorem). “With probability approaching 


unity as ni — 0, the likelihood equation 30 log L = 0, has a solution which 


converges in probability to the true value @,”. In other words M.L.E.'s are 
consistent. 

Remark. MLE’s are always consistent estimators but need not be 
unbiased. For example in sampling from N (1, 62) population, [c.f. Example 
15-31], 

MLE(p) = x (sample mean), which is both unbiased and consistent 
estime tor of pL. 

MLE(o7?) = s? (sample variance), which is consistent but not unbiased 
estimator of 62. 

Theorem 15-12. (Hazoor Bazar’s Theorem). Any consistent solution of 
the likelihood equation provides a maximum of the likelihood with probability 
tending to unity as the sample size (n) tends to infinity. 

Theorem 15-13. (Asymptotic Normality of MULE’s). A 
consistent solution of the likelihood equation is asymptotically normally 


A 

distributed about the true value 0. Thus, 9 is asymptotically N (0 a as 

Nn —> 00, 
Remark. Variance of M.LE. is given by 


V(6) = i es ees 
1) ( x 
E — 392 log L 


Theorem 15-14, If M.L.E. exists, it is the most efficient in the class of 
such estimators. 


»+6(15-55) 
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Theorem 15-15. If a sufficient estimator exists, it is a function of the 
Maximum Likelihood Estimator. 

Proof. If ¢ = ¢(x), x2, ..., X,) is a sufficient estimator of 6, then Likelihood 
Function can be written as (c.f. Theorem 15-7) 

L = g(t, 9) hy, x2, X3, ..., X,! a 
where g(t, 9) is the density function of t and h(x, X2, .:., X, | 4) 1s the density 
serie of the sample, given t, and is independent of 6. 
log L = log g(t, 6) + log h(x, Xo, ..., Xn! 8) 
Differentiating w.r.t. 6, we get 


o 408 b =F tog ott ©) = w(t, 9), (say), (15-56) 


which is a function of ¢ and 6 only. 
M.L.E. is given by 


otoRL 0 = wi,e)=0 


A 
6 =n (¢t) = Some function of sufficient statistic. 
A 
t 


=> = w(8) = Some function of M.L.E. 
Hence the theorem. 


Remark. This theorem is quite helpful in finding if a sufficient estimator 
exists or not. 


If 2 log L can be expressed in the form (15-56), i.e., as a function of a 
statistic and parameter alone, then the statistic is regarded as a sufficient 
estimator of the parameter. If 2 log L cannot be expressed in the form (15-56), 


no sufficient estimator exists in that case. 


Theorem 15-16. [f for a given population with p.df. f(x, 6), an MVB 
estimator T exists for 9, then the likelihood equation will have a solution equal 
to the estimator T. 


Proof. Since T is an MVB ae of 6, we have [c.f. (15-40)], 
2 log L = a = (T - 6) A(@) 
MLE for 6 is the solution of the likelihood equation 
2 logL=0 => 9 =T 
as required. 
- Theorem’ 15-17. (Invariance Property of MULE). If T is the MLE 
of 8 and y() is one to one function of 8, then y(T) is the MLE of w(@). 
Example 15-31. In random sampling from normal population N(,1; 0°), 
find the maximum likelihood estimators for 
(i) ppwhen o is known, 
(ii) o when pis known, and 
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(iii) the simultaneous estimation of p and o7. 
[Madras Univ. B.Sc. Sept., 1987) 
Solution. X ~ N (ut, 07) then 


= - Ee coe ‘ 2oH (i - )}] 
: (Hz) a , E Gi-w) ‘/20°} 


ky A 21 + w_m 
log i = 7 log (21) 7 10g 6 203,» (x; - H) 


Case (i). When o? is known, the ait equation for estimating {1 is 


1 
om logL=0 => - 202 , 2 z 2(x; — §1)(-1) = 0 


or xX @%-p=+0 » E x;— m= 0 
i=] is 


= => z ay a) 


Hence M.L.E. for 1 is the sample mean x. 
Case (ii). When 1 is known, the — ae for estimating 0? is 


0 1 
Agi log L = 0 => -3* 3+ agi - (x; -p)?2=0 
aA 1 3 
=> n-3 Y (@-p=0, ie, =- F @-nP  ...*) 
Ciel Niwl 


Case (iii). The likelihood equations for simultaneous estimation of 11 and o? 


7 log L =0 and 2 log L = 0, thus giving 
fl =% [From (*)) 
2, - uP [From (**)} 


a 
=7 2 (x; - X )? = s?, the sample variance. 


Important Note. It may be pointed out here that though 
A es 
E(t) = ECz) = "| 


A (cf. § 12-12) 
E(o’) = E(s?) # o2 
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Hence the maximum likelihood estimators (M.L.Es.) need not necessarily 


be unbiased. 
Remark. Since M.L.E. ts the most efficient, we conclude that in 


sampling from a normal population, the sample mean x is the most efficient 
estimator of the population mean UL. 


Example 15-32. Prove that the maximum likelihood estimate of the 
Pane a of a population Ps density function : 
4, (a-x),0<x<a 
for a sample of unit size is 2x, x being the sample value. Show also that the 
estimate is biased. (Burdwan Univ. B.Sc.. (Maths. Hons.), 1991) 
Solution. For a random sample of unit size (n = 1), the likelihood 
function is : 


L (a) =f (x, o)=4(a-2):0<x<a 


Likelihood equation oe ; 
£ og =o = Ag, Log 2-2 log a + log (a —x)] =0 
1 
=> — + 


Pk => 2a-x)-a=0 =» a=2x 


Hence MLE of a is given by O. = 2x. 
E(G) = E(2X) =2 J x. f(x, a) dx 


4 4. jar x3 
= a2 |, xa- xydx=o “27 3 


Since E(Q) # a, 0. = 2x is not an unbiased estimate of a. 
Example 15°33. (a) Find the ‘maximum likelihood estimate for the 


parameter A of a Poisson distribution on the basis of a sample of size n. Also 
find its variance. 


(b) Show that the sample mean x, is sufficient for estimating the parameter 
A of the Poisson distribution. 
Solution. THe probability function of the Poisson distribution with 
parameter A is given by 
7% e~*- 
P(X,= x) =f(x%, A) = x 


Likelihood function of random sample x,, x2, ...,X, Of'n observations from 
this population is 


42205422... 


ent, 
“Xi! x01. oe tp 


L= TL fA) = 
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log L =—nh+( 2h x) logh- 2X log (x; !) 


=-nh+nxlogh- log @; !) 


The likelihood equation for estimating dis 
Oop _ NX _ _= 
9 108 £ =0 => —-n+3-=0 => Nex 


Thus the M.L.E. for % is the sample mean X. 
The variance of the estimate is given by 


= E|- = (tog L)| (cf. (15-55) 
V(A) 
0 x x a 
el-dCee Ble CH) JeBew-h 
vA) =A/n 


(b) For the Poisson distribution with parameter A, we have 
o log L =—n+ y 


=n ( a ~ 1 = w(x,A), a function of x and A only. 


Hence (c.f. Remark Theorem 15-15), x is sufficient for estimating x. 
Example 15°34. Let x,, X2, ..., X, denote random sample of size n from 
a uniform population with p.df. 


fix, 9) =1;90- pSxS041,-0<0 <0 
Obtain M.L.E. for 9. [Delhi Univ. M.C.A., 1987] 
Solution. Here 
L = LQ; x, Xp, «3-5 Xq) = 1,0 - 3S 4; 50 +4 


= 0, elsewhere 
If X(1), X(2)> 2++» Xn) 1S the ordered sample then 


1 l 
0-5 SX) SX) S mee EX) <9 + 2 
Thus L attains the maximum if 
1 1 
0-58 xa) A Xin) $9 +5 
Lu ae oe 
=> 0 Sxa1) +5 A Xn) — 5 39 


Hence every statistic t = t(x,,X2, ...,X,) Such that 


Statistical Inference (Theory of Estimation) 16-59 


1 1 
X(n) — 5 St (X41, Xo, ..., Xa) SX) + > 


provides an MLE. for i 
Remark. This example illustrates that M.L.E. for a parameter need not be 


unique. 
Example 15-35. Find the M.L.E. of the parameters a and A, (A being 


large), of the distribution : 


S@: a, 0) = 55 { * eae sth 0sx<o,A>0 


You may use that Te large values of A, 
| | 
WA) = ey] log T{A) = log a- aA 


ad Wiest 
[Dethi Univ. B.Sc. (Stat. Hons.), 1985) 
Solution. Let x,, x2, ..., X, be a random sample of size n from the given 
population. Then : 


A ies 1 (4h Ms : 
L= au f(x, a, 0) = (Fas) ( . ie oi 5 a al, au (x) 


log L = ~n log PQ) + nA(log 4 — log a) -* , xi+(A- 1), z log x; 


Qn 
If G is the geometric mean of x,, x2, ..., X,, then 


log G =4 2 logx; = nlogG= 2 log x; 
inl inl 


log L =—n log I'(A) + nA (log A — log 0) Anz +a- 1). n log G 


where G is independent of A and a. 
The likelihood equations for the simultaneous estimation 1 of and dare: 


OL 2 oO _ 
FOr log L=0...(1) and yy log L =0 .. (2) 

(1) gives 
~ th, A nF =0 => —-l+ +25 0o-= ans Ae Ga | 


(2) gives (for large values of A), 


—n Cr r - 3x) +m| 1. Clog XW -log a) +A. 4 ~~ 4 nlogG=0 


1 x). 
=> a t(1- log a + log G -2 = 0 


=> 1 + 2A (log G -logx) =0 . [From (*)] 
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x : - 1 
: 1-r06 (Z)=0, te, £ -——1__ 
2 log (-x/G ) 


Hence the M.L.E.s for o and 1 are given by 
1 
2 log (x/G)- 
Example 15:36. In sampling from a power Series distribution with p.df. 
f(x, 8) = a,8*/w(0) ; x =0,1,2,... 
where a, may be zero for some x, show that MLE of @ is a root of the equation 
(C 
x - OVO _ 9), wa 
a = HO) (*) 


[Dethi Univ. B.Se. (Stat. Hons.), 1989} 
Solution. ‘Likelihood function is given by : 


: 7 | oxi =F ey x] @2%i 
= II fe = al v@ Lt ler 


A A 
a=x and A= 


where (8) = E(X). 


=> logL = log a, + log @. x xj—n log y(6) 
é= 1 jel 


Likelihood equation for estimating 9 gives : 


a _,7_ 2k ny’ 
a? @~ yi) 
> X=— a =H 90D . 100), (say). ae) 
Hence MLE of 0 is'a root of equation (*). 
We have : 
Q* 
E = 5 5 | ie } a 
(X) = = 5 xf ) ©) (**) 


Or @ = % = Se! i ee” ey) 


Differentiating w.r. to 8, we get 
2 [a,.x6*-1] =w’(6) 


x O*|  6.y%(0) 
= 5] 4, | =A ="v®) 
= E(X) =(0) =X, (From (**) and (*)] 
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x, be arandom sample from the 


Example 15-37. (a) Let x;, X», . 


uniform distribution with p.df. 
f(x, 9) = gr O<x<o, @>0 


= ; , elsewhere 
Obtain the maximum likelihood estimator for 90. 
[Lucknow Univ. B.Sc., 1992] 


(b) Obtain the M_L.Es. for a. and B for the rectangular population 
a<x< 
F(x: a, B) = {3 - a B 
0, elsewhere 
[Dethi Univ. B.Sc. (Stat. Hons.), 1989; Gujarat Univ. B.Sc. 1992] 


Solution. (a) Here 
Le IT fx = 9-9 ooo 5=( | spel y 


Likelihood equation, viz., g log L =0, gives 
a = Sn 
a9 (7 log 6) = 0 > =0 > 


obviously an absurd result. 
In this case we locate M.L.E. as follows 
We have to choose 9'so that L in (*) is maximum. Now L is maximum if 


6 is Minimum. 
» Xn) be the ordered sample of n independent observations 


Let X(1)2 X(2)2 +++ 
from the given population so that 
aS Xin) <8 => O02 X(n) 


OS x1) SX) S 
Since the minimum value of 8 consistent with the sample is x(q), _ the 


largest sample observation, Q= = X(p)- 
M.L.E. for 6 = x(,) = The largest sample observation 


t -(p4) 


log L =-—n log (B- 
The likelihood equations for a and B give 
0 n 
aa 08 = Oe 
—n 
loglL=0= 
and 3p 108 B-a 
Each of these equations gives B —- a =, an obviously negative result. So 
we find M.L.Es for a and B by some other means. 


(b) Here 
(**) 
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Now L in (**) is maximum if (6 — @) is minimum, i.e., if B takes the 
minimum possible value and a takes the maximum possible value. 

As in part (a), if xq), X(2), ---» X(q) iS an ordered random sample from this 
population, then as X(1) <s X(2) su. Xn) Ss B. Thus B 2 Xin) and a < X(1) 
Hence the minimum possible value of B consistent with the sample is x;,) and 
the maximum possible value of @ consistent with the sample is xj). Hence L is 
maximum if B = x,,) and & = xq). 

M.L.E. for o and 6 are given by 
A 


=X) = The smallest sample observation 


A 
and B = X,) = The largest sample observation. 
Example 15-38. State as precisely as possible the properties of the 
M.L.E.Obtain the M.L.Es. of a@ and B for a random sample from the 
exponential population 


f(x; a, B) = ype F &-%, a <x <0, 8B >0 
Yo being a constant. 


Solution. Here first of all we shall determine the constant yo from the 
consideration that the total area under a probability curve is unity. 


Yo i exp [- B(x- @)] dx=1 


e ~B- a) 2 
-B a 
f(x; a, B) =Be ®@-9 as x<0 


If x1, X2, ..., X, IS a random sample of n observations from this population, 
then 


or Yo 


=l = -ZO-D=1 => yo=B 


Le fi eacap=Peol 9 $a o}-mess 


ae log L =n log B — nB( x —-a) sc) 
The likelihood equations for estimating @ and B give 
d x 
aa er 0 = nB wsa("*) 
and = log L=0 =" _n(X¥-a wel 
$5 18 Bn - 0) Go 


Equation (**) gives B = 0, which is obviously inadmissible and this on 
substitution in (***) gives @ = 9, ‘a nugatory result. Thus the likelihood 
equations fail to give us valid estimates of « and B and we try to locate M.L.Es. 
for « and B by maximising L directly, 

Lismaximum = log L is maximum. 


From (*), log L is maximum (for any value of B), if (x - a) is minimum, 
which is so if @ is maximum. 
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If X(1), X(2). «++» X(ny iS Ordered sample from this population then 
OQ SX) SX2y S «ee SX—Qy < O, 
so that the maximum value of consistent with the sample is x1), the smallest 
sample observation, i.e., 


A 
Q= Xi) 
Consequently, (***) gives 
} ee De S | 
Ruz*e- ARX -Xq) => B= 
B x- X(1) 
Hence M.L.Es. for a and B are given by 
A A 
=X) and B=- ! 
> ae X(1) 


Remarks 1. Whenever the given probability function involves a constant 
and the range of the variable is dependent on the parameter(s) to be estimated, 
first of all we should determine the constant by. taking the total pro)ability as 
unity and then proceed with the estimation part. 

2. From the last two examples, it is obvious that whénever the range of 
the variabie involves the parameter(s) to be estimated, the likclihood equations 
fail to give us valid estimates and in this case M.L.Es are obtained by adopting 
some other approach of maximising L or log L directly. 


Example 5°39. Obtain the maximum likelihood estimate of @ in 
fix, 0) =(1+0)x%0<x<1, 


based on an independent sample of size n. Examine whether this estimate ts 
sufficient for 0. 


Solution. 
An a 6 
L (0) = II fx, a= +o. 2) 
= logL = nlog(1+6)+06. DX logx; 
i=l 


oO ee Coe = 
a9 log L oe a gt 2 log x; =0 
=> n+6 2% log x; + % log x; =0 
A 
§ =———--_ 1 = ————- 1 a Gs) 


{a +0)". (1, ns) 7 (1, 2) 


Also L(x, 0) 
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an 
Hence by Factorisation theorem, 7 = ( IT x) is a sufficient statistic for 
i=l 


A a 
6, and 6 being a.one to one function of sufficient statistic (1 x) » Is also 
inl 


sufficient for 9. 


. Example 15°40. (a)Obtain the most general form of distribution 
differentiable in 8, for which the sample mean is the M.L.E. 
(Dethi Univ. B.Sc. (Stat. Hons.), 1983) 


(b) Show that the most general continuous distribution for which the 
M.L.E. of a parameter @ is the geometric mean of the sample is 
ov 
6. — 
- 08 
fix, )= ( 7 } exp [y(8) + &(x)]. 
where y( 8) and &(x) are arbitrary functions of 8 and x respectively, 


Solution. (a) We have L =H f;, 9) 


= logh= & log fix; 9) = logs, Cf=flx, 8) 
jz 
the summation extending to all the values of x = (x), x2, ..., X,) in the sample, 
The likelihood equation is 
ch a io. oes z 
9 OBL =0, ie., 99 (2 log sf) =0 


=> 5S logs =0 => EE. Ea0 seal?) 


W are given that the solution of (*) is 
@=2 5% of nO= Sx 


ai 2-8) =0 --(*) 
Since this is true for all values of x and 9, we. get from (*) and (**), 

1 of_ 

tf 50 = = A(x - 6), 


where A is independent of x.but may be function of 8. Let us take 
A= ey where _ = y(@) is any arbitrary function of 0. 
Thus g 56 08S = S62 oy (x - 6) 
Integrating wr, to 6 naa we get 
log f = (x - 8). oy - [ev dO +E(x) +k 


where E(x) is an arbitrary function of x and & is arbicrary constant. 
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log f = (x - 8) £4 ye) + E(x) +k 
Hence f = Const. exp G = 8) oy, w(6) + 50) 


which is the probability function of the required distribution. 
Remark. In particular, if we take. 


w(6) -£ and E(x) = - 2, then 


2 2 
f = Const. exp| -6).0+ _ a 
= Const. exp [-3 (x? + 6? - 26x)] 
= Const. exp {-5( —6)?} 


which is the probability function of the normal distribution with mean 6 and 
unit variance. 
(6) Here the solution of the likelihood equation 


oO =¥ lop f= 

5 BL =X 55 logf=0 ..-(*) 
is- @ =(X1,Xo, ....X,)™ 
a log © =7, D log x => 2 (log x - log 0) = 0 su(**) 


Since this is true for all x and all 9, we get from (*) and (**) 
2 log f = (log x — log 6) A(®) 


where A (6) is an arbitrary function of 8 and is independent of x. 
Integrating w.r. to 9 (partially), we get 


log f = log x,J A(0) d0 ~ | A(6) log 6d0 + E(x) 
where €(x) is an arbitrary function of x alone. 
If we take | A(@) d0 = A,(6), then 


log f = log x . A,(8) - | Ax00) log 6 - | A,(@) . 1 40] + E(x 


A) 19 4 EG) 


= A,(0) log (x/0) + f 2 
Let us take 
») = 9 ow 
A,(6) =8 0 ° (suggested by the answer) 
where y = (6) is an arbitrary function of 6 alone. 


0 0 
log f = 0S log (x/0) + |[ ¥d0+8w) 
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= 0 2% tog (2/0) + y(0) + E(x) 


vga 
= log eS | + (0) + E(x) 


9 
Hence f =f, 9) = & exp [y(0) + E(x]. 


Example 15-41, A sampie of size n is drawn from each of the four 
normal populations which have the same variance o*. The means of the four 
populations area +b+c a+b-c,a-b+c anda-b-—c. What are the 
M.L.Es. for a, b, c, and c? 

Solution. Let the sample observations be denoted by x;,,i= 1, 2, 3, 4: 
j=1,2,...,. Since the four samples, from the four normal populations are 
independent the likelihood function L of all the sample observations x;,, 
(i = 1, 2, 3, 4; = 1,2, ..., n), is given by 


5 1 4a 1 5 5 2 
af a exp {- 5,2, ,F,av-wol 
where pt;, (i = 1, 2, 3, 4) is mean of the ith population. 


1 4n 1 5 > 
= Pe —_ —l——s a + o —_. 
( a ) xP | — gu |RCey — Ha)? +E Cea - Ha) 
+ U(X3j — 13)? + 2(%4j- ut] 


. log L =k —2n log o? 
1 ; 
—-—s5/ ¥(x,;-a —-b-c)* + E(xo;-a-b +c) 
58 | 2 ) a 2j ) 


+ Y(x%3;-a +b -c)* + Saya +b +0) 
J J 


where k is a constant w.r. to a, b, c and 07. 


The M.L.Es. for a, 6, c and o? are the solutions of the simultaneous 
equations (maximum likelihood equations for estimating a, b, c and o%): 


o 0 

ay, log L =O (1) ap 08 L = = 0 .. (2) 
9 oot =0 (3)  SelogL =0 (4) 
ac gs vee daz g Ce me 


(1) gives 
~sa| Rey ~ a= b—~c)(-2) + E(ta;- a ~ b + c)(-2) 
+ Y(x3,-a + b-c)(-2) + Lxqj-a +b + en(-2)| = 0 
i; 3 j 
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=> (x1; + X2j + Xs; + X4j) 
J 


+n [((-a—b—c)+(-—a—-b+.c)+(-at+ b-c)+(-a+b+c)]=0 


=> Dy ( D3 sy) n(-4a) =0 
ye 
A 1 4 nr = 
oan rj) mh 
Now (2) gives 


363 [Bxy-a-b-c) (-2) + Dry -a- b +c) (2) 
+ ery-a+b-c) (2) + Lexy + a+ b + c)(2)] =0 
J 
=> 2 Xj +L X7, - 2 Xj — 2 Xai 


+n[(-a-b-—c)+(-a-b+0c)-(-a+b-—c)-Catbt+cl= 
= LY xj + DL x25 — L x3; -— L x4; - 4nb = 0 


b=tl4r2,+b5 Lys 
=41 71 X1j gk X25 - Sta; A 4j 


A 
= b=(x, +X7-X3 -X,4)/4, 


where x, is the mean of the ith sample. 


Similarly (3) will give 
C= 4 (Fi —¥p +¥y-F/4 
Equation (4) gives 
2n 


Bh] Bay- 2-0 Set 2a Raa ere) 


+ X(x3;,-a+ b-c)? 4+ E(y-a+b +0)? |= 
j j 
A | A A A A A A 
Oa | Laj-a-b-c)+ Liy-a-b +c)? 
ALi j 


A A A A A A 
+ X(x3, - a + b = c)2 + LX (x4aj -— +b+ c)? 
dj. J 


Example 15-42, The fellowing table gives probabilities and observed 
frequencies in four classes AB Ab, aB and ab in a genetical experiment. Estimate 


the parameter 6 by the method of maximum likelihood and find its standard 
error, 
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Class Probability Observed frequency 
AB 112 + 0) 108 
4 
Ab 7 (1-8) 27 
aB i (1-6) 30 
ab 60 8 


Solution. Using multinomial probability law, we have 


= wt ! 1 = 
LaLO)= i nat na! We ( Pi P27 Ps Pa", Lp; = 1, Lnj=n 


=> logL=C +n, log p, + nz log pz + ny log ps + 14 log pa, 


where C = log a eee , IS a Constant. 
ny ing!naing! 


log L=C +n, log (23 8 } n, log ( r ae log (452). ng log & 


Likelihood equation gives : 
Plogh _m _m _ _M Mig 


0 =~ 2+0 1-60 1-0°0 +(*) 
my (ig +g) ng 
ae 2+0~ 1-0 *07° 
Taking n, = 108, n. = 27, ~~ and n, = 8, we ef 
_@7 + 30), =0 
7 + ae 1-6 
=> 1086 (1 dete aae ameicas 
=> 173 6? + 140 —-16=0 
mn a 
= 9 = 14+ A28 + OTE = - 0-34 and 0-26 
But 9, being the probability cannot be negative. Hence M.L-E. -of 6 is 
given by 6 = 0-26 ar Aged 


Differentiating (*) again partially w.r. to 8, we get 


0? log L S| _ ir +73) M4 
oe" (2+6)2 (1-6)? © e 
= (= log ). E(my)_, Elnz) + E(ns) | E(u) 
oe ~ (2 + 6) (1 - 9) = 
mpi __ , W(p2 + Ps) MPs 
“(2 +6)? (1-6)2 


_ n(2 + 8) ,_aua-98) , 
“4(2 +0)" 201 - 9)? * 


I he oh ale = En; = 173. 
7 4(2+ 6) 21-6) 2° 
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l l 1 
=2'2 E x 2-26 *2x0-74* 4x 0726 
= 173 [0-11 + 0-67 + 0-96].= 173 x 1-74 = 301-02 
A 1 
S.E.( 6) = V 1/1(8) = = 00576 
V¥ 301-02 . 
[cf. (15-55) Theorem 15-13)] 
15°12. Method of Minimum Variance. [Minimum Variance 


Unbiased Estimates (M.V.U.E.)]. In this section we shall look for estimates 
which (é) are unbiased and (i) have minimum variance. 


rn 
IfL= IT f\%j, 9), is the likelihood function of a random sample of n 
is 


observations x), X2, ...,X, from a population with probability function f(x, 9), 
then the problem is to find a statistic ¢ = f (x,, x2, .... X,), such that 


E()= if tLdx=¥(8) => if [¢-y(8)} L dx =0 .. (15-57) 


and V(t) = if [*t-E())? L dx = { [t —y(8)}* L dx .. (15-58) 


— co t= 00 


is minimum, where 


( dx represents the n-fold integration 


J-° 
if if eee i" dx, dx, eee dx, 


In other wotds, we have to minimise (15-58) subject to the condition 
(15-57). 

For detailed discussion of this method see MVU Estimators (§ 15-5-2) and 
Cramer-Rao Inequality (§ 15-7). 

15-13. Method of Moments. This method was discovered and studied 
in detail by Karl Pearson. 

Let f(x; 91, 92, ..., 0,) be the density function of the parent population 
with k parameters 61, 02, ..., 0. If ys’, denotes the rth moment about origin, 


then 
[tp = | x f(x ;'6;, 02, wee 6,) dx, (r = l, 2, ‘cea k) .e(15-59) 


In general }1,’, [1o’,...> |,’ will be functions of the parameters 6), 65, ..., 
6. 

Let x;,i = 1, 2,..., 2 be a random sample of size n from the given 
population. The method of moments. consists in solving the k-equations (15-59) 
for 6,, 05, ..., 0, in terms Of }1;’, [o’, .-.. Wy and then replacing these moments 
U,’;r= 1,2, ...,& by the sample moments. 
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A gilt 3. ANS 3 A, 
€.8., 6;= 6; ( Ly, pla, oo, ply ) 
= 6; (m,’, my, rer m, ) . i= 1, 2, disiey k 
where m, is the ith moment about origin in the sample. 


Then by the method of moments 8, Q>, eee 0, are the required estimators 
of 01, 8, ..., 0, respectively. 
Remarks. 1. Let (x,, x2, .... X,) be a random sample of size n from a 
population with p.d.f. f(x, 0). Then X;, (i = 1, 2, ..., m) are iid. => X/, 
(i= 1,2,,..,) are ii.d r.v’s. Hence if E (X/)-exists, then by W.L.L.N., we get 


R P 
1S xf — EH) 
a p 


=> m, —— pL,’ .. (15-60) 

Hence the sample moments are consistent estimators of the corresponding 
population moments. 

2. It has been shown that under quite general conditions, the estimates 
obtained by the method of moments are asymptotically normal but not, in 
general, efficient. 

3. Generally the method of moments yields less efficient estimators than 
those obtained from the principle of maximum likelihood. The estimators 
obtained by the method of moments are identical with those given by the 
method of maximum likelihood if the probability mass function or probability 
density function is of the form 


Sx, 6) = exp [bo + b;x + bax? + ...] .-(15-61) 
where b’s are independent of x but may depend on 0 = (6, 65,... ). 
(15-61) implies that 


L (x1, X2, ..., X_ 3 8) = exp [nby + bd 1; + bod x7 + ...] 
= Slog L=ag+Q; DX +02 Lx; + a3 Lx7+.,.] — ...(15-61a) 
j 


Thus both the methods yicld identical estimators if MLE's are obtained as 
linear functions of the moments. 
Example 15-43. Estimate a and B in the case of Pearson's Type IIl 
distribution by the method of moments. 
f(x; a, B) = en e-F O<x<co 


[Delhi Univ. B.Sc. (Stat. Hons.), 1987, 1988) 
Solution. We have 
af -1,-feq,—- B& Matr) Marr) 
WY =Ta@ Jo 7* oF Te parr = F(a) Bp 
»_T(a+2)_ (a+ l)a 
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a " 
=> pB=-5 , 
M2’ — py’? ’ Hy’ Me’ — Hy? 
Hence Q@&§ =—>—— zy and B=——7z 
mz —-my, ma—-my 


where m,’ and mz’ are the sample moments. 
Example 15-44, For the double Poisson distribution : 
= 1 emi my” 1 emmy 
p(x) = P(X = x)=5 oz +51 -3x=0,1,2 
show that the estimates a m, and mz by the method of moments are : 
[Dethi Univ. B.Sc. (Stat. Hons.), 1993) 
Solution. We have 


nm 5 x:p(x) = : pgcaa Ld eg. 5 Pp alr 
1 = ‘ == ~ —— 
x=Q 7 x=O x! ‘ z=20 x! 
1 1 
=om + 2 M2 ™) 


(since the first and second summations are the means of Poisson distributions 
with parameters m, and mz respectively). 


x=z0 
oo Madde | m,* es en Mm * 
“| Ze 7 )+ z,2-( 7 r } 
= 5 [(m,? + my) + (mz? + m2)] (ef. § 7-3-3] 
pa’ => [Cm + mz) + (m,2 + m?)] .(**) 
= ; [2yuy’ + ma,? + (2p)’ — m1)? ] [Using (*)] 


=3 [2pey’ + my? + 4p, + my? — Army By] 
Pha” = py’ + ty? + 2s? — Quy my => my? — 2m py’ + (2p? + py’ - feo J =O 
A 2 + V¥4p)7- 402 fF + ty’ - 2. , ? ? ? 
=>m,= E = Ho) uy + Ving ~ ty”? 


Similarly on substituting for m, in terms of nt. from (*) in (**), we get 
mo? — Imai" + (2p)? + py’ — We) = 0 
Solving for m2, we get 
mo =|’ = V 2’ — pi’ — 1”? 


Example 15°45, A random variable X takes the values 0, 1, 2, with 
respective probabilities 
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8 1 6) 8 af, 8 8 ~J-a A) 
aN (1 - ww 9 (t=) and at 2 (1-3) 
where N is a known number and a, 6 are unknown parameters. If 75 independent 


observations on X yielded the values 0, 1, 2 with frequencies 27, 38, 10 


respectively, estimate @ and a by the method of moments. 
(Dethi Univ. B.Sc. (Stat. Hons.), 1988] 


Solution. 


E(X) -0./ 8,4 (1 7 : | 1, ke rat -£)| 


=1 -$ (1 = 4 .(*) 


E(X4): = | ait . (1 x S)i+2-[ay+ : — (1 -¥| 
-3y+(1 - ma" 2(1 - a) 


, 6 3 0 
=> H2'= 2 aN 2% ( -¥) soa(**) 
The sample frequency distribution is 
x 0 1 _2 
f 21 38 10 , 


mi ypoty _ 38 
M's 1 2S = 75 (38 + 20) = 5, 


as ~ b (294 4) = 8 
Equating the samplé moments to theoretical moments, we get 


a 8)_ 58 
1 -$(1 -¥)-% 


Q 58 17 
= $(1 - $)= 1 33.20 .(**4) 
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Substituting in (**), we get 
o 17_ 78 A 42 
2- oN 3% 45"75 = 8=55N 


Substituting in (***), we. get 


a 42) 17 A 34 
$(1- 3} I (3g 
15-14. Method’ of Least Squares.” The principle of least squares is 
used to fit a curve of the form: 
y =flx, do, a}, ..., ,) .. (15-62) 
where a;’s are unknown parameters, to a set of n sample observations (x;, y;); 


i=1,2,...,” from a bivariate population. It consists iN minimising the sum of 
squares of sa viz., 


E= z Ly; —flx;, ao, 41, .--, d,)]? (15-63) 


subject to variations in do, a), ..., An. 
The normal equations for estimating ao, a), ....d, are given by 


os 0; i=1,2,...,0 .o(15-64) 


Kemarks. 1. In chapter 9, we have discussed in detail the method of least 
squares for fitting linear regression (§ 9-1-1), polynomial regression (§ 9-1-3) 
and the exponential family of curves reducible to linear regression (§ 9-3). In 
chapter 10 § 10-12-1, we have discussed the method of fitting multiple linear 
regression. 

2. If we are estimating f(x, do, a), .... G,) aS a linear function of the 
parameters do, d;,...,4,, the x’s being known given values, the least square 
estimators obtained as linear functions of the y’s will be MVU. estimators. 


EXERCISE 15(b) 


1. (a) State and explain the principle of maximum likelihood for 
estimation of population parameter. 

(b) © Describe the M.L. method of estimi.ion and discuss five of its 
optimal properties. 

(ii) Examine a situation when M.L. method fails and explain how you 
tackle such situations. 

(c) Define the likelihood function for a random: sample drawn from () a 
discrete population, (32) a continuous population. 

Find the likelihood function for a random sample of size n from each of the 
following populations : 

(a) Normal (7m, 67), (b) Binomial (n, p), (c) Poisson (j1), (d) Uniform on 
(a, b). [Calcutta Univ. B.Sc. (Maths. Hons.), 1991) 


* Ror detailed discussion see Chapter 9. 
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2. (a) A random variable X takes the values 0 and 1 with respective 
probabilities p and 1 — p. Obtain on the basis of random sample of size n, the 
maximum likelihood estimator of p. 

(b) Obtain the maximum likelihood estimator for the distribution having 
the probability mass function : 

f(x, 6) = 67 (1 - 6)F* x =0,1;0<0<1 
[Calcutta Univ. B.Sc. (Maths. Hons.), 1986] 
(c) Obtain the dua likelihood estimator of 0 in the following cases : 


(i) fix, e)=4 . exp (~x/6) ;x20,6>0 


(it) flx, 6) = ae 6* (1 — 6)? *;x=0,1,2,...,2 
3. Suppose that X has a distribution N (1, 6”), that is, the p.d.f of X is 


poe eool- HE 


Using M.L. estimation, determine 2 and o?. What conclusions do you 
draw on the nature of the result so obtained ? 

4. (a) Explain the technique of the method of maximum likelihod and give 
a formula for the large sample standard error of the maximum-likelihood 
estimator. 

(b) For the distribution with p.d.f. 

S, 8) = Oe **, (x 20; 6 > 0), find the maximum likelihood estimators of 
6 and E(X), and obtain their large-sample standard errors. 

(c) X is a random variable such that 

P(X $x) =0, forx<0 
=1-e**, forx2>9 


Based on n independent observations on X, obtain the maximum likelihood 
estimator of E(X). 


5. (a) Let X,, X, ..., X, be a random sample from the distribution with 
probability density function : 
fix, 0)= 56% 0<x<e, 0<@<00 
Find the maximum likelihood estimator of 0. 
(Madras Univ. B.Sc. Sept., 1988] 


(b) For the distribution : 
a ee 
dF (x) = 6°T(p) 


where p is knowa, find out the maximum likelihood estimate of @ of the basis 
of a random sample of size n from the distribution. Find the variance of the 
estimate. 

6. (a) If x; @ = 1, 2, ...n) is an observed random sample from the 
distribution having p.d.f. 


AE +! xt exp(-Ax) 
fi) = T(k +1) ,x>0 


exp (—x/0) xP-! 50 <x <00,p>0,6>0 


Statistical Inference (Theory of Estimation) 16-75 


A 
where A> 0 and k is a known constant, show that the ML estimator A for 2. is 


(k + 1)/x Show that the corresponding estimator is biased but consistent and 
that its asymptotic distribution for large n is 
N (A, A/[n(& + 11). 

[Delhi Univ. B.Sc. (Stat. Hons.), 1986] 
o 

(b) Derive the MLE of the mean 7-3 

f(x) = (B (a, 2)t' x™! (1 -x),0<xela>Qd. 
[Delhi Univ. B.Sc. (Stat. Hons.), 1990] 

7. (a) From a sample of size n from the population of X, determine the 
maximum likelihood estimates of the parameters a and b of the probability 
density 


of the beta distribution : 


f(x) = Constant exp [~ (x — a)/b]; x 2 a, b > 0, —- 00 < a <0 
[Calcutta Univ. B.Sc. (Maths Hons.), 1991) 
(b) Let X;, Xo, .... X, be a random sample from the distribution with p.d.f. 


1 ,-G-8), y>Q,,- 0 <0, < 0, 0,>0 


Se; 0, 8.) = ‘ 
0, clsehwere 
Obtain the maximum likelihood estimators for 8, and 02. 
[Delhi Univ. B.Sc. (Stat. Hons.), 1992) 

(c) Given a sample of n independent observations from the distribution with 
density : flx, 1, 82) = 02! exp [~ (x-6,)/02], 0; $x < 00 

Find the maximum-likelihood estimator of 6. when 6, is known and the 
maximum likelihood estimator of 6, when 8, is known and also the joint 
maximum fikelihood estimators of 6, and 62. Comment on the estimators you 
obtain. 

8. (a) A random variable X has the probability density function : 

fx) = (B +1) x8, for O<x< 1), ® > -1). 
= 0, otherwise. 

‘Based on n-independent observations on X, obtain the maximum likelihood 
estimator of B and an unbiased estimator of (B + 1)(B + 2), when B #-2. 

(6) A random variable X has a distribution with density function 

fx) =(a4+1)x*,O<x<l,a>-l) 
= 0, otherwise 
and a random sample of size 8 produces the data : 
0-2, 0-4, 0-8, 0-5, 0-7, 0-9, 0-8, 0-9. 

Find the maximum} likelihood estimate of the unknown parameter a, it 

being given that in (0-0145152 ~ — 4-2326 (in denotes natural logarithm). 


[Burdwan Univ. B.Sc. (Hons.), 1989] 
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(c) Find the MLE of 6 for a random sample of size n from the distribution : 
fix, 9) = (0 +1) x, O<xE1 


= Q, otherwise 
Show that it is also sufficient statistic for 8. 
Ans, MLE (6) =|-——*—- 1 ...(*) 
x log x; 
ie} 


a 
T = J] x,, is sufficient estimator for 6 
ial 


A 


—n : ‘ 
= 6 = E (ik * 1} being a one to one function of 


sufficient statistic, is also a sufficient statistic for 9. 

9. (a) Obtain the MLE for the parameter 6 in a random sample of size n 
from the uniform population U[0, 6]. 

Ans. 6 =X), the largest sample observation. 

(b) Show by means of an example, that MLE are not, in general unique. 

Ans. See Example 15-34. 

(c) Show that in a random sample from a distribution with p.d. f. 

flx, 0) = 0e -8 >0 : 

1/X is the MLE for © and has greater -variance than the unbiased estimator 
(mn - 1)(nX). 


Hint. MLE 6=1=2, T= 5 X; > AX= 
X T i=l 
X;, @ =1,2,...,”) areiid. y(@, 1) 
= T =3,X;~7(@,n) 
n-1 n- 1 
EA - -E[% 5+ |- (n— 1) ET) = 0. 
xX 


wg hate) ew 


10. (a) Let x,, x2, ...,X, be a random sample from a population with 
density : , 
fx, 8) = 5 exp [- | x ~ Bj |, - 20 <x <0, 


Find the estimator for 6 based on the method of maximum likelihood. 
[Madras Univ. B.Sc., 1989] 


Hint. es 5 few |- x |«;- 9 || is maximum, i x |x; ~ 9| 
Sa iol i=] 


iS MiMImien, => 9 ~ Median of (1, X3, ....Xq)- 
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(b) Obtain the maximum likelihood estimator of © based on a random 
sample of size n from the population with p.d.f. 

(i) f(x, 0) = 6 &-; 0 < x < 00, — 00 < @ < 00 

(ii) f(x, 8) = 0x81; 0<x<1,0< 6 <0, 

Examine in each case, whether 6 is unbiased. 


a 
Hint. (i) L is maximum if + (x; -6) is minimum. 
i=l 


=> Each deviation (x; -9),i=1,2,...,.2 is minimum => Q = = Xi). 

11. (a) Explain what is meant by an estimate of a population parameter. 
Find the maximum Sikelihood estimate of the parameter © of a population 
having density function : 

2(8 — x)/62, (0 < x < 8) 
for a sample of unit size and examine whether the estimate so obtained is biased 
or not. [Calcutta Univ. B.Sc. (Maths. Hons.), 1987] 


Ans. 6 = 2x; biased. 
(6) Obtain Maximum Likelihood Estimator of 6 for the distribution : 


C, 
fot, 0) = <2 5 x= 0, 1,2, 0.50 >0, 


C, is a constant. Also write the Maximum Likelihood Estimator of 
302 + 46 + 5. (Agra Univ. B.Sc., 1988) 
Hint. For MLE of 362 + 46 +°5, use Invariance Property of MLE (¢/f. 
Theorem 15-17) 
(c) A population has a density function given by : 


f(x) = = 2/2 xe ae 3-00 <X < 00 


Find the maximum likelihood estimate for v. 
[Calcutta Univ. B.Sc. (Maths. Hons.), 1988] 

12. (a) Consider a population made up of 3 different types of individuals 
occurring in the population with probabilities 62, 26 (1 — 6) and (1 —- 9)?, 
respectively where 0 < 6 < 1. Let m;, nz and nz denote the respective random 
sample sizes of the above three types of individuals. Determine the maximum 
likelihood estimator for 9. [Rajasthan PCS, 1989] 

(b) Obtain thé maximum likelihood estimate of 6, if the variable takes the 
values 1, 2, 3 and 4 with probabilities (1 — 6)/2, (1 — 6)/2, 6(1-6) ‘and 6? 
respectively and the observed frequencies are n,,.n, nz and n, respectively. 

13. In life-testing it is sometimes assumed that the life-time of an item is 
a random variable which is greater than or equal to x with probability 


#l-(4 J] 


x20, m > 0 is known ana @ > 0 is unknown. Suppose n such items are tested 
and yield X;, Xo, ..., X, as their times of “death”. 


Find the maximum likelihood estimate of 8. 
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14, X,, X2, X3, X4 are iuuependent normal random variables with means 
a+B8,a0- 8, a + 28,a— 8 respectively and a common variance o?, on the 
basis of one observation on each X; ; obtain the maximum likelihood estimators 
of a, B and o?, What is the asymptotic variance of a2 ? 
{(Bharatiyan Univ. M.Sc. (Maths), 1993) 
15. (a) For the bivariate normal distribution A (411, U2, 67, Op”, p) find the 
maximum likelihood estimators 
(é) of 6,2, o2* and 9 when 41, and py are known, 
(ii) of all five parameters of the distribution. 
(b) Describe clearly the important properties to be possessed by a good 
estimator. 
If (x;, y;), @ = 1,2, ..., 2) come from a bivariate normal population with 
zero means, unit variances and co-efficient of correlation p, obtain the maximum 
likelihood estimator of p. 


16. (a) Show that the most general continuous distribution for which the 
M.L.E. of a parameter 6 is the sample harmonic mean is : 


fix,0) = exp| {0 30 VO) = “36 8) 


where (6) and E(x) are arbitrary functions of 6 and x respectively. 

(b) Explain the principle of maximum likelihood estimation. Give 
examples to show that MLE need not be unique and also not necessarily 
unbiased. 

Show that the most general form of the distribution for which the sample 
arithmetic mean X is the MLE of 0 has the p.d.f. 

fix, 8) = exp [(x — 6) A’(8) + A(8) + B(x) | 
{Delhi Univ. B.Sc. (Stat. Hons.), 1988) 
17. (a) Suppose that distribution of X is represented by the function : 


PX=x)= =e y= 0,1, 2,. 


where A > 0. Given a random sample of size n, show that the sample mean is 
the maximum likelihood estimate of A. Show further that this estimate is 
(@) best unbiased, and (i) consistent. [Delhi Univ. M.A. (Eco.), 1986) 

(6) Consider the estimation of the Poisson parameter from a random 
sample. 

(i) Work out the maximum likelihood ¢stimator and its variance. 

(ii) Work out the Cramer - Rao Lower bound and show that it is equal to 
the variance worked out in (i). Comment on the significance of this result. 

(Delhi Univ. M.A. (Eco.), 1990] 
18. X is a discrete random variable and 
PX=r) =(1—-p)p’-'; r=1,2,3,... 

Find the MLE of p based on a random sample of n observations and its variance 
in large samples. 
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Show that the variance attains the lower bound of C.R. inequality. 


19. Explain the terms : (é) sufficient estimator, (ii) efficient estimator, 
(iit) Cramer-Rao lower bound to the variance of an estimator, (iv) maximum 
likelihood estimator; and describe the relations amongst these four concepts. 


20. (a) Describe the method of moments for estimating the parameters. 
What are the properties of the estimates obtained by this method ? 


(b) Let (X;, X2, .... X,) be a random sample from the p.d_f. 


fix, 0) =0e",0<x<—, 6 > 0: 
= 0, elsewhere 


Estimate 6 using the method of moments. 
(Madras Univ. B.Sc., 1988) 


21. X,, Xz, .... X, iS a random sample from 
| 1. 
fix; a, 6) Raa a<x<b 


= 0, elsewhere 
Find estimates of a and b by the method of moments. 
(Gujarat Univ. B.Sc. Oct., 1993] 
22. Explain the methods of estimation-method of moments and maximum 
likelihood. Do these lead to the same estimates in respect of the standard 
deviation of a normal population ? Examine the properties of the estimates from 
the point of view of consistency and unbiasedness. 


23. (a) Estimate 9 in the density function 
fa, 9) =(1 + 0) x9; 0<x<1 

by the method of moments and obtain the standard error of the estimator. 

(b) The sample values from population with p.d.f. 

fx) = (1 + 0) x9, 0<x<1,6>0, 

are given below : 

0-46, 0-38, 0-61, 0-82, 0-59, 0-53, 0-72, 0-44, 0:59, 0-A0 

Find the estimate of 6 by (i) method of moments and (ii) taaximum 
likelihood estimation. 


24. (a) For the distribution with probability function : 
x= ‘1, 2; 3, eer 


obtain the estimate of ® by the method of moments. 
(6) For the following probability function : 


x — m)3-% 
I P) -( ‘ \r ae py [x = 1,.2, 3], 


obtain the estimator of p by the method of moments, if the frequencies at 
X= 1;2 and 3 are respectively 22, 20 and 18. 
-25. Let x;,X2, ..., X, be a sample from a distribution with density. 
function : 
fo(x) = 0(6 + 1) x! (1x), O<x< 1,6 >0 
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Determine the estimate of © by the method of moments, 
[Indian Civil Services, 1981) 
26. Explain the method of minimum chi-square in estimation, with a 
suitable example. [Madras Univ. B.Sc., March 1989) 
27. Describe the method of moments and discuss. when the estimates 
obtained by the method of moments are identical with those of maximum 
likelihood estimates. 
Estimate o and B by the method of moments for the distribution : 
f(x; a, B) “tps eBt O0< x <00, 
[Dethi Univ. B.Sc. (Stat. Hons.), 1987, 1983] 
28. State the conditions under which Maximum Likelihood Estimators of 
the parameters are identical with those given by the method of moments. 
Examine if the MLEs of the parameter(s) are identical with those obtainéd 
by the method of moments in random sampling from the following 
distributions : 


(fix, 0) = 5. exp-(- t); 0<x<co 


] 
(ii) f(x, p, 62) = — exp [- (x - 21)2/207] ; -0 <x <0, 
ene 


Ans. (i) MLE ( 6) =%= 6 (Method of Moments) 
(it) MLE (1) = X = {1 (Method of Moments) 


MLE (6?) = s? (sample variance) = 6? (Method of Moments). 

29. Independent samples of sizes n, and nz are taken from two normal 
populations with equal means 1 and variances respectively equal to Ao?, 0%, 
Find the maximum likelihood estimator of 1 based on (n, + nz) sample 
observations and show that.its large sample variance is 


; 
Var (1) = oz + na] 
Hence show that the unbiased estimator, ¢ = (nix, + nox )/ (ny + no) 


. (ny + Nn»)? - 5 ‘ , eae 
has efficiency, (nh + nam, + nA) which attains the value 1 if and only if 
A=1, 


Ans. MLE (it) = (zt + min) [+ na } 


OBJECTIVE TYPE QUESTIONS 
1. Comment on the following, statements : 
(i) In case of.the Poisson distribution with parameter A, x is sufficient for 


(ii) If (X,, X>, ..., X,) be a sample of independent observations from the 
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yniform distribution on (6, 6 + 1), then the maximum likelihood estimator of 6 
is unique. 

(iii) A maximum likelihood estimator is always unbiased. 

(iv) Unbiased estimator is necessarily consistent 

(v) A consistent estimator is also unbiased. 


(vi) An unbiased estimator whose variance tends to zero as sample size 
increases is Consistent. 


(vii) If'¢ is a sufficient statistic for © then f(t) is a sufficient statistic for 
@). 

A (viii) If t, and % are two independen: estimators of 9, then ¢, + f is less 
efficient then both ¢, and ¢. 

(ix) If T is consistent estimator of a parameter 6, then. aT +b is a 
consistent estimator of a8 + b, where a and b are constants. 

(x) If x is the number of successes in n independent trials with a constant 
probability p of success in each trial, then x/n is a consistent estimator of p. 

II. Fill in the blanks : 

(i) In a random sample of size n from a population with mean pL; the 
sample mean (x ) is ... estimate of ... 

(ii) The sample median is ... estimate for the mean of normal population. 


(iii) An estimator ) of a parameter 6 ts said to be unbiased if... 


(iv) Thé variance s? of a sample of size n is a ... estimator of population 
variance O°, 


(v) If a sufficient estimator exists, it is a function of the ... estimator. 
(vi) ... estimate may not be unique. 


UL, (a) Give example of a statistic ¢ which is unbiased for a parameter 6 
but ¢2 is not unbiased for 8. 


(h) Give example of an M.L. estimator which is not unbiased. 
{V. What is the relationship between a sufficient estimator and a 
max‘muin likelihood estimator ? 


V. (i) If x is an unbiased estimator for the population mean 1, state which 
of the following are unbiased estimators for 1? : 


2 
(a)x?, (b)x2- = (6? is known/unknown). 
(ii) If ¢ is the maximum likelihood estimator for 9, state the condition 


under which f(é) will be the maximum likelihood estimator for f(8). 


(tii) Write down the condition for the Cramer-Rao lower bound for the 
variance of an unbiased estimator to be attained. 


(tv) Write down the general form of the distribution admitting sufficient 
Statistic. 


VI. A random variable X takes the values 1, 2, 3 and 4, each with 
probability +. A random sample of three values of x is taken, ¥ is the mean and 
m is the median of this sample. Show that both x and m are unbiased estimators 
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of the mean of the population, but x is more efficient than m. Compare thei; 
efficiencies. 

VII. Give an example of estimates which are 

(t) Unbiased and efficient, (if) Unbiased and inefficient. 

15-15. Confidence Interval and Confidence Limits. Let x, 
(i = 1,2, ..., 2) be a random sample of n observations from a population 
involving a single unknown parameter 0 (say). Let f{x, 8) be the probability 
function of the parent distribution from. which the sample is drawn and let us 
suppose that this distribution is continuous. Let ¢ = ¢(x1, x2, ..., X,), a function 
of the sample values be an estimate of the population parameter 6, with the 
sampling distribution given by g(¢, 8). 

Having obtained the value of the statistic ¢ from a given sample, the 
problem is, ““Can we make some reasonable probability statements about the 
unknown parameter 6 in the population, from which the sample has been 
‘trawn ?” This_question is very well answered by the technique of Confidence 
Interval due to Neyman and is obtained below : 

We choose once for all some small value of a (5% or 1%) and then 
determine two constants say, c, and cz Such that 

P(e, <8<c,l)=1-a . (15-65) 

The quantities c, and cz, so determined, are known as the confidence limits 
or fiducial limits and the interval [c,, C2] within which the unknown value of 
the population parameter is expected to lie, is called the confidence interval and 
(1 — a) is called the confidence coefficient. 

Thus if we take a = 0-05 (or 0-01), we shall get 95% (or 99%) confidence 
limits. 

How to find c, and c2? Let 7, and T, be two Statistics such that 

P(T, > 8)=a, . (15-66) 
and P(T, < 8) =a, ..-(15-66a) 
where Q, and OQ, are constants independent-of 8: (15-66) and (15-66a) can be 
combined to give 
P(T, <9<T,)=1-a, . (15-665) 
where & = G1, + Q,. Statistics 7; and T> defined in (15-66) and (15-66a) may be 
taken as c; and c> defined in (15-65). 

For example, if we take a large : sample from a normal population with 

mein 1 and standard deviation o, then ~ 


2-1 _No,1) 
~ ONa 
and P(~1-96 < Z < 1-96) =0-95, 


[From Normal Probability Tables] 


235 P (1-96 < 2H < 1.96 |, = 0-95 
( = oVn 
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fwepexr+ 1.96] = 0-95 


Vn Vn 


= 2) ee 
Thus x + 1-96 are 95% confidence limits for the unknown parameter LL, 
Nn 
the population mean and the interval 


=> P|x - 1-96 


- 1-96 i ,x + 1-96 a is called the 95% confidence interval. 
Also P(-2:58 < Z < 2:58) = 0-99 
= P{ 2 <t HH. 238] 09 
on 
= P(x 2-582 <p <= +2-58-X)\=0.99 
F-28, tn) 


Hence 99% confidence limits for uw are x + 2-58 — and 99% confidence 
n 
merval for is] 3 2:58, = + 2-58 | 
in ieee ’ ; 
| Vin Vn 
Remarks 1. Usually o? is not known and its unbiased estimate S? 
pbtained from the samples, is used. However if n is small, 


Z=*— is not N (0, 1) 
SNn 
and in this case the confidence limits and confidence intervals for 1 are obtained 
by using Student’s ‘?’ distribution. 

2. It can be seen that in many cases there exist more than one set of 
confidence intervals with the same confidence coefficient. Then the problem 
arises as to which particular set is to be regarded as better than the others in 
some useful sense and in such cases ‘we look for the shortest of all the intervals. 
Example 15°45. Obtain 100 (1 - a)% confidence intervals for the 
parameters (a) @ and (b) o”, of the normal distribution 

, 2 
Kix, 9; 6) = —K exp |(-> (2=°) »- 2 <X <0 
oV2n OF 

Solution. Let X;, (= 1, 2, ..., 2) be a random sample of size n from the 
density f(x ; 8, o) and let 


i 
Nj 


] 
n—-1; 


ae n n se n a 
Kel 3x, tal & (x -H, Stet YX, ¥P 
= is = 


pee 8 
SINn 


(a) The statistic : 
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follows student’s ¢-distribution with (m — 1) degrees of freedom. Hence 
100(1 ~ «)% confidence limits for 6 are given by 
P(l«lst,J =l-a 


- S 
=> Pilx -O lst sl-a 
peer e) 
= P)K- tas 0sk+tq.-5]=1-a ---(567) 
Vn Vn 


whe.e fq is the tabulated value of ¢ for (n — 1) d.f. at significance level ‘aq’. 
Hence the required confidence interval for 0 is : 


= S ‘s S 

Lets =X +l SS 

pea ee 
(b) Case (i) © is known and equal to yp (say). 


2(X;- pM)? _ ns? 
Then o = o as 1) 
If we define x,” as the value of x? such that 
P(x? > Xa") = | , PXra? =0. af! 
Ya ' 


where p(x?) is the p.d.f. of ~?-distribution with nd.f., then the required 
confidence interval is given by 


PL X71 -(any SX? SX¥7an2)=1-a 


ns? 
=> P i -(/2) 5 gS arn | =1-o +) 
| ns? — > ns Cs 
Now o $s X a/2 => un so 
and 28 SC > => o2 << => 
X"1- (@n) co 121 - (a2) 
Hence (**) gives 
ns? ns? 
Pi—=—so7s Jsl-a PM baoced 
E= Anal om) 


where X72 and ¥71-<qyz are obtained from (*) by using n d.f. 
Thus e.g., 95% confidence interval for o? is given by 


Pleo <o< on |= 089 


X7o.005 X7o975 
Case (ii). 8 is unknown. In this case the statistic 
X(X;-X)? nse 
o ig  Xe-n 
Here also confidence interval for o? is given by (***) where now 2, is the 


significant value of 7? [as defined in (*)] for (m — 1) d.f. at the significance level 
¢ a’. . 
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Example 15-46. Show tnat the largest observations L of a sample of n 
observations from a rectangular distribution with density function : 


fx, ®) =. O<x'<0 tt) 


= 0, otherwise 


-1 
dG(L) =0(% J OSL <0 


Show that the distribution of V = L/@ is given by p.df. 
A(vy=nv™!,0svs1 
Hence deduce that the confidence limits for @ corresponding to confidence 


coefficient a are L and 


has the distribution 


on 
( l-~a iia 

[Delhi Univ.‘B.Sc. (Stat. Hons.), 1982, 1983] 
Solution. Let X,,X2...,X, be a random sample of size n from the 
population (*) and let L = max (X;, Xo, ...,X,). The distribution of L is given 
by dG(L) = n[F(L)}™". AL) dL 
where F(.) is the distribution function of X given by 


FL) = f, fix, Ode = 


1 
dG(L) -n(5 J © osise 


If we take V=L/8, the Jacobian of transformation is 6. Hence p.d.f. h(.) of 
Vis given by 
A(v) = av LT = ave, O<v<i 
which is independent of 6. 


To obtain the confidence limits for 6, with confidence coefficient a, let us 
define v,, such that 


1 
P(va<V<l=a => J h(v) dv=o. .-.(**) 


1 
=> n{ widv=a => I1-v,"*=a 


= Vq = (1 —@)!/n (RH) 
From (**) and (***), we get 
P[a -a)'!*#<V<1j=a 
=> P| - ayn < Be i]=0 
L 
=> ap <@0< (1 - ais a 


Hence the required confidence limits for 6 are L and L/{(1 - a)'”. 
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Example 15-47. Given a random sample from a population with p.d f. 
f(x, Q=4. O<xs0 


show that 100 (1 — a)% confidence interval for 0 is given by R and R/w where 
yw is given by 
y™! [n-(n-1)y] =a, 
and R is the sample range. 
Solution. The joint p.d.f. of x;, x2, ..., x, is given by 

i 
@" 

_ If xq), X), «++» X¢ay is the ordered sample then the joint p.d.f. of x4) and 
Xa) is given by 


L= .0sx;<S9 


n(n — 1) 


: n-1 a-2 
(Xa) ; X ny) = & (x, - xay] ,O<s Xa) S Xn) <6 


To obtain the distribution of the sample range R, let us make the 
transformation of variables 

R = Xn) — X(1) and VEXu => V=X—qy-RSO-R 

The Jacobian of transformation is | J | = 1 and the joint p.d.f. of R and 
V decomes 


n(R,v) =22=Dper ocvco-R 
The marginal density of R is given by 
O-R = 
h(R) = J nD) R22 dy 
_ a-2 (9 — 
= Le O-%) oeR<e 


The density of U = R/0 is 
| dR (n — 1) R*-2(9-R) 
ha(u) = hy(R) | 2% | A= Re r 


=n(n —1)u*? (1-u),0<u<1 
_ 100 (1 - @)% confidence interval for 6 is given by 


PwwsU <1) =1-a .(*) 
where y is obtained from the equation 
fin = 
v 
=> nin—1) |" u*2(1—u)du =a 


nu™! —(n —1) u* i =O 


= y*! (n-(@- Dy] =a -.(**) 
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From (*), we get 


R 
= plrsos®|a1-c 
VW 


Hence the required limits for 6 are given by R and R/y, where y is the 
solution of (**). 
Example 15-48. Given one observation from a population with p.df. 


flx, 0 =5@ _x), 05x <6, 


obtain 100 (1 -— a)% confidence interval for 8. 
[Delhi Univ. B.Sc. (Stat Hons.), 1991) 


Solution. The density of u = x/6 is given by 


g(u) = lx, ): | & |-2¢ -—x). 6 


= 2(1 -u), OSu <l 
To obtain 100 (1 — a)% confidence interval for 6, we choose two quantities 
u, and u, such that 


Plu; Susu] =1-a saa) 
and Plu<u,] =P({u>ujJ =a/2 
& 
Now Plu < uy) = => i 2(1 - u) du =F 
0 
=> uu)? — 2uy + F=0 st) 


oa a. . a. 
Similarly, P(u>u,) = 7 > J 2(1 — u) du = my 
a J 


> Un? ~ 22 + (1 - 5 =o ate) 


From (*), we get 
u,S=Su,|=1-a = P|*+se@s*|=1-c 
0 Uy uy 
Hence the required interval for 0 is : a , where u, and u> are given 
1 


by (**) and (#**), 
15-15-1. Confidence Intervals for Large Samples. It has been 
proved that under certain regularity conditions, the first derivative of the 


logarithm of the likelihood function w.r.t parameter 6 viz., 7 log L, is 
asymptotically normal with mean zero and variance given by 


4 
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0 Q ta 
Var ‘s log L)=e t- log Ly =2(- 392 /og L] 
Hence for large a, 
2 log L 
LS NOD) ...(15-68) 


A | oa 
Var (5 log L) 


The result enables us to obtain confidence interval for the parameter 6 in 
large samples. Thus for large samples, the confidence interval for 8 with 
confidence coefficient (1 — a) is obtained by converting the inequalities in 


P[IZ|<Agl=1-—a (15-69) 
where A, is given by 
== {* exp (12/2) du = 1-0 _..[15-69(0)] 
V2n J-2 


Example 15-49. Obtain 100 (1 -@)% confidence limits (for large 
samples) for the parameter A of the Poisson distribution 


Ae i ex. 


a @ 
x! 
Solution. We have 


oO _@0 a A 
5h log L “n\-™ + ( 2 x;) log A - 2, log | 


= sp E(X) => 
na % 1 
Z= a = V(nf) (X~A) ~ NO, 1) (Using (15-68)] 


Hence 100 (1 —a)% confidence interval for A is given by (for large 
samples) = P[IW(nfa) (% - a) < Ag] = 1-0 ? 
Hence the required limits for 4 are the roots of the equation : 
War (x -A)I = he 
> n(X—-A)>-A.Ag*= 0 
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2 
= A2 — -a (25+ lez 0 


GesM)e [ler Jaf 


7. east) 


For example, 95% confidence ne for A is given by taking A, = 1-96 in 
(*), thus giving 


n=5 (22 384) .9/ (88 3-84x | ug? =z21960 JE. 


to order r'?, 
Example 15-50. Show that for the distribution : 
dF (x) = 9 e-*°;0<x<0o 
central confidence limits for @ for large samples with 95% confidence coefficient 


; 1-96 
are given by 8 =;1+ —— x 
(vn )/ 
Solution. Here L =0* exp |-0 z x | 


t) 
59 log L = [n log 0-0 5 x,] 


ce \ arf. & _ 2 
var(% log L) -(- 592 [98 L)=% 
Hence, for large samples, using (15-68) we have : 


Z= ~ N(,1) = Vn (1-0%)~ NO, 1) 


Vin? 


Hence 95% confidence limits for 6 are given by 
P[-1-96 < Vn (1-0) $ 1-96] — 


Now Vn(1-9%)5196 => f1- 1:96\1l <4 (*) 
Vn Jz 
and -1-96 sVn(1-6z). => o<(i+ 128 r ~c(P*) 
Vn Jz 
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Hence, from (*) and (**), the central 95% confidence limits for 6 are given 


by 
1-96\ 1 
ee Ue er 
fea he 


EXERCISE 15 (c) 


1. Discuss the concept of interval estimation and provide suitable 
illustration. [Dethi Univ. M.A. (Eco.), 1987] 
2. Critically examine how interval estimation differs from point 
estimation. Give the 95% confidence interval for the mean of the normal 
distribution, when its variance is known. 
[Madras Univ. B.Sc. Sept., 1988] 
3. What are confidence intervals ? How are they constructed using ¢- 
distribution ? 
[Madras Univ. B.Sc., March, 1989| 
4. The random variable X is uniformly distributed in (a, a + 2). Obtain 
limits x, and x, such that 
P(X $x) = PX 2 x2) = 0-025 
The random variable is observed once, the value being x). Give a.method of 


obtaining an interval estimate for ‘a’ which you expect to be correct in 95% of 
trials. [Calcutta Univ. B.Sc. (Maths. Hons.), 1990] 


5. Obtain 100 (1 —a)% confidence interval either for the unknown 
parameter p of a binnomial distribution when the parameter n is known or for 
the population correlation coefficient when the population is Normal. 

[Delhi Univ. B.Sc. (Stat. Hons.), 1983] 

6. Let fo(x) = 1/6, 0 <x <6 and let L be the largest observation of a 
sample of size n from the above distribution. 

Obtain the distribution of (L/6) and hence deduce that the confidence limits 


corresponding to confidence coefficient a are L, and er respectively. 


(Delhi Univ. B.Sc. (Stat. Hons.), 1992 


7, (a) What are confidence intervals ? y is the largest observation in a 
sample of size n drawn from a rectangular population in (0, 8). Find the 
confidence coefficient corresponding to the confidence interval 


{y, y/ - apt} 


where ‘a’ is the level significance. 
[Bhartiyan Univ. M.Sc. (Maths.), 1991) 


(b) Prove that the confidence interval for 8 obtained in (a) part above is 
shorter than the one. obtained in Question 9 below. 


8. Develop a general method for constructing confidence intervals. 
Consider a random sample of size n from the exponential distribution with p.d_f. 


fx, 0) =e -%-%, 89 <x<~w, ~w0 << co, 
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Show that P {Xq)-+loga <$@<X,] =1-« 


where symbols have their usual meanings. Also interpret the result. 
[Delhi Univ. B.Sc. (Stat. Hons.), 1989) 


9. Consider a random sample X;, X2, ...,X, from an U[0, 6] population. 
Show that R and Rf are the confidence limits for © with confidence coefficient 
(1 — a), where R is the sample range and € satisfies the equation : 


E~1 (n-(n- 1E} =a 
[Delhi Univ. B.Sc: (Stat. Hons.), 1993, 1985] 


10. Explain the difference between point estimation and interval 
estimation. 

Obtain 100 (1 — a)% confidence interval for the population correlation 
coefficient ‘p’ when the random sample of size n has been drawn from bivariate 
normal population. ; 

[Delhi Univ. B.Se. (Stat. Hons.), 1988] 


11. Describe the pivotal quantity method for constructing confidence 
intervals. 
Obtain a large sample 100 (1 — «)% confidence interval for the parameter 6 
in random sampling from the population : 
dF (x) =6 e-*;x>0, 6>0 
[Delhi Univ. B.Sc. (Stat. Hons.), 1990] 
12, Develop a general method for obtaining confidence intervals. Obtain a 


100(1 — «)% confidence interval for large sample size for the paraineter 8 of the 
Poisson distribution : 


-8 Ax 
eo 0 1 
x! 


[Delhi Univ. B.Sc. (Stat. Hons.), 1987] 


13. Describe the general method of constructing the confidence interval. for 
large samples. 

If X,,X>, ...,X, is a random sample from an exponential distribution with 
mean 6, obtain 95% confidence interval for 8 when z is large. 

[Delhi Univ. B.Sc. (Stat. Hons.), 1993) 
14. (a) Show that with the exponential distribution 
dF (x) = Oe -®, x 2.0 

central confidence limits for 6 for large samples of size n and 95% confidence 
coefficient are : {1+ 1-96/V n}/ x, 


where x is the mean of the sample observations x,, X2, ...,X, drawn randomly 
from the exponential population. 


fix, ®) = 


{Indian Civil Services, 1983] 
(b) Let X,, X>, ....X, be a random sample from a distribution with density 
function : f(x, 9) = Be“, 0< x < 00 
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Find a 100(1 — a) (when 0 < @ < 1) percent confidence interval for the 
mean of this population, for large samples. 
(Madras Univ. B.Sc., 1991) 
15. (a) Discuss the problem of interval estimation. Obtain the minimum 
confidence interval for the variance for a random sample of size n from a normal 
population with unknown mean. 
[Indian Civil Services, 1991] 
(b) Give a method of determining the confidence limits for a single 
unknown parameter, stating the conditions of validity. From amongst intervals 
of Confidence Coefficient a, how will you decide one as being superior to 
another ? 
[Indian Civil Services, 1989] 
16. Consider a random sample X,,X», ...,X, from the exponential 
distribution with p.d.f. 


= , otherwise 
If p is known, obtain a confidence interval for 8. starting from the sufficient 


statistic Xip. 
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CHAPTER SIXTEEN 


Statistical Inference-iI 
( Testing of Hypothesis, Non-parametric 
Methods and Sequential Andlysis ) 


16-1. Introduction. The main problems in statistical inference can be 
broadly classified into two aréas : 

(i) The area of estimation of population parameters and setting up of 
confidence intervals for them, i.e., the area of point and interval 
estimation and 

(ii) Tests of statistical hypothesis. 


The first topic has already been discussed in Chapter 15. In this chapter we 
shall discuss: (a) The theory of testing of hypothesis initiated by J. Neyman and 
E.S. Pearson (Section 16-2), (6) Sequential analysis propounded by A. Wald 
(Section 16-4) and (c) Non-parametric tests (Section 16-3), In Neyman-Pearson 
theory, we use statistical methods to arrive at decisions in Certain situations 
where there is lack of certainty, on the basis of a sample whose size is fixed in 
advance while in Wald’s sequential theory the sample size is not fixed but is 
regarded as a random variable. Before taking up a detailed discussion of the topics 
in’(a), (0) and (cj, we shall explain below certain concepts which are of 
fundamental importance. 

16-2. Statistical Hypothesis-Simple and Composite. aA 
statistical hypothesis is some statement or assertion about a population or 
equivalently about the probability distribution characterising a population which 
we want to verify on the dasis of in,vrmation available from a sample. Vf the 
statistical hypothesis specifies the population completely then it is termed as a 
simple statistical hypothesis, otherwise it is called a composite statistical 
hypothesis. 

For example, if X,,X>, ....X, is a random sample of size 1 trom a normal 
population with mean jt and variance 2, then the hypothesis 

Ho: b= Uo, 6? = So? 
is a Simple hypothesis, whereas each of the following hypotheses Is 4 composite 
hypothesis: 

(i) L=Uo, (di) 0? = oo 

Git) Lb < po, 6? = Go? (iv) YL>Uo, 0? = do? 
(v) 2 =o, 0? < Go’, (vi) HW =o, 0? > do? 
(Vii) [L < flo, 62 > Go’. 

A hypothesis which does not specify completely ‘r’ parameters of a 

population is termed as‘a composite hypothesis with r degrees of freedom. 
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16:2:1. Test of a Statistical Hypothesis. A test of a Statistical 
hypothesis is a two-action decision problem after the experimental sample 
values have been obtained, the two-actions being the acceptance or rejection of 
the hypothesis under consideration. 


16-2:2. Null Hypothesis. In hypothesis testing, a statistician or 
decision-maker should not be motivated by prospects of profit or loss resulting 
from the acceptance or rejection of the hypothesis. He should be completely 
impartial and should have no brief for any party or company nor should he allow 
his personal views to influence the decision. Much, therefore, depends upon how 
the hypothesis is framed. For example, let us consider the ‘light-bulbs’ 
problem. Let us suppose. that the bulbs manufactured under some standard 
manufacturing process have an average life of 2 hours and it is proposed to test 
anew procedure for manufacturing light bulbs. Thus, we have two populations 
of bulbs, those manufactured by standard process and those manufactured by the 
new process. In this problem the following three hypotheses may be set up : 

(i) New process is better than standard process. 

(ii) New process is inferior to standard process. 

(iit) There is no difference between the two processes. 

The first two statements appear to be biased since they reflect a preferential 
attitude to one or the other of the two -.processes. ‘Hence the best course is to 
adopt the hypothesis of no difference, as Stated in (iii). This suggests that the 
statistician should take up the neutral or null attitude regarding the outcome of 
the test. His attitude should be on the null or zero line in which the 
experimental data has the due importance and complete.say in the matter. This 
neutral or non-committal attitude of the statistician or decision-maker before the 
sample observations are taken is the keynote of the null hypothesis. 


Thus, in the above example of light bulbs if p19 is the mean life (in hours) 


of the bulbs manufactured by the new process then the null hypothesis which is 
usually denoted by Ho, can be stated as follows : 


. Ho: = Uo, 

As another example let us suppose that two different concerns manufacture 
drugs for inducing sleep, drug A manufactured by first concern and drug B 
manufactured by second concer. Each company claims that its drug is superior 
to that of the other and it is desired to test which is a superior drug A or B ? To 
formulate the statistical hypothesis let X be a random variable which denotes the 
additional hours of sleep gained by an individual when drug A is given and let 
the random variable Y denote the additional hours of sleep gained when drug B is 
used. Let us suppose that X and Y follow the probability distributions with 
means [ly and [ty respectively. Here our null hypothesis would be that there is 
no-difference between the effects of two drugs. Symbolically, 

Ho : Ly = py, 

16-2°3. Alternative Hypothesis. It is desirable to state what is 
called an alternative hypothesis in respect of every statistical hypothesis being 
tested because the acceptance or rejection of null hypothesis is meaningful only 
when it is. being tested against a rival hypothesis which should rather be 
explicitly mentioned. Alternative hypothesis is usually denoted by H,. For 
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example, in the example of light bulbs, alternative hypothesis could be 
H,:U>Wo Orp <p Or [ # flo. In the example of drugs, the alternative 
hypothesis could be //, : Ux >My Or Ux <pPy OF Py ¥ py, 


In both the cases, the first two of the alternative hypotheses give rise to 
what are called ‘one tailed’ tests and the third alternative hypothesis resulis in 
‘two tailed’ tests. , 

Important Remarks 1. In the formulation of a testing problem and de- 
vising a ‘test of hypothesis’ the roles of 7/) and //, are not at all symmetric. In 
order to decide which one of the two hypothescs should be taken as null hypoth- 
esis Hy and which one as alternative hypothesis H,, the intrinsic difference be- 
tween the rol-s and the implifications of these two terms should be clearly un- 
derstood. 

2. If a particular problem cannot be stated as a test between two simple 
hypotheses, i.e., simple null hypothesis against a simple alternative hypothesis, 
then the next best alternative is to formulate the problem as the test of a simple 
null hypothesis against a composite alternative hypothesis. In other words, one 
should try to structure the problem so that null hypothesis is simple rather than 
composite. 

3. Keeping in mind the potential losscs duc to wrong decisions (which 
may Or may not be measured in terms of money), the decision maker ts 
somewhat conservative in holding the null hypothesis as true unless there is a 
strong cvidence from the experimental samplc observations that it is falsc. To 
him, the consequences of wrongly rejecting a null hypothesis seem to be more 
severe than those of wrongly accepting it. In mot of the cases, the statistical 
hypothesis is in the form of a claim that a particular product or product process 
is superior to some existing standard. The null hypothesis /fp in this case is 
that there is no difference between the new product or production process and the 
existing standard. In other words, null hypothesis nullifies this claim, Tic 
tejection of the null hypothesis wrongly which amounts to the acceptance oO! 
claim wrongly involves huge amount of pocket expenses towards a substantive 
overhaul of the existing set-up. The resulting loss is comparatively regarded as 
more serious than the opportunity loss in wrongly accepting Hy which amounts 
to wrongly rejecting the claim, i.e., in sticking to the less efficient existing 
standard. In the light-bulbs problem discussed earlicr, suppose the research 
division of the concem, on the basis of the limited experimentation, claims that 
its brand is more effective than that manufactured by standard process. If in fact, 
the brand fails to be more effective the loss incurred by the concern duc to an 
immediate obsolescence of the product, decline of the concem’s image, etc., will 
be quite serious. On the other hand, the failure to bring out a superior brand in 


the market is an opportunity loss and is not a consideration to be as serious as 
the other loss. 


16:2-4. Critical Region. Let x,, x2, ..., x, be the sample obscrva- 
tions denoted by O. All the values of O will be aggregate of a samplc and they 
constitute a space, called the sample space, which is denoted by S. 

Since the sample values x,;, X2, ..., x, can be taken as a point in 
n-dimensional space, we specify some region of the n-dimensional space and see 
whether this point lies within this region or outside this region. We divide the 
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whole sample space S into two disjoint parts Wand S—Wor Wor W. The 
null hypothesis Ho is rejected if the observed sample point falls in W and if it 
falls in Wwe reject H; and accept Ho. The region of rejection of Hy when Hpois 
true is that region of the outcome set where Ho is rejected if the sample point 
falls in that region and is called critical region. Evidently, the size of the critical 
region is a, the probability of committing type ) error (discussed below). 

Suppose if the test is based on a sample of size 2, then the outcome set or 
the sample space is the first quadrant in a two-dimensional space and a test 
criterion will enable us to separate our outcome set into two complementary 
subsets, W and W. If the sample point falls in the subset W, Ho is rejected, 
otherwise Hp is accepted. This is shown in the following diagram : 
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16:2:5. Two Types of Errors. The decision to accept or reject the 
null hypothesis Hg is made on the basis of the information supplied by the 
observed sample observations. The conclusion drawn on the basis of a particular 
‘sample may not always be true in respect of the population. The four possible 
situations that arise in any test procedure are given in the following table. 

DOUBLE DICHOTOMY RELATING TO-DECISION AND 
HYPOTHESIS 


Reject Ho Accept Hy 


weeaeerea 


iy 
i) 
¢ 
¢ 
4 
i) 
‘ 
i) 
¢ 
6 
( 
4 
i] 
i) 
( 
8 
a 
6 
CY 
¢ 
i] 
iy 


From the above table it is obvious that in any testing problem we are liable 
to commit two types of errors. 
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Errors of Type I and Type II. The error of rejecting Hg (accepting 
H,) when Hp is true is called Type / error and the error of accepting Ho when Ho 
is false (H/, is true) is called Type // error. The probabilities of type I and type IJ 
errors are denoted by & and B respectively. Thus 
a ‘= Probability of type J error 
= Probability of rejecting Hp when Ho is true. 
8 = Probability of type II error 
= Probability of accepting Hy when Hp is false. 
Symbolically: 
P (x € WIHo)=a, where X = (21, X2, ..-5 Xp) 
2s i Lo dx = 0 (16-1) 
where Ly is the likelihood function of the sample observations under Hp and jdx 
represents the n-fold integral 


[Ju [dey dg ... dX. 


Again 
Pixe WIH,)=B8 
= [ tdx=B ...(16-2) 
Ww 
where L, is the likelihood function of the sample observations under H,. Since 
fe dx+ [Ly dx =1, 
we get 
Jali de=1- J Lydx = 1-8 (16-24) 
= P(xe W1H,)=1-6 .» (16-26) 


16°2°6. Level of. Significance. a, thé probability of type 1 error, is 
known as the level of significance of the test. It is also called the size of the 
critical region. 

16:2-7. Power of the Test. 1-8, defined in (16-2a) and (16- 2b) is 
called the power function of the test hypothesis Hy against the alternative 
hypothesis H,. The value of the power function at a parameter point is called 
the power of the test at that point. 

Remarks 1, In quality control terminology, a and B are termed as 
producer's risk and consumer’ s risk, respectively. ‘ 

2. An ideal test would be the one which properly keeps under control both 
the types of errors. But since the commission of an error of either type is a 
random variable, equivalently an ideal test should minimise the probability of 
both the types of errors, viz., a and §. But unfortunately, for a fixed sample size 
n, a and B are so related (like producer’s and consumer's risk in sampling 
inspection plans), that the reduction in one results in an increase in the other. 
Consequently, the simultaneous minimising of both the errors is not possible. 
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Since type I error is deemed to be more serious than the type Il error (c.f. 
Remark 3, § 16-2-3) the usual practice is to control @ at a predetermined low 
level and subject to this constraint on the probabilities of type I error, choose a 
test which minimises B or maximises the power function 1 — B..Generally, we 
choose a = 0-05 or 0-01. 

16-3. Steps in Solving Testing of Hypothesis Problem. The 
major steps involved in the solution of a ‘testing of hypothesis’ problem may 
be outlined as follows : 

1. Explicit knowledge of the nature of the population distr. ::ion and the 
parameter(s) of interest, i.e., the parameter(s) about which the hypotheses are 
set up. 

2. Setting up of the null hypothesis Ho and the altcrnative hypothesis H, 
in terms of the range of the parameter values each one embodies. 

3. The choice of a suitable statistic ¢ = ¢ (4, X2, ..... ,X,) Called the tesz 
statistic, which will best reflect upon the probability of Ho and H;. 


4, Partitioning the set of possible values of the test statistic ¢ into two 


disjoint sets W (called the rejection region or critical region) and W (called the 
acceptance region) and framing the following test : 
(@) Reject Ho (i.e., accept H,) if the value of ¢ falls in W. 


(ii) Accept Hp if the value of ¢ falls in W. 

5. After framing the above test, obtain experimental sample observations, 
compute the appropriate test statistic and take action accordingly. 

16-4. Optimum Test Under Different Situations. The discus- 
sion in § 16-3 and Remark 2, § 16-2-6 enables us to obtain the so called best 
test under different situations. In any testing problem the first two steps, viz., 
the form of the population distribution, the parameter(s) of interest and the fram- 
ing Cf Ho and fH, should be:obvious from the description of the problem. The 
most crucial step is the choice of the ‘best test, i.e:, the best statistic ‘t' and the 
critical region W where by best test we mean one which in addition to con- 
trolling a at any desired low level has the minimum type II error B.or maximum 
power | — 8, compared to B of all other tests having this a: This leads to the 
following definition. 

16-4-1. Most Powerful Test (MP Test). Let us consider the prob- 
lem of testing a simple hypothesis 

Ho : @= Oo 
against 2 simple alternative hypothesis 
H 1: = 0, 

Definition. The critical region W is the most powerful (MP) critical 

region of size a (and the corresponding test a most powerful test of level a) for 
wsting Ho: O= O against H,:0=9, if’ . 
P(xxeEeW!h) = J, £0 dx = 0 w.. (16-3) 

and P(x eWIlh) =P (x EW, Nh) . (16-34) 

for every other critical region W, satisfying (16-3). | 
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16-4-2. Uniformly Most Powerful Test (UMP Test). Let us 
now take up the case of testing a simple null hypothesis against a composite 
alternative hypothesis, ¢.g., of testing 

Ho :6= Oo 
against the alternative 
H,:9#495 

In such a case, for a predetermined a, the best test for Ho is called the 
uniformly most powerful test of level a. 

Definition. The region W is called uniformly most powerful (UMP) | 
critical region of size a [and the corresponding test as uniformly most powerful 
(UMP) test of level a)-for testing Hp: 8 = 9 against H, : 0 # Qo i.e., 
Hy: 6 = 6; #9 if 


P(x eWlH)) = Jy Lode =o ...(16-4) 
and P(x EeW1lH,) = P(x é€ W, |H;) for all 6 # Qo, .-.(16-4a) 
whatever the region W, satisfying (16-4) may be. 


16-5. Neyman J. and Pearson, E.S. Lemma. This Lemma pro- 
vides the most powerful test of simple hypothesis against a simple alternative 
hypothesis. The theorem, known as Neyman-Pearson Lemma, will be proved 
for density function f{x, 9) of a single continuous variate and a single parameter. 
However, by regarding x and 6 as vectors, the proof can be easily generalised for 
any number of random variables x;, x2 ,..., X, and any number of parameters 0), 
Qo, .... 0. The variables x;, x9, ...., X, occurring in this theorem are understood 
to represent a random sample of size n from the population whose density 
function is f(x,8). The lemma is concerned with a simple hypothesis 
Ho : 9 = 9 and a simple alternative H, : 0 = @;. 

Theorem 16-1. (Neyman-Pearson Lemma). Let k > 0, be a constant and 
W be a critical region of size a such that 


W = xe S: fa.) , > kT 


Six, 99) 
L, ] 
= W= 4yxEeS: 7 >k --- (16- 
Lo J (16-5) 
and W = AL eg 
— >. ¢ E S o Lo < k eee t0:34) 


where Lo and L; are the likelihood functions of the sample observations 
¥ = (X}, X2y,...,Xq) under Ho and H, respectively. Then W is the most powerful 
critical region of the test hypothesis Hp : @ = Qo against the alternative 
H,: @= 0), 


Proof. We are given 


P(x € WIHy) =] Lodx =a ... (16-6) 
Ww 
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The power of the region is 
P (x € WIH,)= J L,dx=1-B, (say). ... (166a) 


Ww 
In order to establish the lemma, we have to prove that there exists no Other 
critical region, of size less than or equal to a, which is more powerful than W, 
Let W, be another critical region of size a, <a and power 1 — B) so that w 
have ' 


Px € WlHo) =] Lo dx =a sae (16:7) 


and P(x e W1H,)=J Ly dx=1-B, --- (16-7a) 


Now we have to prove that]-B21 — B, 


Let W=AUC and W; =BUC 
(C may be empty, i.e., W and W, may be disjoint). 
If a, <a, we have 


Jy, Lodx S Jy Lo dx 


= aglode 5 Iyyclo dx 

=> Jp bo ax < [, Lo dx 

- J, todx 2 f, Lodx ..- (168) 
Since A c W, 

(165) => [,L,dx>K J, Lodx 2k J, Lodx .. (16-84) 


(Using (16:8)} 
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Also (16-5 (a)} implies 
- <kVxeEW 


This result also holds for any subset of W, say W AW, =B. Hence 
Jp Li dx SkJ, Lo dx <J, Ly, dx [From(168a)) 


Adding [ o Li dx to both sides, we get, 
Sw, L, dx Ss las L, ax 


=> 1-B 21-6, 

Hence the Lemma. 

Remark. Let W defined in (16-5) of the above théorem be the most 
powerful critical region of size o for testing Ho : © = 9» against H, : 6 = 0,, and 
let it be inde endent of 6, € ©, =@ — Go, where Go is the parameter space 
under Ho. Then we say that C.R. W is the UMP CR of size a@ for testing 
Ho: 9 = Qo, against H, : 6 € @). 

16-5-1. Unbiased Test and Unbiased Critical Region. Let us 
consider the testing of Ho : © = Qo against H, : @ = 0. The critical region W and 
consequently the test based on it is said to be unbiased if the power of the test 
exceeds the size of the critical region, i.e., if 


Power of the test 2 Size of the C.R. .. (16.9). 
= 1-B 2a 
=> Pe, (W) 2 Pe(W) 
=> P{x:xe WIH,) 2 P[x :x € WIA] .- (16-9a) 
In other words, the critical region W is said to be unbiased if 
Po(W) = Pa (W),V O# 6) € @ .- (16-965) 


Theorem 16-2. Every most powerful (MP) or uniformly most powerful 
(UMP) critical region (CR) is necessarily unbiased. 

(i) Jf W be an MPCR of size @ for testing Ho : 6 = 6 against H, : = 8;, 

len it is necessarily unbiased. 

(ii) Similarly if W be UMPCR of size a for testing Ho: 0 = 9 against 
H,: 0€ ©), then itis also unbiased. 

Proof. Since W is an MPCR of size @ for testing Hy : 0'= > against 
H,: @=0,, by Neymari-Pearson Lemma, we have ; for V k > 0, 

= (x 7 L (x, 01) 2K L (x, 80} = {x : Ly 2k Lo} 

and W’= {x: L (x, 8,) <kL (x, 9)} = {x :L, <k Lo), 

where & is determined so that the size of the test is a 1.€., 
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Pa(W) =P [xe WIHol = J, Lo dx =a a (9) 
To prove that W is unbiased, we have to show that : 

PowerofW2a_i.e., Po, (W)2a -- (if) 
We have : 


Po (W)= fy Lidx 2 kf, Lodx =a 


[-- On W,L, 2k Lo and Using (| 
ie., Po (W)2ka, Vk>0 A (179) 
Also 
1-Po (W) =1-P(xe WIA) =P xe WIM) 


7 Sw: Ly dx 
<k ff. Lodx =k P (x: xe W’lHo) 
(.-On W’,L, <k Lo} 
=k[1-P (x:xe WIHA))] 
=k (1-a) [Using (1) 
ie, 1-Pg (W) <k(1-a), Vk>0 .. (iv) 
Case (i) k2 1. If & = 1, then from (iii), we get 
Po (WW) 2ka2a 
=> W is unbiased CR. 
Case (ii) O<k <I. If0<k <1, then from (iv), we get: 
1 ~ Po, (W) <l-a 
=> Pe, (WW) >a 
= W is unbiased C.R. 
Hence MP critical region is unbiased. 
(ii) If W is UMPCR of size o then also the above proof holds if for 6, we 
write 8 such that 8 € @,. So we have 
Po (W)>a,V 8e 8, 
=> W is unbiased CR. 
16-5-2. Optimum Regions and Sufficient Statistics. Let X;, X2,.... X- 
be a random sample of size n from a population with p.m.f. or p.d.f. f(x, 9), 


where the parameter 8 may be a vector: Let T be a sufficient statistic for @. 
Then by Factorization Theorem, 


L (x, 8) = i f (in 9) = 86 (¢ (x): A(x) «2 (*) 
where go («(x)) is the marginal distribution of the statisitc T = ¢(x). 
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By Neyman-Pearson Lemma, the MPCR for testing Ho : 8 = 99 against 
H, : 8 = 80, is given by: 
W ={x: L(x, 0) DAL (x, 0)}, V &> 0 st*) 
From (*) and (**), we get 
W = (x: go, (F(x) -A)Zk. gq, (tx). A(x)}, Vk >0 
= (x: go, (QO) ZK. 8a ((x)}, VE>0 
Hence if T = ¢(x) is sufficient statistic for 6 then the MPCR for the test 
may be defined in terms of the marginal distribution of T = «(x), rather than the 
"joint distribution of X;, X2, ..., X,. 
Example 16-1. Given the frequency function : 


fx, 0) = ‘. 0<x<0 


= 0, elsewhere 
and that you are testing the null hypothesis Ho: @ = 1 against H, : @ = 2, by 
means of a single observed value of x. What would be the sizes of the type 1 and 
type II errors, if you choose the interval (i) 0-5 <x, (ii) 1 $x $1-5 as the 
critical regions ? Also obtain the power function of the test. 
[Gauhaii Univ, B.Sc. 1993; Calcutta Univ. B.Sc. (Maths Hons.), 1987) 


Solution. Here we want to test 
Hy: 8 = 1, against H, : 6 = 2. 
(i) Here W = (x: 0-5 <x} = (x: x 20-5} 


and W =(x:x<0-5) 
a =P(xe WIHo} = P(x20-516=1) 
=P(0-5<x<0160=1} =P(05<x<116=1) 
1 , 1 
mr fi. [ Ax, 0) Joni de a fi. 1. dx = 0-5 
Similarly, 
B =P (xe WIH,) =P (x< 0-516 =2) 
0-5 OS | 
=|, [Axo lord =|) 2 dx=0-25 
Thus the sizes of type I and type II errors are respectively 
a =0Sand B =0-25 
and power function of the test = 1 - B = 0-75 
Gi) W = (x: 1<x <1-5) 
15 
a =P (xe Wl0=1)}= fi [ Ax, 0) Jon de =0, 
since under Hy : 6 = 1, f(x, 8) = 0, for 1 <x < 1:5. 
B =P (xe WI0=2) =1-P (xe W180 =2} 
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x|'5 
2 |, = 0-75 


1-5 
-1-[ [flx, ©) Jon2dx = 1- 


Power Function = 1 — B = 1 — 0-75 = 0-25 
Example 16-2. If x 21, is the critical region for testing Hp :@=2 


against the alternative @ =1, on the basis of the single observation from the 
population, 


fix, 0) = @exp (Ox), 0 <x < ~, 
obtain the values of type I and type II errors. 


[Poona Univ. M.CA. 1993; Allahabad Univ. B.Sc., 1993; 
Dethi Univ. B.Se (Stat. Hons.), 1988] 


Solution. Here W = {x:x21) and W= {x:x< 1). 
and Ho: 8 =2, H,:0=1 

= Size of Type I error 

=P [xe WI-Ho] = Pix 2110 = 2] 


5 f [Ax, 0) lon2 dx 


wo |? emacea| S[) 
1 


ze~t= Le? 
B = Size of type Il error 


= Pixe WIM) = P(x<116=1) 


I. 1 
= e*dy=| — 

0 -l|o 
es (1 ~e-) = a l 

Example 16-3. Let p be the probability that a coin will fall head in a 


‘ ; 1 F a 
single toss in order to test Hg : p= > against H, : p= 7 The coin is tossed 5 


times and Ho is rejected if more than 3 heads are obtained. Find the probability 
of type I error and power of the test. 
Solution. Here 


Ho: Pp => and Hy: p=*, 


If the r.v. X denotes the number of heads in n tosses of a coin then 
X ~ B(n, p) so that 


P(X =x) = [r)¢-or 
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5 
(2) (1 -p)°-*, .(*)P 
since n = 5, (given). The critical region is given by 


W =(x:x24) => W = (x:x9S3) 
a... = Probability of type I error 
=P (X241Hol 


~ Pix =4ip=}] + PIX =51p=4) 
=[r@et (5) (2)5 poe 
= 5(4)5+(2)5=6(3)° 


“16 
8 = Probability of Type II error 


=Pixe WIH,J=1-P [xe WIM] 
=1-[PX =4|p=3)+P(X=5lp="] 


f(.}aro (Je 


3,4, 5 3 
=1-() (4+; 


-, 81,47 
128 128 
“. Power of the test is 
81 
1-B =)58 


Example 16-4. Let X ~ N(pt, 4), up unknown. To test Ho: p = -I 
against H, : t= 1, based on a sample of size 10 from this population, we use 
the critical region x, + 2x2 +... + 10X:9 2.0. What is its size ? What is the 
power of the test ? 

Solution. Critical Region W = (x: x; + 2x2 +... + 10x,9 20). 

Let U=x, + 2x, + ... + 10xi9 

Since x,’s are i.i.d. N(p, 4), 

U~N[(1+2+... + 10) p, (12 € 22+... + 10%) o?] = N (551, 38507) 


=> U-~N(S55p, 385 x 4) =N(S5p, 1540) ..(*) 
The size ‘a’ of the critical region is given by : 
a=P (xe WI Mo) =P(U 201 Hp) .(**) 


Under Ho: jp =-1, U ~ N(-S5, 1540) 
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sd 7 ae YEW) _ U+55 
\ Su 1540 
55 " 55 


Under Ho, when U =0, Z= ea = 39.2428 = a 
a =P (Z 21-4015) [From (**)] 
= 0:5 -P(0 <Z < 1-4015) 
= 0-5 —0-4192 (From Normal Probability Tables) 


= 0-0808 
Alternatively, a = 1 —P(Z < 1-4015) = 1 —®(1- 4015), 
where ® (-) is the distribution function of standard normal variate. 
Power of the test is given by : 
1~B =P(xe WIH,;)=P(U201M)) 
Under H,: =1, U~ N(5S5, 1540) 


= Z =O) =— 1-40 (when U =0) 
y 1 


1~B =P(Z2- 1-40) 
=P-1-4<¢Z<0)+05 
=POSZ< 1-4) +:0-5 ‘(By symmetry) 
= (-4192 + 0-5 . 
= 0.9192 

Alternatively, 
1~B =1-P(Z <-1-40) = 1-9 (- 1-40). 
Example 16-5. Let X have a p.d/f. of the form : 


fix, 9) afer: 0<x< o, @>0 


= 0, elsewhere. 
To test Hp: @ = 2, against H,; : @= 1. use the random sample x), Xz of size 
2 and define a critical region : 
W =. {(x:, X2) £95 SX; + Xz} 
Find: (i) Power of the test. 
(ii) Significance level of the test. 
Solution. We are given the critical region : 
W = {(x1, xq) 2 9-5 Sx, +22} = ((x, x2) 22, + x2 29-5} 
Size of the critical region i.e., the significance level of the test is given by : 
a=P(xe W | Ho) = P[x, + x2 29-5 | Ho] -.(*) 
In sampling fromthe given exponential distribution, 


é z Xi~ X20 * (c.f. Example 16-8] 


=> Us 2 (X1 +X2) ~ Xa, (n = 2). 
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a= P [; (x; + x9) 2 2 x 9-5 1 Ho | [Form (*)] 
=P [x74 2 9.5] (.- Under Ho, 6= 2) 
= a. = 0-05 [From Probability Tables of ?-distribution] 


Power of the test is given by 
1-B = Pee WIM) = P(x, +22 29-5 1H;) 


2 2 
=P a +X) 2 9 * 9-5 | Hy] 
= P [xe 2 19] (-- Under H,, 6= 1) 


Example 16:6. Use the Neyman-Pearson Lemma to obtain the best 
critical region for testing @ = 8 against 8 = 8; > @) and @ = 8, <', in the case 
of a normal population N(@, 02), where o? is known. Hence find the power of 


the test. 
[Delhi Univ. B.Sc. (Stat. Hons), 1986; Gujarat Univ. B.Se. 1992] 


Solution. 
Ls en 0) = ee --L r (x; - 6)? 
int oV2n 20? j= 1 


Using Neyman-Pearson Lemma, the best critical region (B.C.R.) is given 
by (for k > 0) 


1 A . 
exp - 402 2 (x;- 6,)? 


| 


= : 2k 
exp ; ach ,2, i 0 
= co] = 13, (x; - 0,)2 - 2, (x; - 20" 2k 


1 A 
= on 53 (01? - 00) + <5 (81 - 9) 2 | >k 


n 1 . 
= ~~, (0,2 - O52) + (0, - 0 x; 2logk 
292 (81° ~ 80° ) oe (81 o) 2 g 
(since log x is an increasing function of x) 
os o* 6,7 — Oo 
=> Xx(0; —8o) 2 2 Bk Tt 


Case (i) If 8, > @, the B.C.R. is determined ‘by the relation (right-tailed 
test) : 
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5%. log k , at Qo 


. n 6; = O* 2 
i.é., xX >dj,, (say). 
-- BCR is W ={x:X>A,)} { (16-10) 
Case (ii) If 8, < 9, the B.C.R. is given by the relation (left tailed test) 
2 logk 9 
<> 06," + 14S @2,, (say) 
Hence B.C.R. is W, = {x:x <A,} ... (16-11) 


The constants A, and A, are so chosen as to make the probability of each of 
the relations (16-10) and (16-11) equal to « when the hypothesis Hp is true. The 
6? 
sampling distribution of x, when H; is true is N | 9; ,, v » G@ = 0, 1). 
Therefore the constants A, and A, are determined from the dane 
PL x>A,1Ho] =. and a { 
| 


P(x >i, | Ho) = P[z> “AW |= =a;Z~N(0, 1) 
o 
A, — 9% 


oO 
=> = 7, => A, =0),+-=2z »»-(16-12) 
oNn ai ie 


where z,, is the upper a@-point of the standard normal variate given by 
P(Z>zq) =a .-(*) 
Also P(X <A,1 Hp) =a => P(x 2A, (Hp) =1-4 


= ieee Ne) “178 => Az = Bo _ 


o/Nn oNn ia 
6 
= Me = 059 + = 2) mip .--(16-12@) 
Vn 


Note. By symmetry of normal distribution, we have z, _¢ =— 2. 
Power of the test. By definition, the power of the tést in case (é) is : 
1 -f = P[xe WIA] = P[x 2h lH] 


A, -98 x-0 
=?|2 2 aa [... Under H;,Z= —_ ~ NO, 1)] 


on oNn 
0) + ie -0, 
=P|z > ——2*——_ [Using (16-12)] 
G/N : ' 


= Plz 2 27q-- | (- 6, > Qo) 


Statistical Inference - II ( Testing of Hypothesis ) 16-17 


6,-90 
=1-P(Z<A,) E = 2g - re »( sy) 
=1-%(A,), ...(16-13) 


where © (.) is the distribution function of standard normal variate. 
Similarly in case (ii), (0; < 9%), the power of the test is 


A. - 9 
poe epee (2 < ae 


Le) 


=P|Z< <a [Using (16-12a)] 
re) 
-P|z <21-q+ oe («Qo > 61) 
= (Ay), ...(16-13a) 
where Dy yg + WE) Ya Oo- 8), 16.138) 


UMP Critical Region. (16-10) provides best critical region for testing 
Ho : 8 = 9 against the hypothesis, H, : 6 = 6,, provided 6, > 69 while (16-11) 
defines the best critical region for testing Ho: 8 = 09 against H, : 8 = 6;, 
provided 6, < 69. Thus the best.critical region for testing simple hypothesis 
Ho : 8 = 8 against the simple hypothesis, H, : 6 = 9, + c,-c >-0, will not serve 
as the best critical region for testing simple hypothesis Ho : 8 = 99 against 
simple alternative hypothesis H, : 8 = @9—c,c > 0. 

Hence in this problem, no uniformly most powerful test exists for testing 
the simple hypothesis, Ho : © = 6p against the composite alternative hypothesis, 
H 1° 6 # Qo. 

However, for each alternative hypothesis, H, : 6 = 6, > 89 or 
H,: 0 =8, < 80, a UMP test exists and is given by (16-10) and (16-11) 
respectively. 

Remark. In "particular, if we take n = 2, then the B.C.R. for testing 
Ho: 8 = 8p, against H, : 8 = 8, (> 89) is given by : [From (16-10) and (16-12)] 


W = (x: (x, +x,)/2 20+ 02,/V2} [X= (x, 4+2,)/2) 
= (xix, +x, 2209 + V2 0 z,)} 
= {x 7X1 +%X%2] C}, (say),. Or aad | 
where C = 209+ V2.6 24 = 20) + V2 ox 1-645, if « = 0-05. 


Similarly, the B.C.R. for testing. Ho : © = Oy against H, : 8 = 0, (< 9p) with 
n= 2 and a = 0-05 is given by [From (16-11) and (16-12a)] : 
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W, = {x1 (x, +.x)/2 $ 0 -02,/ V2} 


= {x : (x, +x) $ 20) - V2 0x 1445} 
= {x:x, +x. <C,} , (say), oo (***) 


where —s- C,_-« = 20) -V¥ 2.024 = 20) - V2 0x 1-645. 
The B.C.R. for testing Ho : 8 = Qo against the two tailed alternative 
H, : 8 = 9, Qo), is given by : 
= {x: (x, +2. 2C)U (x4, +242 $C))} woo #¥**) 
The regions in (**), (***), and (****) are given by the shaded portions in 
in following figures (i), (i) and (iii) respectively. 


x2 
J 
iL 
Dy . 
Y 
Y4 
Ws 
L 4 Vas 
0 —Y xy 0 x, 
Fig. (i) Fig. (i) 7 Fig. (itt) 7 
BCR }| H,:8=8, BCR, H,:8=, BCR} Hy: 8 =, 
for "H,:@=0,(>0,) for f§ ‘H,:@=0;(<@,) for J “H,:@=6, (#@,) 


Example 16-7. Show that for the normal distribution with zero mean 
and: variance o”, the best critical region for Ho : 0 = Og against the alternative 
H, : 0 = 0; is of the form: 


a 
Dx? Saq,for 09> 0; 
itawl] 


and 2d x7 2b,,for o9< 6; 


i=l 
Show that the power of the best critical region when Oo > O; is 


rp (% 


5° X7a.n } where x’7,,, is lower 100 a-per cent point and F(-) is the 
1 


distribution function of the z- distribution with n degrees of freedom. 
Solution. Here we are given : 


1 x 
f(%, 6) = — =e (- agi i SE <0, > 0, 
2 20 


The best criucal region (B.C.R.), according to. Neyman-Pearson Teena: 1S 
given by (for k, > 0) 


L 
<0 < a Aa, (say) 
a 
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1) 1s 12 (F1°-9 
=> nos (Z J - 2, ex Slog Ag 
(since log x is an increasing function of x). 
Go? - 0)” -6,? < O1 * 
=> 26,2 6,2 2%, x; <(tos A,-—n log ( ) geal) 


Case (i). If 6; <p, then B.C.R. is given ee [From (*)] 
7 x7 <(toe Ag -n log = Nes 2o¢ 5 = dq’, (Say). 


So tS 


1.é., W=fx : Ex Sag}. foro, < op .-.(16-14) 


Case (ii). If 6; > do, then B.C R. is given by [From (*)] 
2 x;? > og Ag -—n log E )} a (say). 
ia 0 


So" —- 6) 


A 
i.e., Wy, =x : 2 x;7 2 bat for 6, > So ...(16-14a) 
The constants a, and bg are so chosen that the size of the critical region 
iS Ol. ‘ 
Thus a, is determined so that P[xée W|Ho] =a 


an 
=> P| 1S 0g Ho | =@ 
i=l 
3 Pp} Ss <1] =0 1%) 
i= 1 Oe Oo” 
Since under Ho, 


n y2 
Xm? = 2 qr is ax?-variate with n df, 


SX2nn = O80 Xran=e ...(16-15) 


where 12,, , is the lower 100 a-per cent point of chi-square distubution with 
ndf. given by 
P (x? $X7a,n) =O ..-(16-15a) 
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Hence the B.C.R. for testing Hy: 0 = 69 against H; : 0 = 0; (< Go), ie 
given by [From (16-14) and (16-15)] : 


W= ‘ : 2 x;? S$.” ee n ---(16-15b) 


where x,, , is defined in (16-15a). 
Also by definition, the power of the test is : 


a 
1-8 =P[xe winl=P| nS ag Hs | 
i=l 


a x;? & »y x;? 

is ial 

=P < l\H,|=P <y7*. ,!H 
Gy? 2 1 Gye Xa, 1 
> x; 

= pi iz < Fo" \H 


Gy 
= P| x Ss 2 to 7 ’ 
since under H, Yx?/o,2.is a y?-variate with n df. 
2 
Hence, power of the test = F Gy ey ara ..(16-15c) 
1 


where F(-) is the distribution function of chi-square distribution with n d.f. 
Remarks 1. Similarly, for testing Hp : 6 = Go against //, : 0 = 0; (> Op), 
bg in (16-144) is determined so that : 


P[xe W,IM| =@ 


n 
=> a : Ex? 2 ba 1H | =O 
2 

= P| Ta 2 Bai Ho] =o 

2 > So 

ba 
=> P|. nw <3] =1-0 
b 

=> ae = 171 -aa => bg = Oo? .X%71-aan .- (16-16) 


where x2, , is defined in (16-15a). 
Hence the B.C.R. for testing Hp : O = Oo against H, : G = 0, (> Oo), Is 
given by : 


Statistical Inference - II ( Testing of Hypothesis ) 16.21 


W, a fe 265" eh nae .--(16-16a) 
sal 


The power of the test in this case is given by 


n 
1-6 =P(xe winy=P| Sx2 zo zh aa! th | 
1 


i= 


x? 
=P imi | oo? 2 \H 
2 
=f a2 2 ei Me ase | ; | ...(16-16b) 


a 
since under H,, ¥x;7/6,? isa y2-variate with n df. 
i=l 


2 
= cs %71 -G,a |> :-.€16°16c) 


where F(.) is the distribution function of chi-square distribution with n d.f. 
2. Graphical representation of the B.C.R. for the particular case n = 2. 
For n = 2, the B.C.R: for testing //) : 6 = Go, against //, : 6 = GO; (< Go) ‘is 
‘given by (From ,16-158)] 
2 
W = fc > Ex? <$ Go? . 7a. } 
j= 


| 
= {x:x,2+x,2sa?}, 


where a? = Go*y7q, 2. Thus the B.C.R. is the interior of the circle with centre 


(0, 0) and radius ‘a’ and is shown as the shaded region in Figure (i) on page 
16-22. 


Similarly, from (16-16a), the B.C.R. for testing Hy): 0 =o, against 
H,;: 6 = 0, (> Oo) for n = 2 is given by: 
WwW, = {x 2X12 + X_? > Oo? x7; seca = {x X12 + Xp? > b?} 
where b? = Oo? . x7; a 2; Thus, B.C.R. is the exterior of the circle with centre 


(0, 0) and radius b and is shown as the shaded region in Figure (i) on page 
16-22. 


Similarly the B.C.R. for testing Ho: 0 = Go against the two-tailed 
alternative H, : = 6, (#0), for n = 2 is given by : 


W, =W,UW, 


= {x : x12 + x22 < a?} U {x 2x)? + x92 > 5?) 
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and is shown as the shaded region in the Figure (iii) below. 


Fig. (i) Fig. (ii) Fig (iii) 

3. (16-14) defines an UMP test for testing simple hypothesis Ho : 6 = q 
against simple alternative hypothesis H, : 6 = 6, (< Oo) whereas (16-14a) 
defines an UMP est for testing simple hypothesis Ho : 6 = Go against the 
simple alternative hypothesis H; : 6 = 0; (>). However no UMP test exists 
for testing simple hypothesis Ho : 6 = do against the composite alternative 
hypothesis H, : 6 # Go. 

Example 16-8. Given a random sample X,,X», ....X, from the distri- 
bution with p.df. f(x, 9) =6 e-% ,x>0 
show that there exists no UMP test for testing 

Ho: 8 = Oo against H, : 8 # 8p. 
[Delhi Univ. B.Sc. (Stat. Hons.), 1988; Gorakhpur Univ. B.Sc., 1993] 


Solution. L= UT Axis 6) = 6" . exp - ®t ni 
iu isl 


Consider H,:9=60,, (0, # 99). 
The best critical region, using Neyman-Perason Lemma is given by : 
0," exp (- 6, Lx] 2 k.-Oo" exp [- % Lx];k >0 


ee {9 
= exp [(8o- 0) 2 x;] 2k. 8, 
= (Q99-6,) Xx; Z2log [k. a y| = k;, (say). 
(*) : | 


‘Case (i) If 0; > 6, then B.C.R. is giyen’by [From. (*)] 


» k . 
Lx; < 1 — =A, (say). 
Oy -— 0; 
Case (ii) If 8; < Qo, then B.C.R. is given by [From (*)] 
) k qY 
> x; 2 =A, (say). 
Oy) — 9, 
The constants A, and A, are so determined that 
PIXx; $4, 1A] =Q and Pl Xx; 24,1 Ho =a 
=> P[20Yx, < 20A, lio] =a |> P[20 Dx; 2>201,1H>o] =a 
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But in random sampling from the given exponential distribution, 


My, = TM, O=[M, (0) 


t a 
1 -§ 
=> Moos. (‘O = Myy. (2:8) = (1 - 20)", 


which is the m.gf. of a y?-variate with 2n. df. Hence by uniqueness theorem 
of m.g/f.'s, 


a 
20 ; » X ia X20) 


Using this result in (**) 
P[209 Lx; Stu) =PIX7en) Sti] =a 


= Hi =X71-ar 2" 
where x2, , is the upper ‘a’ point of 2-distribution with n.df. given by 
PQ? > X7a,n) =O ..@ 


Hence B.C.R. for testing Ho : 6 = 89 against H, : 8 = 6, (> Qo) is given by 
Wo = {x : 200%; <7; goon} 
l 
= {x a 7, $59, X71 -a, 201 


and since it is independent of 6,, Wo is U.M.P.C.R. for Hy: 8 = Qo against ° 
H, :6= 0, (> Qo). 


Similarly from (***), we get 
P[20o2%; 22] = Plx2en) 2 M2] = 
= He =X7a,2" 


Hence B.C.R. for testing Ho : 8 = Gp against H, : 8 = 0, (< 9p) is given by : 
W, = (x: 209%; 2775, 25] 


] 
= {x ; 2x; 2 99, Xa.2) 
and since itis independent of 8,, W, is also UPM C.R. for Ho : 9 = 9p against 
A, 4. 0; (< Oo). : 
owever, since the two critical regions Wo and W, are different, there 


exists no critical region of size @ which is U.M.P. for Ho : 6 = 9» against the 
two tailed alternative, H, : 6 # 9. 
Power of the test. The power of the test for testing Hy : 9 = 99, against 
H, : © = 0, (> 9p) is given by 
1-B =P{xe WolAj] 


= 1 
=P| 2 £1 SCI 0201 | 


i=l 
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=P|20, Eis Bata Ms | 
is 1 hy 


0, 
=P Ha = @* - &, (2) | -..(*) 


A 
since under H,, 20, 2 x;~ X72) - 
i=l 


Similarly the power of the test for testing Ho: © = Qo, against H; :0=6, 
(< 60) is given by : 
1-8 =P(xe W,!1H,) 


pl > 12 
P| Ee ahe can | 
yy > PL? 
=P)20, 2 x,2 2 Xa, ne! Hy 
. is] QB 


6 
=P ras 2 @, Xe. » | ssl **) 


Remark. The graphic representation of the B.C.R. for Ho : © = 9o agains 
different alternatives H, : © = 6, (> 0,), H, : © = 6, (< 99) and H, : © = 0, (# ) 
for n = 2, can be done similarly as in Example 16-6, for the mean of norma 
distribution. 


Example 16-9. For the distribution : 


i aia Y)} dx, x2y 
4F=10 2e7 


aan that for a hypothesis Ho that B = Bo, y= ¥% and an alternative H, that 
B= B). Y= nh, the best critical region is the region given by 


2 I ; I Bi 
S—— - —- ~log k + log “> 
x B = Bo {nbs YoBo pe nes B 
provided that the admissible hypothesis is restricted by the condition 


¢ 


Y SY, B: 2Bo (Gauhati Univ. M.Sc., 1992) 
Solution. f(x;B,y) =Bexp(-B@-Y), x27 
= 0, otherwise 


JU fGi , B, Y) = B* exp (-B 2 (x; ~Y¥)}; X1, X2, «0-5 Xn 2Y 


= 0, otherwise 
Using Neyman-Pearson Lemma, the B.C.R. for k > 0, is given by 
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Bi" exp 3, (x; - ¥1) | 


Bo" exp [Po 3 2, (x; - 10| 


2k 


Py 


> | 


> n log (Bi/Bo) — 2x (By - Bo) +.2Bini — 2BoYo 2 log & 
(since log x is an increasing function of x). 


= EB, Bo) {1Bs - YoBo - Liog & + log ‘% |] 


exp - 2, (xi - 11) + Bo x (xX; - 2) 2k 


J 


(f Jo [-B, a X —¥1) + Bo n( X —Y0) J =k 


=> x Sap (nb - YoBo - * 198 k + log ( ) 
provided B, > Bo. 


Example 16-10. Examine whether a best critical region exists for 
testing the null hypothesis Ho : 8 = O89 against the -alternative hypothesis 
H,:@=86,> 9p for the — 6 7 the distribution : 


fix, 9) = 


OPT ene 


[Bangalore Univ. B.Sc., 1992] 
: = 5 Se ] 
Solution. i f(%j, 9) = (1 + 8) aus G+ 6)? +6)? 
By a aaa — the B.C.R. is een by 


(1 + 6,)* q > k (1 + 0o)* TT 


a ——s 

1 (x; + 6)? 1 (x; + Oo)? 

=>.n log (1 + 6,) -2 2 log (x; + 9) 

> log k + n log (1 + 09) -2 2 log (x; + 9p) 


a 
xX; + 9o 1 + 69 
=> 2 2» log ie ator e+ mos (9 
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Rane ee, tee x; +9 
Thus the test criterion is > log ae , which cannot be put in the 
ful é | 


form of a function of the sample observations, not depending on the hypothesis. 
Hence no B.C.R. exists in this case. - 


EXERCISE 16(a) 


1. (a) What are simple and composite statistical hypotheses ? Give 
examples. Define null and alternative hypotheses. How is a statistical 
hypothesis tested ? © 

(5) Explain the following terms : 

(i) Errors of first and second kinds. 
(ii) The best critical region. 
(iit) Power function of a test. 
(iv) Level of significance. 
(v) Simple and composite hypotheses. 
(vt) Most powerful test. 
(vii) Uniformly most powerful test. j 

(c) Identify the composite hypotheses in the following, where 1 is the mean 
and o? is the variance of a distribution. 

@) Hy : us 0, e-=l1 - 


(ii) Hy : w= N, o = 0 
(ii) Ho: ws 0, o = abitary 
(iv) Hjp-: o% = O% ‘(agiven value), 1 arbitrary. 


(d) (i) Explain the concepts of Type I and Type II errors, with examples and 
bring out their importance in Neyman and-Pearson testing theory. 
2. “In every hypothesis testing, the two types of errors are always present.” 
If this is true then explain what is the use of hypothesis testing. 
[Delhi Univ. M:C:A., 1990] 
3. What is a statistical hypothesis ? Define (i) two types of errors, (ii) 
power of a test; with reference to testing of a hypothesis: Explain how the best 
critical region is determined. State clearly the theorem which is used to 
determine the best critical region for simple: hypothesis at a given significance 
level. 
(Caleutia Univ. B.Sc. (Maths. Hons.), 1992) 
4. Explain the concept of the most powerful tests and discuss how the 
Neyman-Pearson lemma enables us to obtain the most powerful critical region 
for testing a simple hypothesis. against a simple alternative. 
[Madras Univ. B.Se., 1988) 
‘3. What is meant by a statistical hypothesis ? Explain the concepts of type 
I and type I exrors. Show that a most powérful test is necessarily unbiased. 
[Delhi Univ. B:Se. (Stat. Hons.), 1992, 1985] 
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6. What are simple and composite statistical hypotheses ? State and prove 
Neyman-Pearson Fundamental Lemma for testing a'simple hypothesis against a 
simple alternative. [Delhi Univ. B.Sc. (Stat. Hons.), 1993, 1986) 

7. (a) Explain the basic concepts of statistical hypothesis. Discuss the 
problems associated with the testing of simple and composite hypotheses. Show 
that a most powerful 1 test is necessarily utibiased. 


(Delhi Univ. B.Sc. (Stat. Hons.), 1983) 

8. State Nevada Poarsea Lemma. 

Prove that if W is an MP region for testing Ho : 6 = Oo against H, : 6 = 6, 
then it is necessarily unbiased. Also prove that the same holds good if W is an 
UMP region. [Dethi Univ. B.Sc. (Stat. Hons.), 1982] 

9. (a) Let p denote the probability of getting a head when a given coin is 
tossed once. Suppose that the hypothesis Ho : p = 0-5 is rejected in favour of 
H, : p = 0-6 if 10 tials result in 7 or more heads. Calculate the probabilities of 
type / and type // errors. [Calcutta Univ. B.Sc. (Maths Hons.), 1989] 

(6) An um contains 6 marbles of which 6 are white and the others black. In 
order to test the null hypothesis Ho : © = 3, against thé alternative H, : 6 = 4, 
two marbles are drawn at random (without replacement) and Hp is rejected if both 
the marbles are white; otherwise Ho is accepted. Find the probabilities of 
committing type J and type // errors. 

If it is decided to reject Ho when both marbles are black and to accept it 
otherwise, find the probabilities of rejecting Ho (1) when Hg is true and (ii) when 
H, is true. Comment on your results. 

10. (a) p is the probability that a given die shows even number. To test 
Ho: p= ‘ against H, : p= ; , following procedure is adopted. Toss the die twice 
and accept Hp if both times it shows even number. Find the probabilities of type 
I and type I errors. 

(b) Let p be the probability that a! coin will fall head in a single toss. In 
order to test the hypothesis Ho : p = + the coin is tossed 6 times and the 
hypothesis Ho is rejected if more than 4 heads are, obtained. Find the probability 
of the error of first kind. If the alternative hypothesis is H, : p = 2 , find the 
probability of the error of second kind. 

(c) 7 a Bernoulli distribution with parameter p, Ho: p = : > against 


H, :p ==, is rejected if more than 3 heads are obtained out of 5 throws of a 
coin. Find the probabilities of Type I and Type II errors. 
Dethi Univ. B.Sc. (Stat. Hons.), 1987] 
11. (a) Let X,, X2, ...; Xp be a random sample from N(0, 25).If, for 
testing Hp : @ = 20, against H, : © = 26, the critical region W is defined by 


W = (x 1X > 23-266), 
then find the size of critical Tegion aid ‘the power: 
[Dethi Univ. BSc. (Stat. Hons.), 1987] 
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(b) Let X ~ N (yt, 4), p unknown. To test Ho : pb = —1 against H, : = 1, 
based on a sample of size 10 from this population we use the critical region 
X, + 2x2 +... + 10x19 20. What is its size ? What is the power of the test ? 

(c) A sample Of size 16 is drawn from a normal population with mean 2 
and standard deviation o for testing the hypothesis Hp : 1 = 0 = 1, against the 
alternative hypothesis 7, : 1. = 0 = 2. It is decided to reject the hypothesis Ho if 
the sample mean exceeds 1-5 and otherwise accept it. 

Calculate the probabilities of errors of. the first and second kind in this 


procedure. 


1 * 2 
Given | = “12 gy = 0.8413 and f ie etl? at = 0.9773 
= 513 2x 


12. (a) The hypothesis pp = 50, is réjected if the mean of a sample of size 
25 is either greater than 70-54 or less than 31-19. Assuming the distribution to 
be normal with s.d. 50, find the level of significance. Obtain the power 
function for the test and sketch the power curve with two values above 50 and 
two values below SO. 

(6) Calculate the size of the type II error if the type I error is chosen to be 
a = 0-16 if you are testing Ho: = 7 against H,:p = 6, for a normal 
distribuiion with o = 2, by means of a sample of size 25 and if the proper tail 
of the x? distribution is used as the critical region. 

13. (a) Given the frequency function : 


fix,0) =2, 0Sxs0 


= 0, elsewhere 
and that you are testing the hypathesis Ho: 9 = 1-5 against H, : ® = 2-5, by 
means of a single observed value of x, what would be the sizes of the type I and 
type II errors, if you choose the interval 0-8 < x, as the critical region ? Also 
obtain the power function of the test. 

(b) It is desired to test the hypothesis Hy: 6 = 0 against H, : 9 > 0, by 
observing a random variable X which is uniformly distributed on [9, 6 + 1). 
Given only one observation, sketch the power function of the test whose critical 
region is defined by (x > c}. What value of c would you choose ? 

Given n observations, derive the general formula of the power function of 
the test whose critical region is defined by : (at least one x is greater than c} 
and indicate how you would construct a confidence interval for 8. 

14. Let X have a p.d.f. of = form : 


f (x, 9) = 5 exp (-x /8),0<1< 20,0 >0 


ss : elsewhere. 
To test Hy: 8 = 2 against H, :6 = 1, use 4 random sample X,, X2 of size 2 and 
define a critical region C = { (x), x2) : 9-5 $x, +2). 
Find (i Power function of the test. 
(ii) Significance level of the test . 


Statistical Inference - II ( Testing of Hypothesis ) 16.29 


(b) Let X have a p.d.f. of the form : 
f (x30) =0x®-! ,0<x<1 
= Q, elsewhere 
To test the simple hypothesis Ho: 6 = 1 against the alternative simple 
hypothesis H, : 8 = 2, use a random sample X,, X2 of size 2 = 2 and Gefine the 
Critical region to be 


3 
C= ((x1, 42): GS m1 22) 


where x, x2 are the values assumed by a sample. Obtain the power function of 
the test. [Madras Univ. B.Sc., Stat-Main, 1901] 
Hint. ¥ =—- log X has an exponential distribution with parameter 6 i.e., 
Y~ y(@, 1). 
15. (a) Give a working rule of finding the best critical region for testing a 
simple hypothesis against a simple alternative. 
For a normal (m, 6?) population with known 6, construct a test for the 
null hypothesis Ho : m = mo against the alternative m > mo. 
[Calcutta Univ. B.Sc., (Maths Hons.), 1989] 
(b) Let (x,,%2, ..., X,) be a random sample from N (8, 0”), where o? is 
known. Obtain an UMP test for testing Ho : © = 9 against H, : 8 > Oo. Also 
find the power function of the test and examine if the test is unbiased. 
[Dethi Univ. B.Sc. (Stat. Hons.), 1986, 1982) 
16. (a) Obtain the most powerful test for testing the mean pt = [Wp against 
jl = $y, (J) > fo) When o? = 1 in normal population. 
(6) Obtain the most powerful test of size a for Ho : 2 =o against 
H,: =p, when pL; > po, if the probability density function of the random 
variable X is 


ff, Wa. exp | + (x - fae <X <0 


17. (a) Let x;, x2, ..., x, denote a random sample from the distribution 
that has p.d.f. 

1 
x, pp) =——— 
f @, pW) Voz 

It is desired to test Hy : pp = O against H, : p= 1. 

(b) Let X,, X2, ..., X,, denote a random sample from the normal distribution 
N(9, 1), 8 is unknown. Show that there is no uniformly most powerful test of 
the simple hypothesis Hy : 8 = 89, where 6 is a fixed number against the 
alternative composite hypothesis H, : 6 # Qo. 

18. Let x1, x2, ..., X, be a random sample from N(jt, 6), where pp is 
‘known, Obtain an UMP test for testing Ho : 8 = 99 against H, : 8 < @. Also 
find the power function of the test. [Delhi Univ. B.Sc. (Sat. Hons.), 1985) 


19. Define M.P. region and U.M.P. region. Show that an M.P. region is 
necessarily unbiased. 


ae eeeree 
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Obtain M.P. regions of size o for testing— 
(i) Ho: 6 = 4 against H, :6=6,, (8, > 9) 
(ii) Ho: 8 = 9 against H; : 8 =9,, (0, < 9) 
for N(u, 9), where 1 is known. 
Show that the tests in () and (ii) are U.M_LP. against one-sided alternative. 
[Delhi Univ. B.Sc. (Stat. Hons.), 1989] 
20. (a) Let x), xo, ..., X, be a random sample from a normal distribution 


N(0, o7). Show that there exists a uniformly most powerful test with 
significance level a for testing 


Hy: 0% = 6, against H, :02 < 6,2. 
If n = 15, @ = 0-05 and 6,2 = 3, determine the best critical region and the 
power function of the above test. {Gujarat Univ. B.Sc., Oct. 1993] 
(b) State Neyman and Pearson’s fundamental lemma.and apply it to obtain 


the test for testing o? = 1 against o? > 1, when the sample is from N(0, 07). Is 
this test UMP ? Is, it unbiased ? Give reasons. 


[Indian Civil Services (Main), 1990) 

21. A sample of size 25 is drawn from a normal population with unknown 
mean j and variance 16. It is required to test the hypothesis Ho : pp = 1-0 against 
the alternative H, : 1 = 3-0 at 5% level of significance, Obtain the most 
powerful test for testing Ho against H, and state how you will find its power. Is 
the test uniformly most powerful ? 

22. Explain the statistical procedure of testing the following hypothesis 
regarding the standard deviation (0) of normal population : 

Hy: 8 = Go 
H; :G=0;>Go 

Will the test criterion remain the same when 0; is changed to 0 # do ? 

23. State and prove Neyman-Pearson Lemma. If x 21 is the critical 
region for testing Ho : 6 = 2 against the alternative , : 6 = 1, on the basis of a 
single observation from the population 

Sx, 9) = 06 -%,0s5x< 0,8 >0, 
obtain the values of type I and type II errors and the power function of the test. 
[Delhi Univ. B.Sc. (Stat. Hons.), 1988] 

(6) Given a random sample X;, Xo, ..., X, of size a from the distribution 

with p.d.f. 
Kix, 9) = 8 e -% 5x >0,0<0<x, 
show that UMP test for testing Ho : 8 = Oo against H, : 8 < Qp is given by 


[Dethi Univ. B.Sc. (Stat, Hons.), 1988} 


(c) Explain the Neyman-Pearson Lemma for finding the best critical region 
for testing a simplé hypothesis about the parameter 9 of the density function 
fix, ©). Illustrate your answer by constructing the best critical region for 
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testing, Ho: 9 = Oo against H, : © = 6, < 99, where 9 is the parameter of the 
distribution with p.d.f., 
fix, 9) = Oe -8* 50 <x< 0, 6>0. 
[Meerut Univ. B.Sc., 1993; Poona Univ. B.Sc., Oct. 1991] 
24. (a) Two independent observations x,, x. are made on a random variable 
X with density function : 


Six, 9) = 5 exp (-x/0); O<x<o, 0>0. 


Test the null hypothesis o : @ = 2 against the alternative H, : 6 = 4. If Ho is 
accepted when x, + x2 < 9-5, and rejected otherwise, obtain the level of 
significance and power of the test. 

(b) Let X, be a random sample of size one from a population with 


PAL. flx) = 5% x20, 8 > 0. Obtain : (A). the BCR. of size a for testing 
Ho: 9 = Qo against H, : 6 = 9, and (ii) the power of the test. 
[Delhi Univ. B.Sc. (Stat. Hons.), 1983] 


25. (a) Obtain the statistic for testing the hypothesis that the mean of a 
Poisson population is 2 against the alternative that it is 3, on the basis of n 
independent observations. 

(6) Suppose you are testing Ho : A = 2 against H, :X = 1, where A is the 
parameter of the Poisson distribution. Obtain the dest critical region of the test. 

26. (a) Suppose a random sample of size n is taken from the Poisson 


e — o = ¢ 
population 2.5 ea A = A a »x=0, I, 2, .. 


region of size a for testing the hypothesis A = Ao against A = A,, (A; > Ao). 
How can you use the above result to find a confidence interval for 0 ? 
Write an expression for the power function of the test for the hypothesis 
A =X against A > Ap. 
(b) X,, Xo, .... X19 1S a random sample of size 10 from a Poisson 


.. Give the most powerful critical 


‘ 10 
distribution with mez i 6. Show that the critical region C defined by £ x; > 3, 
iavl 
is the best critical region for testing Ho: 6 = 0-1 against H, : 8 = 0-5, 
[Madras Univ. B.Sc., Oct. 1991} 


27. (a) Let X,, X2, ..., X, denote a random sample from a distribution 
having p.d.f. : 


Six, p) =p* (1 -p)'!*73x=0,1;0<p<]1 
= 0, elsewhere | 
It is desired to test Ho: p =5 against H, : p =. 
(b) Suppose X has Bernoulli distribution with-probability of success 8. On. 
the basis of a rencom sample of size n it is proposed to relent the null 
hypothesis, Hp : 8 = t 2 if 
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(X, 4X2 4+...+X) 2201 <2 


For n= 5, find the level of significance of the test. 

28. (a) Let x;, x2, ...,X, be arandom sample from a Bernoulli distribution 
with density : 

f(x : 8) = 6 (1 -6)'*5x=0,1 
Obtain a uniformly most powerful size—a test for Hy: 0 =@ 9 against 
H, : 8 > 09. Would you modify the test if #7, : 6 <Q? 
[Dethi Univ. M.A. (Eco.), 1987) 

(6) The probability that a given machine produces a defective item is p and 
the quality of the items varies independently from one to another. Given a 
random sample of n = 20 items produced by the machine, what is the form of 
the best acceptance region for testing Hg: p= 0-05 versus H, : p = 0-10? 
What are the possible values of a < 0-1 (probability of type I error) in this case 
and the corresponding values of B, the probability of type ‘II error ? 


29. Derive a most powerful test of the hypothesis 6 = r against the 
alternative 6 == for the parameter 9 in a geometric distribution 6 (1 — 6)*, 


x=0, 1, 2,... based on a random sample of size 2. 

30. Describe the method for finding the best critical region of size o for 
testing a simple hypothesis against simple alternative one. Illustrate it by 
finding BCR for testing Ho: 9 = 0 against H, : ® = 1, for the Cauchy 
distribution. 

dx 
OO = aU + (x Oy" ~ <*<* 
based on a random sample of size 1. 
31. The distribution of x is : 


flz,@) =4,0-1<x<60+1 


= 0, otherwise 
If Ho: 6 = 4 and H, : @'=5, determine the critical region on the right hand tail 
of the distribution corresponding to @ = 0-25. Also calculate the probability of 
type II error. 
[Kurukshetra Univ. M.A. (Eco.), 1992] 


32. (a) Define simple and composite hypotheses. State and prove Neyman- 
Pearson Lemma. 


(b) Let X;, Xz, ..., X, be a random sample of size n from p.d.f. 
f(x, 9) = Ox®"!,0<x<1,6>0. 
Obviain the U.M.P. region of size a for testing Ho : 6 = Oo against 
ff; : 8> Qo. Also find the power function of the test. 


33. (a) Let x;, x2, ..., x, be m independent observations on a random 
variable X with density function 


f(x, 8) = 0x8 -' 5; 0<x<1,6>0 
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Show that the best critical region for testing Hy : 6 = 1 against H, : 6 = Z, can 
be defined in terms of the geometric mean of x,, x2, ...,X,- 
(b) Let X,, Xz, ....X,-be a random sample from a distribution with p.d_f. 
Ox®-! if0O<x< 1 
x, 6) = 
fe) 0, otherwise 


where 0 < 6 < oo. Show that the M_P. test of level a for testing Hy: 0 = 1 
against the alternative H, : 6 = 2, is given by the critical region : 


: ‘l 
* i x;> exp [ - 0 ]| 


where x7, _ « 2, is the lower o - point of the x?-distribution with 2n df. 


[Dethi Univ. B.Sc. (Stat. Hons.), 1987| 
34, Let X,,X>, ...,X, be a random sample from a p.d.f. 
6@x®-! 0s x<1,0>0 
x, 8) = 
fix. ®) 0 , elsewhere 


Find an U.M_P. test of size o for testing Hy: 6 = 1 against H, :@ > 1. Also 
obtain the power function. [Delhi Univ. B.Sc. (Stat. Hons.), 1992] 
35. Let X;, X>, ...,X, be a random sample from discrete distribution with 


probability function f(x) for which x takes non-negative integral values 
0,12, 600-2 
According to Ho : 


= 0,1, 2 
fy f= x ! » i= ? l, > e006 
0 , otherwise 


According to H, : 


fo] seeTi® = 05 D472 883 
0 , otherwise 
Obtain the critical region.of the most powerful test of level a for testing Ho. 
against H,. Also find the power of the test for the case n= 1 andk=1. 
36. Ho denotes the null ag that a given distribution has the p.d.f. 
Se ee 


V2n 


and H, denotes the alternative hypothesis that the distribution has the p.d.f. 
+ exp (-lxl),-~<x<0, 


»,- 0 <X< 0 


Obtain the most powerful test for testing Ho against [). 


37. It is required to test Ho against H, from a.single observation x, where 
Ho is the hypothesis that the p.d.f. is 
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1 1 
fix) = == exp (- 5x7), (29 < x <0) 
Vn? 
and H, is the hypothesis that 7 p.d.f. is 
fx) = TT rag Perce se es 
Obtain the most powerful test with level of significance a in this case. 


38. State Neyman-Pearson fundamental lemma. With the usual notations, 
if B is the power of the most powerful test of size a for testing Ho : p = Po 
against H,; : p = p;, show that a < B unless po = p;. 


If PolX) = pew? ~0 <xX<&b 
V2n 
Pix) = a —oo < xX < 00 

mo l+(x-p)?? 


and jt is known, determine the most powerful test of size a. Calculate its 
power, if & and 41 have specified values. 

16:6. Likelihood Ratio Test. Neyman-Pearson Lemma based on the 
magniade of the ratio of two probability density functions provides the best test 
for testing simple hypothesis against simple alternative hypothesis. The best 
test in any given situation depends on the nature of the population distribution 
and the form of the alternative hypothesis being considered. In this section we 
shall discuss a general method of test construction called the Likelihood Ratio 
(L.R.) Test introduced by Neyman and Pearson for testing a hypothesis, simpie 
or composite, against a simple or composite alternative hypothesis. This test is 
related to the maximum likelihood estimates. 

Before defining the test, we give below some notations and terminology. 

Parameter Space. Let us consider a random variable X with p.d.f. 
f(x, 8). In most common applications, though not always, the functional form 
of the population distribution is assumed to be known except for the value of 
some unknown parameter(s) 8 which may take any value on a set ©. This is 
expressed by writing the p.d.f. in the form f(x, 8), 6 € ©. The set ©, which is 
the set of all possible values of 6 is called the parameter space. Such a situation 
gives rise not to one probability distribution but a family of probability 
distributions which we write as { f(x, 6), 6 € ©). For example if X ~ N(u, 0%), 
then the parameter 

= {(, 67): -w < pl <0, 0 < ao < 00} 
In particular, for Pe 1, the family of probability distributions is given by 
(N (1, 1); he O), where @ = (pL: — 00 <p <0) 

In the following discussion we shall consider a general family of 

distribuuions 
{ fix ; 6, 68>, oe 6,) 76;€ , jx 1. 2; weeyk) 

The null hypothesis Ho will state that the parameters belong to some 

subspace @p of the parameter space ©. 
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Let x}, %>, ..., X, be a random sample of size n > 1 from a population with 
p.d.f. f(x, 6;, 82, ..., 0%), where @, the parameter space is thé totality of all 
points that (6), 62, ..., 8,) can assume. We want to test the null lypothesis 

Ho: @;, 92, ..., 8) € Go 
against all alternative hypotheses of the type 
Hy, ‘ (8,, 62, roe 6,) E @- Qo 
The likelihood function of the sample observations is given by 


L= [Lf (%;:0;, 2, .... 9) ...(16-16) 
inl 


According to the principle of maximum likelihood, the likclihood equation 
for estimating any parameter 6; is given by 


aL 
36,> 0, @=1,2,...,k) .. (16°17) 


Using (16-17), we can obtain the maximum likelihood estimates for the 
parameters (8), 85, ..., 8,) as they are allowed to vary over the paramcter space © 
and the subspace @o. Substituting these estimates in (16-16), we obtain the 
maximum values of the likelihood function for variation of the parameters in © 
and @ respectively. Then the criterion for the likclihood ratio test is defined as 
the quoticnt of these two maxima and is given by 


Ud) veo, 1+) 


L(6) 7 L(x, 8)" 


he Moxy, X25 oes Xq) = ...(16-18) 


A A 
where L(@ ) and L(@) are the maxima of the likelihood function (16-16) with 
respect to the parameters in the regions @p and © respectively. 
The quantity 4 is a function of the sample observations only and does not 
involve parameters. Thus A being a function of the random variables, is also a, 
random variable. Obvious A > 0. Further 


Qa°-8 => L(@o)sLlL(8) => Asi 


Hence, we get 

O<A sl .. (16-19) 
The critical region for testing Ho (against H,) is an interval 

0<A<ho, ...(16-20) 


where Ao is some number (< 1) determitied by the distribution of A and the 
desired probability of type 1 error, i.e., Ao is given by the cquation : 


P(X < Ao |.Ho) = @ ». (16-21) 
For example, if g(.) is the p.d.f. of A then Ao is determined from the equation : 


rg 
J g( Att) ad =o (16-21) 
0 : é 
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A test that has critical region defined in (16-20) and (16-21) is a likelihood 
ratio test for testing Ho. ' 

Remark. Equations (16-20): and (16:21) define the critical region for 
testing the hypothesis Ho by the likelihood -ratio test. Suppose that the 
distribution of A is not known but the distribution of some function of A is 
known, then this knowledge can be utilized as given in the following theorém. 

Theorem 16-3. If A is the likelihood ratio for testing a simple 
hypothesis Ho and if U = $ (A) is a. monotonic increasing (decreasing) function 
of A then the test based on U is equivalent to the likelihood ratio test. The 
critical region for the test based on U is 

9(0)<U< (Aq) [6()<U <¢(0)] .- (16-22) 

Proof. The critical region for the likelihood ratio test is given by 

O0<X<Ap, where Ay is determined by 


do 
| 2(Al Hy) dd=@ .(*) 
0 


Let U = (A) be a monotonically increasing function of A. Then (*) gives 


dg 109) : 
a= | a(h| Ho) d= [ h(u | Ho) du 
0 0) 


where h(u | Ho) is the p.d.f. of U when Hp is true. Here the critical region 
0 <A <Apo. transforms to ¢(0) < U < $(Xo). However if U = ¢(X) is a 
monotonic decreasing function of 4, then the inequalities are reversed and we get 
the critical region as ¢(A) < U < ¢(0). 

2. If we are testing a simple null hypothesis Ho then there is a unique 
distribution determined for 4. But if Ho is composite, then the distribution of 
may or may not be unique. In such a case the distribution of 1 may possibly be 
different for different parameter points in @g and then Aq is to be chosen such 
that 


L¥) 
: J g(A1Ho) dh sa (16-23) 
0 


for all values of the parameters in @p. 
However, if we are dealing with large samples, a fairly satisfactory situation 
io this testing of hypothesis problem exists as stated (without proof) in the 
following theorem. 

Theorem 16-4, Let x;, x2, ....%, be a random sample from a population 
with p.df. f(x > Or; 0, .... 0): where the parameter space © is k-dimensional. 
Suppose we want to test the v ‘mposite hypothesis 

Ho . 6, = 6,’ 0, = 6,5 ay 6, = ae a < k 
where 0,’, 02’, ..., 0,’ are.specified numbers. When:Ho is true, -2 log, A is 
asvitptotically distributed as chi-square with r degrees of freedom, i.e., under 
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Ho, 2 logA~Yy?, if nis large. .-(16-24) 


Since 0 <A < 1, -2 log, A is an increasing function of A and approaches 
infinity when 4 —> 0, the critical region for —2 log A being the right hand tail of 
the chi-square distribution. Thus at the level of significance ‘a’, the test may be 
stated as follows : 


Reject Ho if -—2 log, A > Y~p)(a) 
where {(,)7(q) is the upper a-point of the chi-square distribution with r d.f. 
given by 
P[x? > X%)7(@)] = @, 

otherwise Hy may be accepted. 

_16-6-1, Properties of Likelihood Ratio Test. Likelihood- ratio 
(L.R.) test principle is an intuitive one. If we are testing a simple ‘hypothesis Ho 
against a simple alternative hypothesis H, then the LR principle leads to the 


samt test as given by the Neyman-Pearson lemma. This suggests that LR test 
has some desirable properties, specially large sample properties. 


In LR test, the probability of type J error is controlled by suitably choosing 
the cut off point Ao. LR test is generally UMP if an UMP test at all exists. We 
state below, the two asymptotic properties of LR tests. 


1, Under certain conditions, ~-2 log, A has an asymptotic chi-square 
distribution. 
2. Under certain assumptions, LR test is’ consistent. 


16-7. In this section we shall illustrate how the likelihood ratio criterion 
can be used to obtain various standard tests of significance in Statistics. 


16-7-1. Test for the Mean of a Normal Population. Let us take 
the problem of testing if the mean of a normal population has a specified value. 
Let (x1, X2, ..., X,) be a random sample of size n from the normal population 
with mean 1 and variance o”, where 1 and o? are unknown. Suppose we want to 
test the (composite) null hypothesis 


Ho : Lt = Up (specified), 0 < a2 < co 
against the composite alternative hypothesis 
H,:W#p930< 0% <0 
In this case the parameter space @ is given by 
@ = ((t, 0) : — 0 <p < 0,0 <6? <0} 
and the subspace @ détermined by the null hypothesis Ho is given by 
Gy = ((t, 67) : p= po, 0 < a? < 09} 


The likelihood function of the sample observations x,, x2, ..., X, iS given 
by 


16:38 Fundamentals of Mathematical Statistics 


afl2 n 
L -( oH ) . exp - m= 2. (x; - | (16-25) 


e 


The maximum likelihood estimates of 1 and o? are given by : 


A.B oe = 
ee FN a 
, 12 ...(16-26) 
Oz” x (x4; -x )* =s? 
| 


Hence substituting in (16-25), the maximum of L in the parameter space © 
is given by 


A 1 nl2 n 
L(@) =} ant - Xp |- 5 .-(16-27) 


In Qo, the only variable parameter is o? and MLE of o+ for given j1 = [Ho is 
given by 


62 =25(x,- Ho)? = 50%, (say) (16-28) 
=1 56-2 ¥- po? 


=1 L(x; —X )? + (% = Mo), 
the product term vahishes, since 
L(x; — x) (x — Ho} = (x — Mo) LQ; -x) =0 


G2 = 52+ (%— lo)? = 592, (say). ,..(16-28a) 
Hence substituting in (16-25), we get. 


A 1 nf2 
L(@o) = E a exp (-n/2) .-. (16-285) 
The ratio of (16-224) and (16-27) gives the likelihood ratio criterion 
A 
n/2 
n ee. s (16-29) 
L(®) 50 


2 nf2 a | nl2 
(ee! ae : | ...(16:29a) 
r + (x - =a UL + [(X — Mo)?/s?] 


We have proved earlier (§ 14-2) that under Ho, the statistic 
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_x-Ho 


= 


= Ms 
where S2 = PEF Po, 
follows Student’s t-distribution with (n — 1) df. 
Thus 
Be ioe X— fo, xX — Ho 
~te_ .--(16-30) 
SNn ~ SN n—-1 
Substituting in (16-29a), we get 
it ! = (12), (say). (16-31) 


1 + 
n- 1 


The likelihood ratio test for testing Hp against H, consists in. finding a 
critical region of the type O <A <Ag, where Aq is given by (16-21a), which 
requires the distribution of A under Hp. In this case, it is not necessary to obtain 
the distribution of A.since A = ¢(t2) is a monotonic function of ¢? and the-test 
can well be carriéd on with 2 as a criterion as with A [c.f. Theorem 16-1]. Now 
t2 = 0 when A = 1 and /2 becomes infinite when A = 0. The critical region of the 
ER test viz., 0 < A < Ag,-on using (16-31) is equivalent to 


1 [2 ee os 
Tan] S Ap 


1 (2 ae 4 
=> 1) > Ao 
2 


1 2 (Ap)-2" = 


=> 12 >(n—1) [Ag **- 1] = A?, (say). 
Thus the critical region may well'be defined by 


lets 2A .--(16-32) 


where the constant A is determined such that 
PUltl2AlHol=a .--( 16°33) 


Since under Ho, the statistic ¢ follows Student’s ¢ distribution with (n — 1) 
d.f., 


A= bn -l (a2) 
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where the symbol ¢,(a) stands for the right tail 100 a% point of the 
t-distribution with n d.f. given by 


P{t>t,(@)) = J. fo dt= a .»(16-33a) 


’ . 
where f( - ) is the p.d.f. of Student’s ¢ with n d.f. The critical region is shown in 
the following diagram. 


o/2 


Fn 31-3) 0 n-1(%/2) 


Thus for testing Ho : Ut = Uo against [Lb # Up (0?-unknown), we have the 
two-tailed t-test defined as follows : 


if \tl= Vn (&— bo) > th (2), reject Ho and if \t\< t,_; (a2), Ho 
may be accepted. 


Important Remarks. 1. Let us now consider the problem of testing the 
hypothesis 
Ho : Lb = Wo, 0 < 6? < 00 
against the alternative hypothesis 


H, : 1 > Uo, 0 < G2 < 00 
Here © = (jt, 67) : - 00 <i < 00, 0 < 6? < 00} 
and @o = ((U1, 67) : P= Ho, 0 < 0? < 00} 


The maximum likelihood estimates of 1: and o? belonging to © are given 
by 


Jz. if F Do 


faq (16-34) 
t Ho, 1f x < po. 
s*, ifx2 
and | ee (16-340) 
So’, if'x < Ho 


where So? = - ; z (x;- Ho)? »-»(16-345) 
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Thus 
al2 
[Gay on) zen 
L(®) = (16-35) 


1 af2 n eee 
Onset EXP |—- |, if xX < Ho 
y, 


In @o, the only unknown parameter is o? whosé MLE is given by 
6? = So”. Thus 


A 1 )al2 n 
L(@) = Ons exp |- 5 .-. (16-36) 
A eee 
" _ (8) _ (52/502) "/2, if ¥ 2 Wo (1637) 
L(@) 1 if X¥ < po 


Thus the sample observations (x;, X, ..., Xn.) for which x < [lg ‘are to be 
included in the acceptance region. Hence for the sample observations for which 


X > Uo, the likelihood ratio criterion becomes 
V = (s2/sq7)*"/2, X > Wy ...(16-37a) 


which is the same as the expression obtained in (16-29). Proceeding similarly as 
in the above problem, the critical region of the form 0 < % < Ag will be equi- 
valently given by [c.f. (16-32)] 


ies 2 
2 = Maer» 4? 


or by t ae 2A (-?X2po) ..-(16-38) 


where ¢ follows Student's t distribution with (” ~ 1) d.f. The constant A is to be 
determined so that 


P(t>A)=a .. (16-39) 
=> A = ta_1 (Q) 
Hence for testing Ho : [t= Up against H, : > Up, we have. the right, 
tailed-t-test defined as follows : 
Un (3- pot 
Reject Ho if t = “A tAS Ho! > te 1 (@) and 


if t<t,_,(Q@), Ho may be accepted. 
2. If we want to test 
Ho: [b= flo, 0 < 6? < 00 
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against the alternative hypothesis 
Hy 24 <9, 0 < 62 < 0, 


then proceeding exactly similarly as in Remark 1 above, we shall get the critical 
regiun given by 


(<-—¢t,_, (a) . (16-40) 
In this.case we have the left tailed t-test defined as follows : 
Vn (xX — Uo) 
If t= 5 <-—t,_,(Q@), reject Hp otherwise Ho may_ be 
‘accepred. 


3. We summarise below in a tabular form the test criterion, along with the 
confidence interval for the parameter for testing the hypothesis Ho : ph = py 
against various alternatives for the normal population when o* is not known. 


[Here ¢, (@) is upper a-point of the ¢-distrbution with n d.f. as detined in 
(16-33a).j 


NORMAL POPULATION N (u, 0?) ;o? UNKNOWN 


Reject Hg at 
Level of 
Statistic { Significance, «, if 


Hypothesis (1 ~ @) confidence 


interval for pt 


Two tailed 
test 


11> ¢,_, (a2) a t,-\(al2) Sp 


Sx ee 


Right tailed 
test 


$>4,_(@) p2x-—=4,_ (a) 


= 


Left tailed 
test 


psz+n_@ 


Vn 


t<-t,_,(a) 


16:7:2. Test for the Equality of Means of Two Normal 
Porulations. Let us consider two independent random variables X,; and X, 
following normal distributions N(Q1,, 0,2) and N(jp, 62”) respectively where the 
means {1;, {2 and the variances 0,2, 07 are unspecified. Suppose we want to test 
the hypothesis : 

Ho : 1 = U2 =H, (say), (unspecified); 0 < 0,7 < 0, 0 < 62? < ©, 
against the alternative hypothesis 

H, : Wy # My, 67 > 0, a2? > 0. 

Case {. Population variances are unequul. 
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@ = (Uh, He, 017, 02”) : - 0 < pt; < 9, 7 5 0, i = 1, 2) 
and Qy = ((p, O12, 02”) : — 00 <p < 00,67 >0, i= 1,2) 
Let x}; (i= 1,2, ..., m) and x2; (j = 1, 2, ..., )_be two nerendent random 
samples of sizes m and n from the populations N(\}, 012) and N(jto, 057) 
respectively. Then the tikelihood function is given by 


Pe (ace ere 
Li (saa | - OJP - 26,2 2, (x1; Ha) 


ee 1 2 
x fog exp - 267 *, (x2; - 2) | ...( 16-41) 


The maximum likelihood estimates for L141, He, 012 and c,* are given by the 


equations : 
oO —_ : ah > — > : 
Ou; logL =U => am te ’ 
t] A 1 am 
s—logL =0 => =— = 
Sy, 08 2 =F * X2j = Xp 
9 F 1” ...(16-414a) 
aa,2 log i. =8 = of = mS 2 (ni — x, )? = 5,7, (Say). 
and 5o7 we L =0 => So? =* 5 (yj -3)?= 57, (sa) 
Substituting in (16-41), we get 
1] \*2 f 1 wie 
L(8) -(ans3) ; ass } .e7 m+n .--(16-42) 


In @o, we have py = p22 = 1 and the likelihood function is given by : 


m2 i. = 
L(y = tana 7 - €xp f 262,~, (x15 - w | 


a | 


= eee oar a See )? 
2no,2 : 20,7 i 243-H 
To obtain the maximum value of L(@o) for variations in 1, o,? and 6,2, it 
will be seen that estimate of 1 is obtained as the root of a cubic equation 
mx, — 2 (x, — 
ea) . AA, = Bb) (16-43) 
A 9 - A 2 
2 Gai) 2 Gay - wd 
ss j= 


and is thus a complicated function of the sample observations. Consequently the 
likelihood ratio criterion A will be a complex function of the observations and 
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its distribution is quite tedious since it involves the ratio of two variances. 
Consequently, it is impossible to obtain the critical region 0 < 4 < Apo, for 
given a, since the distribution of the population variances is ordinarily 
unknown. However, in any given instance the cubic equation (16-43) can be 
solved for 1 by numerical analysis technique and thus A can be computed. 
Finally, as an approximate test, -2 log, 2 can be regarded as a x2-variate with | 
df. (c.f. Theorem 16-2). 

Case 2. Population Variances are equal, .c., 6;2 = 62% = 67, (say). 
In this case . 

= {(jt1, plz, 07) : ~ 00 <p; < 0, 07 > 0, (i = 1, 2)} 
@ = ((, 0”) : 00 <p <0, o? > 0} 
The likelihood function is then given by 


..-(16-44) 
For }1;, [2, 6? € @, the maximum likelihood equations are given by 


so log L, = 0 => =k 


| ..(16-45) 
a log L=0 > {lo -= 4 
and a logL=0 =>. 6? = — : : [E04 - Ht)? + U(x9j - i] 
= «= 7 Rex ~%,)? + ¥ (xy - x7] 
= =—1— ~{ms,2 + ns_’| »+(16-45a) 


oer the valties from (16-45) and (16-45a) in (16-44), we get 
(ns + a! 
6 = [set] aren me Fim+n)] — ...(06-46) 


27(ms,* + nSo* 
Ip Qo, }; = Hz =H, (Say), and we get 


+ 2 (Xo; - wr} .- (16-47) 


~m + 1 
=> logl(®,) = C -™ °) ~ Jog 0? ~2a| Ben — pL)? + 22; -w? | 
where C is a constant independent of u and o7. The likelihood equation for 
estimating j1 gives 


e 
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Slogh= ds |, (x1; - 1) + = (x2j - »|- 

= (mx, + ~ (m+n) =0 

= = -—— [mz, + nz] (16-48) 
Also os log L =0 

= ~ SE 4 ha [Dew + Eley - py] = 0 

= G2 = — — [Emi ~ ib)? + Eley ~ HY (16-49) 
‘But = (x -Pe E (x1; -X1 +X, — PP 


_ a A 
= L(x; — x1? + m(X1 - 1), 
the product term vanishes since 


>» (x1; —X)) =@ 


2 (x4; — = ms,2+ m [¥, M+ shy 


_ 4g, mnXx, — Xz)? 
= sy +m + ny? 
Similarly, we shall get 
A nm*(x2 - X))? 
ua lene ee 
jo Gu BP = mel + n+ ay? 
Substituting in (16-49), we get 


A I _ | 
Go = 7 ms,2 + AS2? + =o 5 (kin =) ..-(16-49a) 


Substituting from (16-48) and (16-49a) in sean we get 


A (m + a2 
L(®) = (m + - 
2K [nsy + AS9* ss » - 5] 
x exp (- m _ .-(16-50) 


a U6) “| ms? + ns2? - ave 


2) ms,2 + nso? + —"2~ (z, - x,)2 
L@) 1 ot ay 17 
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l cee + )/2 
=f a .-(16-N) 
, mn (x,-%X2)? __\ 
(m + n)(ms,? + nyo?) 
We know that (c.f. § 14-2-10), under the null hypothesis Ho : }1; = {42, the 
statistic 


t 2- ...(16-52) 
| 1 1 
§ —+ = 
moon 
1 
where S? = = ao (ms,2 + nsq?) ...(16-52a) 
follows Student’s t-distribution with (m + n — 2) d.f. Thus in terms of t, we get 
— (m+ ny/2 
{2 
A -| + ae 5 ...(16-53) 


As in § 16-7-1, the test can as well be carriéd with ¢ rather than with 2. The 
critical region 0 < A < Ao, transforms to the critical region of. the type 


2>(men 2 em ay 1] =4% (a) 


i.e., DY lt l>A, 
where A is determined so that 
P[ltl>AlHyl =a .. (16-55) 
Since under Ho, the statistic ¢ follows Student's t-distribution with 
(m+n —2) d.f., we get from (16-55) 
A = ta +a -2 (G/2) .. (16-56) 
where, #,(a) is the right 100 a% point of the ¢-distribution with vn d.f. 
Thus for testing the null hypothesis 
Ho : fy = Py 3 0)7 = 0,2 = 07 > 0 
against the alternative 
H, : fy # fy, 6,2 = 6,2 = o? > 0, 
we have the two-tailed t-test defined as follows ; 


if lel= se > bm +n -2 (0/2) 
| 2 1 
5 —+- 
m n 


reject Ho, otherwise Ay may be accepted. 


Remarks, 1. Proceeding similarly as in Remarks.to § 16-7-1, we can 
obtain the critical regions for testing 


Ho": fy, = Py 3 0)? = 0,7 = 02 > 0 
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against the alternative hypothesis 
Hy: > a oe 

or Hy,’ 2; <p236 =07>0 

We give below, in a tabular ane the ea region, the test statistic and 
the confidence interval for testing the hypothesis 

Ho: 5 = [1 — U2 = Sp, (Say), 

against various alternatives, viz., 5 > So, 5 < 5, or 5 # So. 

2. For testing Ho : 8 = So against the alternative H, : 5 < do, the roles of 
X, and xX, are interchanged and the case 1 of the table is applied. 

3. If 59 = 0, the above test reduces to testing Hy: HW, = [2,i.e., the 
equality of two pupulation means. 


4. If the two population variances are not equal, then for testing 
Ho 5 = So, we use Fisher-Behrens’ d-test- 


Reject Hg at 
level of 
significance 


aif 


(2 — @) confid- 
ence interval 


5 2 (x, - x,)-4,5 


Witt >t,,, 2 (a/2) 
= t,, (say) 


(x, - x,)—-4,5 


<8 <(x, - x) +45 


16-7-3. Test for the Equality of Means of Several Normal 
Populations. Let X;, G = 1, 2, ...,. nj; i= 1,2,... k) be k independent 
random samples from k normal populations with means 1;, po, ..., Be 
respectively and unknown but common variance o?. In other words, the k 
normal populations are supposed to be homoscedastic. We want to test the null 
hypothesis 

Ho : Hy = Wp =... = Hy = Wt (Say), (unspecified) 
6,2 = 022 =... = 0,7 = 0? (say), (unspecified) 
against the alternative hypothesis 
Hy, : pj,’s are not all equal, 
6,2 =0,? =... = 6,2 = 62, (unspecified) 
Thus we have 
@ = (City, Pa, -.., We, 02): — 00 < pt; < 00, @ = 1,2, ...,k)2 02 > 0) 
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and Oo = ((Uy, Has... Hy O27: -o< = UL <oo, (f= 1, 2,..., k):02>0) 
The likelihood function of the sample observations is given by 


nf2 l k mi 
ZS) = aS) bd ; 2? a fo (Xi - | -» (16-57) 


k 
where n= Y n;. 


For variations of }t;, = 1,2, ..., k) and o? in @, the maximum likelihood 
estimates are given by 


- log L(Q)=0 = ij —;) =0 


=> ti; = 15 Er Xj = X; ...(16-58) 
Nn; j=l 


0 . A, 1 A 
oi 10gL(8)=0 => G2= 7 E Xexy - iy? 


= G2 = LL (xy —x;)* = “2. , (say), ...(16-58a) 
ij 


where in ANOVA (Analysis of Variance) terminology, Sw is called within 
sample sum of squares (W.S.S.). 


In @p, the only. variable parameters are 1 and 0” and we have 


1 
L(@») = ae exp ( 76% EE (ty - ut (16-59) 
The MLE’s of- and o? are given by 


ce log L(Q@)) =0 => 22 (xij —p) =0 


oy. 
A ] = 
3 TE dep >> xj=X ...(16-60) 
and 25 log L(G) = 0 => ae (x, - 1)? 
= 6? =— 2 EL ~¥ P=—" , (say), ...(16-60a) 


where in ANOVA terminology, Sy, is called total sum of squares (T.S.S.) 


Substituting from (16-58) and (16-58a) in (16-57) and from (16-60) and 
(16-602) in (16-59), we get respectively 
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A n_ )A2 n 
L(8) = & i) . exp ( +) ...( 16-61) 
a = L(@,) = fe aie exp ( 5) ...(16-62) 
A 
n ee -('= y (16-63) 
L(e) i 
We have 


Sy => (x -x = DE Gy —x;+%;-X)? 
ij t 


=22 Qi ~ XP + 22%; = x) +2y [(x; EYE Oy -x)] 


But ; (xj; ~ X,) = 0, being the algebraic sum of the deviations of the 
sseeation of the ith sample from its mean. 
Sp = ED (ty ~¥)*4 DH, - FP 
= Sw + Sz, (say) ...(16-63a) 
where Sg = x nd x; ~% )2, im ANOVA terminology is called between samples 


sum of squares (B.S.8.): 
Substituting in (16-63), we get 


A= . (16-64) 


1 
“wl ~ 
le 
+ je 
tr 
& 
) 
3. 


We know that under Hp, the statistic 


_ Spl(k - 1) 
s Swi(n ~k) -..(16-65) 


follows F-distribution with (k — 15 n —k) d.f. 


Substituting in (16-64), the likelihood ratio criterion 4 in terms.of F is 
given by ; 


ras en 
Xr -|1 + ak F| »--( 16°66) 
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Since A is a monotonic function of F, the test can well be carried on with 
F as test statistic rather than with A.The critical region for testing Hp against 
Hy, viz.,.0<2 < Ap, is equivalently given by: 


ee 1 "2 
[1+ por > Ao 


=> F 2 : [ (Ag)-2 — 1] =A, (say), ..-(16-67) 
where A is determined from the equation 
PIF>A\Hol =a ..(16-67a) 


Since F follows F-distribution with (k — 1, 2 ~k) d.f., we get 
A=F,_3, a-k (a) 
where F, -1,,-4(@) denotes the upper a-point of the F-distribution with 
(k -—1,n--k) df. 
Hence the test for testing 
Ho! fy = Py =.-. = WEP, 6/2 = On =... = G,2=07>0 
against the alternative kypothesis 
Hy, : u,’s are not all equal, 6,7 = 677 = .., = 0,2 = 07 >0 
is defined as follows : 

Reject Hoy if F > Fy_\, 5-4 (a), otherwise Hy may be accepted, where F is 
defined in (16-65). 

Remark, In ANOVA terminology, Ss/(K — 1) is called Between Samples 
Mean Sum of Squares (M.S.S.) while Sy/(n — &) is called Within Samples (or 
Error) Mean Sum of Squares and thus F is defined as 

Between Samples M.S.S. 
= Within Samples M.S.S. entOOle) 

16:7-4. Test for the-Variance of a Normal Population. Let us 
now consider the problem of testing if the-variance of a normal population has a 
specified value 69, on the basis of a random sample x), x9, ..., x, of size n 
from normal population N(j1, 07). 

' We want to test the hypothesis 
Ho : 6? = Go”, (specified), 
against the alternative hypothesis ° 


Hy : 02 # 62 
Here we = 
= ((l, 02): -0 <p <2, 0750} 
and aaa — 00 << 0007 = 67} 
The likelihood function of the sample observations is given by 
L= Gta)" exp ‘ t.. r (x; - w| (16-68) 
2no? } ° 20? 5, a 


AS in § (16-7-1), (c.f. (16-27)}, we shall get 
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alt A ; 
L@) = ; ~ exp |- 5 (16-69) 
In @o, we have only one variable parameter, viz., 1 and 
af? a 
L(@y) (sec) exp - ia? xX Gi 0 ..-(16-70) 
The MLE for p is given by 
oO = = 
au logL=0 => p=x 


al2 : 
s)ma] 


The likelihood ratio eric is given by 


1O(&|"oL i G-*] 
X= exp |- -n 
(0) Soe xp i 2 Lay? 


We know that under Ho, the statistic 


ay .. (16-72) 
follows chi-square distribution with (n — 1) d.f. In terms of x2, we have 
2 | 
Xv -| - . exp - 5 (x? - m| ..-(16-73) 


Since i is a monotonic function of y?, the test may be done using x? as a 
criterion. The critical region 0 < A < Ag is now equivalent to 


we j 
(x7/n) exp |— 9 (x? —2n)|<Aj 


We , , 

or exp (— 5 x7) (x2)"? < Ap.(ne“!)*? = B, (say). . (16-74) 
Since x? has chi-square distribution. with (n ~ 1) GE, the critical region 

(16-74) is eelerunee by” a pair of 
intervals 0 < x? < x2” and x,? < x2 < &,. 
where 7,7 and x5” are to be determined 
such that the ordinates of (16-73) are 
equal, i.é., 
. GD exp- 3x1) 


. = (x22) exp (~ $2?) 
Critical region is shown as shaded region in the above diagram. 


{ 
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In other words, x,? and 2 are defined by the equations 
P(x? > 4,7) = a/2 ...(16-75) 
and PG? > m2 Int 
In other words, 
11? = 4%2,4-1(0/2) and x27 = 72,_1(1 - a2), 
where 17, 1)(@) is the upper a-point of the chi-square distnbution with (n — 1) 
d.f. Thus the critical region for testing Hy : 6? = Oo? against H, : 6? # 5p", is a 
- two-tailed region given by 
- 42> H7n-i(G/2) and x? <4%7,-1 (1 -a/2) .+4(16-76), 

’ Thus, in this case we have a two-tailed test. 

Remarks. 1. If we Want to test Ho : 6? = Go? against the alternative 
hypothesis H, : 0? < ay? we get a one-tailed (left-tailed) test with critical region 
12 < 724 _ 1(1 = &) while for testing Ho against-H; : 0? > Go’, we have a right 
tailed test with critical region 7? > 714 — 1) (a). 

"We give below in a tabular form, the tést statistic, the test criterion and the 
confidence interval for the parameter for testing Ho : 6? = Go2, 11 (unknown), 
\ against various alternative hypotheses. 
NORMAL POPULATION N(u, 62); 1p UNKNOWN ; Ho: 07 = 


| S.- [Alternative Test Reject Ho at (1 - a) ! 


statistic ‘at level of confidence interval 


significance if 
0 7% re (a) 
x? < 070-1 (1 - @) 


x? >7,, a | (a/2) 


andy?<73,_,0-an) | s ———— 
. x x 1¢ 47, (1 - a/2) 


2: Lf we want.to test the null hypothesis Hp : o? = Go against the various 
alicrmative hypothests, vizZ., 6? > Go* Or G? < Go? or 6? # Oo,” for the normal 
population N (i, o2)..where 1 is known then the test statistic, the critical region 
and ine confidence interval for o? can be obtained from the table given above on 


replacing (n - 1) by n and ns? by, ‘the expression z Gis :)?, 
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16-7-5. Test for Equality of Variances of two Normal 
Populations. Consider two normal populations N(j1;, 0;7) and N (Uo, 62”) 
where the means j1, and 12 and variances 6,7, 0 are unspécified. We want to 
test the hypothesis:: 

Ho : 0,2 = Gq? = o? (unspecificd), with 1, and py (unspecified) against.the 
alternative hypothesis 

H, : 0,2 # Gq? ; pt, and p, (unspecified). 

If x4;, (i = hi 2, ks m) and X2j, i =I, i Tere n) be independent random 
samples of sizes m and n from N(y, 0,7) and Np, 62”) respectively then 


ae ee ee ee ek eee 
mes 210 ;? p 20,2 rag 1 Bi 


x = Pee a , (x9; — [12)? | (16-77) 
20>" 26%" jul j a i 
In this case * 
G8 = {pti, fo, 0:2, 6,7) 2 — 00 < [; < 90; 6,2 > 0, (ze l, 2)} 


and @ = (ij, He, 07) : - 2 <p; < 00; (f= 1, 2), 0? > Q} 
As in § 16-7-2 [c.f. (16-42)}, 


A m2 ; nl? Ph coe 
L(8) “(es ) , ue 7 . exp - = + | .- 16°78):. 


where s,? and s,? are as defined in (16-414). 
In Qo, the likelihood furiction (16-77) is given by 


; (m + n)/2 1 
L{@o) = [tah xp - 402 { 201; ~ ft)? + 2 (Xj ~ H2)? 1| 


...(16:79) 
and the MLE’s for ji, U2 and o? are now given by 
A = as 
i= %1, He =%2 ...(16-80) 
A, a ar A 2: 
ad o7 = 7 ae (X15 - iy)? + % (x2 - [2 ) 
ae [2@j- 3)? + 2 Gaj- — %)?] 
24 
ees -..(16-80a) 


Substituting from (16-80) and (16-80a) in (16-79), we get 
A. m+n (m+n) 1 
L{@q) -| 2x (ms,? + nso) a [- ul | seen) 
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A 
, er) 
"= A 
Lie) 
mon _ (sr (57)? eae (s 
= (m + n)? nz a + ns? +n)/2 
_ (m+ nym m2 [ (ms)? (ns) 
We know that under Hg, the statistic 
-o= X1)2 ~— 1° 2 
F _ 2(x1; = /(m 15) =o : .--(16-83) 
X (x2; = X2)7/(n — 1) 2 
follows F-distribution with (m— 1, n— 1) d.f. (16-83) also implies 
Fe m(n — 1) 5)? 


n(m - 1)s>* 
m—l _ ms,” : 
im n-1)" "nse .- (16-834) 


Substituting in (16-82) and simplifying, we, get 


m-1 _, mi2 
x (m + nyn* M2 n-1} 


mnm2 nrl2 m - ] a 
pray a 
n-1 


.--(16°84) 
1 + 


Thus A is a monotonic function of F and hence the test can‘ be carried on 
with F, defined in (16-83) as test statistic. The critical region,0 < A < Ag can be 
equivalently seen to be given by pair of intervals F < F, and F 2 F., where F; 
and F, are determined so that under Hy 

P(F > F2)=0/2 and P(F 2F;,) = 1-—a/2 

Since, under Ho, F follows Snedecor’s F-distribution with (m — 1, n - 1) -d.f., 
we have 

Fo =Fmoi,-1(@/2) and F, =F,,1,,-, (1-o/2), 
where F,, ,(Q@) is the upper a-point of F-distribution with (m, n) df. 
Consequently for testing Ho: 0,2 = 6,” against the alternative hypothesis 
H, : 0,2 4 0, we have a two-tailed F-test, the critical region being given by 

F>Faiin-1(@/2) and F<Fy_i.,-1(1-a/2) — ...(16-85) 
where F is defined in (16-83) or (16-83a). 

Remark. Let us suppose that we want to test the hypothésis 
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Without loss of generality, we can assume that S,2 > S,?, where S,? and 
S,? ate unbiased estimates of o,2 and o,” respectively. We know that the 
statistic 
Fe 26? 52° (under Hp), 
follows F-distribution with (m -—1,n —1) d.f. The test-statistic, the tést 
criterion and (1 — ) confidence interval for the parameter for various alternative 
hypotheses are given in the following table. 
If 89 = 1, the above test reduces to testing the equality of population 
variances. 
©" 2 
NORMAL POPULATION; Ho: 5 3=%0 


(1 - @) confidence 


Critical region at interval 
level of 2 
signifscance ‘a for ar 

7) 


S. | Alternative 
No.| Hypothesis 


e252, 


2a5 
So” Fy yp 1(Q) 


F>F,,_ 1 4~1(@) ; 
7) 


| «2 ¢2 
O° _§ 1 
F<F__, ..,i-@ —_s 
m-in—il— 9) GP 52° Fy 1 (1- @) 


S,? 


F>F,, 1._-1(@/2), $2 ree a 5 
m-1,a— 


and 
F<F 4,341 - af2) j 


16-7-6. Test for the Equality of Variances of Several Normal 
Populations. Let X;, (j = 1, 2, ..., n;) be a random sample of size n; from the 
normal population N(p;, 07), i = 1, 2, ..., k We want to test the null 
hypothesis : 

Hy : 0,2 = 62? = ... = 04? = o? (unspecified), with 11, [2, ..., We 
(unspecified), against the alternative hypothesis : 

H, :0;? @, 2, ..., k), are not all equal; j1;, pp, «.., Hx (unspecified). 

Here we have 

© = (pt), plo, ---, bes 0,2, 0,2, Kees 0;,”): — 09 < [L; < 00, O;? > 0, 
(i=1,2,...,k)} 
and @ = (hy, po, .-., Me 3 O17, On”, ...,°0,4): — 00 <p; < 09, 672 = 07 > 0, 
(Gi = 1,2, ..., k)} 
The likelihood function of the sample observations xj, (j = 1, 2, .... mj 3 
t= 1,2, ..., k) is given by: 
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k r a 
a a ba cake J y+ guy 
a ie =| as + 26} ; me (xy — Hi) I --- (18-86) 


It can be easily, seen that in © the-MLE’s of j1,’s and o;’s are given by 
/ 


a 
i, = x; and 67 = »y (x;; -— x)? = 5,7, .--( 16-87) 
: jt joi 


a 1 | a,f2 ; 
1d fla) 9] 
k t a,f2 
=exp (. : P al Ge) .-.( 16-88) 


where n = DN. I 
In Qo, 6,2 = 027 = ... = 0,7 = oO and therefore 
aa 1 2 
L{®p) -(st) exp - 202 22 (xi + Te) | .-:(16-89) 
The MLE’s of 1;’s and A are given by 
= = Xi and 62 = == >>) ( x; -x)?=+ = ns? ...(16-90) 
Substituting from._(16- So in (16- 89), we 2 
a n 
L(y) (Bena? exp (- 5) ,..(16-91) 


€ 


k 
aA nf2 
a6) | ne TTL") 


L(@) | ; nal 
Lt = 
k 
au [(s2)**] 


= pe where s?=* Yrs? 


k ‘ a 
- I ( se 7] jo-(1692) 


A is thus a complicated function of sample observations and it is not easy 
to obtain its distribution. However, if n,’s are large (i = 1, 2, ...,k), Theorem 
16-2 provides an approximate test defined as follows : 

For large:n;’s, the quantity -2 log,A is approximately distributed as a chi- 
square variate with 2k - (k+1)=k-1 df. 

The test can, however, be made even if n;’s are not large. It has been 
investigated and found that the distribution of — 2 log, A is approximately a 
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X?-distribution with (k - 1) d.f. even for small n;’s. However, a ‘better 
approximation is provided by the Bartlett’s test statistic : 


2 -—______—2 log hh” 


v= _ 
ee ie ee 
i saa [E( 2) sh 


where 2’ is obtained from 2 on replacing n, ‘by (n; — 1) in (16-92), which 
follows x?-distribution with (k - 1) d.f. Thus the test statistic, under Ho is given 
by 


k 
-> (n; - 1) log, 
yea ~ Xp ...(16-93) 
oak ae fy (1-)_ | 
3 -D LF (ny J Se 
The critical region for the test is, of course, the right-tail of the y?- 
distribution given by 
x? > x76 -1)(0), fa (16-94) 
where x? is defined in (16-93). 


EXERCISE 16 (b) 


1. (a).Define ‘Likelihood Ratio Test’. Under what circumstances would you 
recommend this tést ? 

(b) Let x,, x2, ....X, be a random sample from a ndrmal distribution 
N(@,, 92). Use likelihood ratio test to obtain BCR of size a under Ho: 8, =0 
against H, : 6, #0. 

2. (a) Let po (x) be the density of a random variable with the miixed’ second 
derivative = CE PO) 2 0 for all x and 8. Then show that the family has 
‘monotone likelihood ratio in-x. 

3. Discuss the general method of construction of likelihood ratio: test. 
Consider n Bernoullian trials with probability of success p for each trial. Derive 
the likelihood-ratio test for testing Ho: p = po against H; : p > po: - 

[Delhi Univ. B.Sc. (Stat. Hons.), 1992, 1986) 


4. Let X,, X5, ...,X, be a raridom sample from a Poisson distribution 
with parameter 0. Derive the likelihood ratio test for Ho: 6-= Qo against H, : @> 
Q. Show that this is identical with the corresponding UMP test. — 

5. (a) Let X,, X2, ...,X,,be a random sample from a normal population 
with unknown mean jt and, known: variance o?: Develop the likelihood ratio test 
for testing Hy: p= Lo (specified) agairist (i) Hy : W> po and (ii) H, : Wb < po. 

(b) Let X|, X2,...,X, bea random sample from N(u, 67), where o? is 
known. Develop the likelihood ratio test for testing Io : = Wo (specified) 
against H, : ut < [Up. [Delhi Univ. B.Sc. (Stat. Hons.), 1987] 
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(c) Find by the method of likelihood ratio testing, a test for the null 
hypothesis Ho : m = mo for a normal (m, 62) population, o? known. 

[Calcutta Univ. B.Se., (Maths Hons.), 1989] 

6. Discuss the general method of construction of likelihood ratio test. 

Let X,, X2, ....X, be a random sample from a N(1, ®) population where 
is the unknown variance and 1 is known. Obtain a likelihood ratio test for 
testing a simple Ho : © = Oo against H, : ® > Qo. 

(Delhi Univ. B.Sc. (Stat. Hons.),-1993) 

rp (a) Develop the likelihood ratio test for testing Ho : L = Wo based on a 
random sample of size n from N(t, 62) population 

[Delhi Univ. B.Sc. (Stat. Hons.), 1982] 

(b) Let x,, X2, ...,X, be a random sample from a normal population with 
mean jt and variance o2, 1 and o? being unknown. We wish to test Ho : 1b = Uo 
(specified) against H, : bp # Uo, 0 < G2 < 00, 

Show that the Likelihood Ratio Test is same as the two tailed t-test. 

[Delhi Univ. M.A. (Eco.), 1986] 

(c) Describe the likelihood ratio test. 

The random variable X follows normal distribution with mean 6, and 
variance 02. The parameter space is 

@ = (8;, 02} : - 00 <Q, < 00,0 < 8) <0}. 

Let @ = ((0:, 02) : 8; = 0, 0 < 02 < o}, 


Test the hypothesis Ho : 8, = 0, 9, > 0 against the alternative composite 


hypothesis H, : 0, #0, 0) > 0. 
, (Madras Univ. B.Sc., 1988) 

8. Discuss the general method of construction of likelihood ratio test. 
Given N (41, 6,7) and N(12, 62”), where all the parameters j1), ba 6,2 and 0,” 
are unspecified, develop the LR test for testing Ho : 6,7 = 6? against 
Hy: : 6,2 # G2. 

(Dethi Univ. B.Se. (Stat. Hons.), 1983] 

9. Describe likelihood ratio test and state its important properties. 

Let X, and X, be N(ji;, 02) and N(jt,, 6”) respectively where the means: ‘and 
variance are unspecified. Develop LR test for testing Ho: 11, = 12 against 
Hy. Pi FM 

OR 
Construct LR test for testing Ho: © = Oo against all its alternatives in N(0, 07), 
where 6? is known. 
[Delhi Univ. BSe. (Stat. Hons.), 1988] 

10. Show that the likelihood ratio test for testing the equality of variances 
of two normal distributions is the usual F-test. 

11. Show that the likelihood ratio test for testing Ho: a = 0 against 
H, : a #0, based on a random sample of size n from 


fix’: a: B) = 35 0 -B sx sa+B 


o- 
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is (R/2Z)" where R =.X¢,j —Xqy and Z = max [- Xu, Xw]. 
[Dethi B.Sc. (Stat. Hons.), 1989, 1985} 


12. Show that the likelihood ratio principle leads to the sanie test; when 
testing a simple hypothesis against an alternative simple hypothesis, as that 
given by Neyman-Pearson theorem. [Madras Univ. B.Sc., 1988] 


16-8. Non-parametric Methods. Most of the statistical tests that we 
have discussed so far had the following two features in common. 

(i) The form of the. frequency function of the parent population from which 
the samples have been drawn is assumed to be known, and 

(ii) They were concerned with testing statistical hypothesis about the 
parameters of this frequency function or estimating its parameters. 

For example, almost all the exact (small) sample tests of significance are 
based on the fundamental assumption that the parent population is normal and 
are concerned with testing or estimating the means and variances of these 
populations. Such tests, which deal with the parameters of the population aré 
known as Parametric Tests. Thus, a parametric statistical test is a test whose 
model specifies certain conditions about the parameters of the population from 
which the samples are drawn. 

On the other hand, a Non-parametric (N.P.) Test is a test that does not 
depend on the particular form of the basic frequency function froni which the 
samples are drawn. In other words, non-parametric test does not make any 
assumption regarding the form of the population. 

However, certain assumptions associated with NP. tests are : 

(¢) Sample observations are independent. 

(ié) The variable under study is continuous. 

(iii) p.d.f. is continuous. 

(iv) Lower order moments exist. 

Obviously these assumptions are fewer and much weaker than those 
associated with parametric tesis. 

16-81. Advantages and Disadvantages of N.P. Methods over 
Parametric Methods. Below we shall give briefly the comparative study of. 
parametric and non-parametric methods and their relative merits and demerits. 
Advantages of N.P. Methods : 

(i) N.P. methods are readily comprehensible, very simple and easy to apply 
and do not require complicated sample theory. 

(ii) No assumption is made about the form of the frequency function of the 
parent population from which sampling is done. 

(iit) No parametric technique will apply to the data which are mere 
classification (i.e., which are measured in nominal scale), while N.P. methods 
exist to deal with such data. 

(iy) Since che socio-economic data are not, in general, normally distributed, 
. P. tests have found applications in Psychometry, Scciology and Educational 

tatistics. 

(v) NP. tests are available to deal with the data which are given in ranks or 
whose seemingly numerical scores have the strength of ranks. For instance, no 
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parametric test can be applied if the scores are given in grades such as At, A’, B, 
A, B*, etc. 
Disadvantages of N.P. Tests. 

(¢) N.P. tests can be used only if the measurements are nominal or ordinal. 
Even in that case, if a parametric test exists it is more powerful than the N.P. 
test. In other words, if all the assumptions of a statistical model are satisfied by 
the data and if the measurements are of required strength, then the NP. tests are 
wasteful of time and data: 

(it) So far, no N.P. methods exist for testing interactions in ‘Analysis of 
Variance’ model unless special assumptions about the additivity of the model are 
made. 

_ (tii) NLP. tests are designed to test statistical hypothesis only and not for 
estimating the parameters. 

Remarks 1. Since no assumption is made about the parent distribution, 
the N.P. methods are.sometimes referred to as Distribution Free methods. ‘These 
tests are based on the ‘Order Statistic’ theory. In these tests. we shall be 
using median, range,. quartile, inter-quartile range, etc., for which an ordered 
sample is desirable. By saying that x,, x2, ..., x, is an ordered sample we mean 
X)Sx2 5... 5X4. 

2. The whole structure of the N.P. methods rests on:a simple but 
fundamental property of order. statistic, viz. 

“The distribution of the area under the density function between any two 
ordered observations is independent of the form of the density function” , which 
we Shall now prove. 

16°8-2. Basic Distribution. Let.Z -be a continuous random variable 
with a p.df. f(.). Let Z,, Z2, .... Z, be a random sample of size n irom f(.) and 
let x, X2, ..., X, be the corresponding ordered sample. Then the joint density of 
X1,X2, ...,X, 1S given by 

(X11, Xz, ...,Xq) = MN! f(X) fla) ... fq), OSX] <M XO <<... SX, < 09 
...(16-95) 
the factor n ! appearing since there are n ! permufations of ‘the sample 
observations and each gives rise to the same ordered sample. 
Let us define 


U; = \* f(z) dz = F(x), @= 1,2, ..., A) .. (16:96) 


where. F(.) is the distribution function of Z. But since F(x,) is a uniform random 
variable on [0, 1], U;,(é= 1, 2, ..., n), defined in (16-96) are random variables 
following uniform distribution on [0, 1]. Thus the joint density &(.) of the 
random variables U,, t= 1, 2, ..., n) is given by 

k(u,, Uz, ....U,)=n!,0 Su, <u <...< ups ...(16-97) 
and does not depend on f{.). 
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ru) =|, ah " un! du duz... du, 


(On simplification) __...(16-98)- 


rar 


Thus the expected area under f{.) between two successive ordered obser- 
vations is given by 


E(U;) - EU;-i)=— 


which is independent of ft.) 

168-3. Wald-Wolfowitz Run Test. Suppose x), X2, ..., x,, iS an. 
ordered sample trom a population with density f,(.) andlet y;, y2, ..., y,, be an 
independent ordered sample from another population with density f2 (.). We want 
to test if the samples have been drawn from the same population or, from 
populations with the same density functions, i.e., if f,(.) = foC.). 

Let us combine the two samples and arrange the observations iu urder of 
magnitude to give the combined ordered sample as, (say), 

X1 X21 V2 ¥3X3 YaX4Xs Ys... .. (16-99) 

Run (Definition). A run is defined as a sequence of letters of one kind 
surrounded by a sequence of letters of the other kind, and the nuinber of elements 
in a Fun is usually referred to as the length (I) of the run. 

Thus in (16-99), we have in order, a run of x (/ = 2), a run of y (/ = 3), a 
run of’x (= 1), a run of y (7 = 1) etc. 

If both the samples come from the sample population then there would be 
thorough mingling of x’s and y’s and consequently the number of, runs in the 
combined sample would be large. On the othér hand if the samplesiconie from 
two different populations so that their ranges do not overlap, then there would be 
only two runs of the type x), .X2, ..,X,, and yi, Yo, ...» Yag-- Generally, any 
difference in mean and variance would tend to reduce the number of runs. Thus 
the alternative hypothesis will entail too few runs. 

Procedure. In order to test the Null Hypothesis Ho : f;(.) = fo(.) i.e., the. 
samples have come from the same population we count the number of runs ‘U’ 
in the combined ordered sample. 

Null hypothesis is rejected if U < up, where the value of uo for given level 
of significance is determined from considering the distribution of U under Ho. 

First of all let us find the probability of Obtaining a specific arrangement 
(16-99) under Ho : fi.) = fol.) =f), (Say). 

If X’s and Y’s are transformed to U’s and V’s by the relation : 


U;= | a2) de, ae | 42) de, 


then the joint p.d.f. of U’s and V’s becomes 
g(u;, U9, wooy Ups Vis V2z eons v,) = ny ! ny ! .-. (16-100) 


_i-l 1 
1 antl nel’ 


.-.(16-98a) 


16-62 Fundamentals of Mathematical Statistics 
The probability of an arrangement (16-99) is obtainéd on integrating 
(16-100) over the region defined by 
O<u, <u, <v) <vW2<W,<...< 1 


i.e., integrating u, over 0 to u, ; then uz over 0 to v, and so on. The value of 
the integral will, on simplification, come out to be 


mtn! _ 1 
(n, + n2) ! ni +n 
nm 
+ : 
Since there are exactly| | arrangements of, n;, x’s and no, y’s, it 
follows that all the arrangements of x’s and y’s are equally likely. 
Since under Hg all the oa if i arrangements of n, x’s and nz y’s are 


equally likely, to obtain the distribution of U under Hg, it is necessary to count 
all the arrangements with exactly, ‘u’ runs. Let us first take the case of even 
number of runs, i.e., u = 2k. In this case we should have k runs of x’s and k 
runs of y’s. 

n, x’s will give k runs if they are ‘separated by (k — 1) vertical bars in 
distinct spaces between the x’s. In other words, (k — 1) spaces are to come. out of 
the total number of (n, —'1) spaces between the n, x’s and this can happen in 

= = | 
« : ways. Hence & mins of x’s can be obtained in (7 i ways. 
Non~ 
k-1 

The same result holds if the sequence of runs in (16-99) starts with x or 
with y. Since a sequence of type (16-99) may start with x or y, we get 


Similarly, & runs of y’s can be obtained in ways. 


P(U=2) = 


ni + No 
my 


df the number of runs.'in (16-99) is odd, i-e., u = 2k + 1, then we should 
shave either (i) (kK + 1) runs of x.and & runs of y or (ii) & runs of x and (k + 1) 
runs of y. Hence 


P(U=2k +1) =P () +P Gd 


ny + ie) 
ny 


Hence the distribution of U under Hp is given by- 
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n,- 1 no- 1 


.-.(16-101) 
If the probability of type I error is fixed as a, then up is determined from 
the equation : 


. h(u) = ot ..(16-102) 


ua 2 
where h(u) is the probability function of U given by (16-101). 
Calculation of uo from (16-102) is quite tedious and cumbersome unless n, 
and n> are large in which case under Ho, U is asymptotically normal with 
2nyn, 


E(U) eras ..-(16-103) 
Var (U) ~ 2nyN2 (Zn Nz - m — M2) (16-104) 


~ (ny + nz)? (ny + nz - 1) 
and we can use the normal test 


U —- E(U) : 
i= ~ N(O, 1), asymptotically. ...(16-10 
ar) (0, 1), asymp y. ...(16-105) 

This approximation is fairly good if each of n, and nz is greater than 10. 
Since the alternative hypothesis is “too few runs’, the test is ordinarily one- 
tailed with only negative values leading to the rejection of Ho, 

16°3:4. Test for Randomness. Another application of the 
‘run’ theory is in testing the randomness of a given. set of observations. Let 
X1, X2, ..-, X, be the set of observations arranged in the order in which they 
occur, i.e., x; is the ith observation in the outcome of an experiment. Then, for 
each of the observations, we see if it is above or below the value of. the median 
of the observations and write A if the observation is above and B if it is below 
the median value. Thus we get a sequence of A’s and B’s of the type, (say), 

ABBAAABABB ..(*) 

Under the null hypothesis Ho that the set of observations is random, the 

number of runs U in (*) is ar.v. with 


E(U) = ar 2 and vane =% (t=4) ...(16:106) 
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For large n (say, > 25), U may be regarded as asymptotically normal and we 
may use the normal test. 

16-85. Median Test. Median test is a statistical procedure for testing 
if two independent ordered samples differ in their central tendencies. In other 
words, it gives information if two independent samples are likely to have been 
crawn from the populations with the same median. 

As, in ‘run’ test, let x1, x2, ..-, Xp, aNd 1, Ya, .-+ Yap bE two independent 


ordered samples from the populations with p.d-f.’s f,(.) and.f2(.) respectively. 
The measurements must be at least ordinal. Let z), 22, .... 2a, +n. be the 
combined ordered sample. Let m, be the number of x’s and mz the number of y’s 
exceeding the median value M, (say), of the combined sample. 

Then under the null hypothesis that the samples come from the same 
population or from different populations with the same median, i.e., under 
Ho: fi) = f2(.), the joint distribution of m, and mz is the hypergeometric 
distribution with probability function : 

ry " Ao 
my, my 
p(m,, m2) = ..(16-107) 
A, + Ao 
( m, + M2 ) 
If m, < n,/2, then the critical region corresponding to the size of type 1 
efror_ a, is given by m, < m,’ where m,’ is computed from the equation 
my 
»y ptm, my) =@ .--( 16-108) 
m, 21 
The distribution of . under Hp is also hyper-geometric with 


E(m,) => , if N =n, + nz is even 


n tp if N is odd 
(16-109) 


and Var (m) “ane qin — 1) iN is even 


=n od ‘if N is odd 


This distribution is most a the times quite inconvenient to use. However 
for large samples, we may regard m; to be asymptotically normal and use 
normal test, viz., 

m,- E(m,) ck . 
= ————= ~ N(0, 1), asymptotically. —_...(16-110) 
V Var (m,) 


Remarks 1. The observations m, and m, can be classified into the 
following 2 x 2 conungency table. 
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No. of: observations 
>M 


No. of observations 
<M 


If frequencies are small we can compute the exact probabilities from 
(16-107), rather than approximate them. However, if frequencies are large we 
may use ¥2-test with 1 d.f. (for a 2 x 2 contingency table) for testing Ho. 

The approximation is fairly good if both 7, and nz exceed 10, 

2. Median test is sensitive to the differences in location between f;(x) and 
faly) but not to differences in their shapes. Thus if f,(x) and f(y) have the same 
median, we would expect Mg : fi(.) = fh.) to be accepted ordinarily even though 
their shapes are quite different. 

3. Generally, the median test makes the correct decision with a little more 
mee gaa than does the sign test (cf. §. 16-8-6) but not as decisively as the 
t-test. 

16-8-6. Sign Test. Consider a situation where it is desirea to compare 
two things or materials under various sets of conditions. An experiment is thus 
conducted under-the following circumstances : 

__ (i) When there are pairs of observations on two things being compared. 

, Gd For any given pair, each of the two observations is made under similar 
extrancous conditions, 

. (ii) Different pairs are observed under different conditions. 


Céndition (iii) implies that the differences d; = x;- y;; i= 1,2, ..., n have 
different variancés and thus renders the paired t-test (Chapter 14) invalid, which 
‘would’ have otherwise been used unless there was obvious non-normality. So, in 
such a case we use the ‘Sign Test’, named so since it is based on the signs (plus 
or minus) of the deviations d; ='x;- y;. No assumptions are made regarding the 
parent Population. The only assumptions are : 
~ @& ‘Measurements are such that the deviations d; = x; ~ y;, can be expressed 
in terms of | positive or negative signs. = 

_(ti) Variables have contnuous distribution. 

(iti) d,’s are independent. 


Different pairs (x;, y;) may. be from different populations (say ,w.r.t. age, 
weight, stature, education, etc.). The only requirement is that within each -pair, 
there is matching w.r.t relevant extrancous factors. 
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Procedure. Let (x;, y,), i= 1,2, ...,2 be n paired sample observations 
drawn from the two populations with p.df.‘s f,(.) and f,(.). We want to test the 
null hypothesis Ho : f;(.) =/o(.). To test Ho, consider’ d; = x; -— y;, @ = 1, 2, ..., 
n). When Ho is true, x; and y; constitute a random sample of size 2.from the 
same population. Since the probability that the first of the two sample 
observations exceeds the second is same as the probability that the second 
exceeds the first and since hypothetically the probability of a tie is zero, Ho may 
be restated as : 


Ho: PIX -Y>0]=% and PIX-Y¥<0}=5 
Let us define: 
Us 1, ifx;-y,;> 0 
; 0, if x;- yi < 0 


U; is a Bernoulli variate with p = P(x;—y;>0) =4. Since U;’s, i = 1, 2, 
..n are independent, U = : Us, the total number of positive deviations, is a 


Binomial variate with ees rand p (= +). Let the. number of positive 
deviations be k. Then 


k 
P(U Sk) =2" re (p=q= + under H;). 


gk 
-( 5 ) Z, & =r" (say). ..(16°111) 


If p’ $,.0-05,. we reject’ Ho at 5% level of significance and if p’> 0-05, we 
conclude that the data do not provide any evidence against the null hypothesis, 
which may therefore, be accepted. 

For large samples;-(n 2 30),-we may dit Lain symtoucally normal 
with, (under Ho) 


, E(U) =.np =n/2 and Var wye np ae old. 
op 212) a lee Um is symptocally NO. 1), ...(16-112) 


| VVar 


and we may use normal tést. 


16-8-7. Mann-Whitney-Wilcoxon U-test. This non-parametric test 
for two samples was described by Wilcoxon and studied by Mann.and Whitney. 
It is the most widely used test as an alternative to the t-test when we do not 
make the ¢-test assumptions about the parent population. 

Let x; (i = 1, 2, ..., ay) and y; (j = 1, 2, ..., 13) be independent ordered 
samples of size n, and az from the populations with p.d/. f,(.) and f,(.) 
respectively. We want to test the null hypothesis 7, : f;(.) =/2(.). Like the run 
test, Mann-Whitney test is based on the pattern of the x’s and y’s in the 
combined ordered sample. Let T denote the sum of ranks of the y’s in the 
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combined ordered sample. For example, for the pattern (16-99) on page 16-61 of 
combined ordered sample the ranks of y observations are respectively 3, 4, 5, 7, 
10 etc. and T=3+4+5+7+ 10+... The test statistic U is then defined in 


terms of 7 as follows : 
U=nn+ faleg + 7) oe i (16-113) 


If T is significantly large or smal} then / : f,(.) =/,(.) is rejected. The 
problem is to find the distribution of T under Hg. Unfortunately, it is very 
troublesome to obtain the distribution of T under 479. However, Mann and 
Whitney have obtained the distribution of T for small n, and n>, have found the 
moments of T in general and shown that 7 is asymptotically normal. It has been 
established that under Ho, U is psyopioncally normally distributed as N (j1, 2), 
where 


hyn 
yw =E(U)=~>* 
g? = Var (U) =a ty Fat A) (16-114) 
Hence 
z = T= #~ vo, 1), asymptotically, _...(16-114a) 


and rormal test can be used. The approximation is fairly good if both a, and 2, 
are preater than 8. 

Remark. The asymptotic relative efficiency (ARE) of Mann-Whitney’s 
U-test relative to two samples ¢-test is greater than or equal to 0-864. For a 
normal population, this ARE = 3/r = 0-955. Accordingly, Mann-Whiiney’s U- 
test is regarded as the best non-parametric test for location. 


EXERCISE 16 (c} 


1. Explain what is meant by non-parametric meuwus. How do they differ 
from parametric methods ? Illustrate your answer by considering a suitable 
nonparametric test for the hypothesis that two independent samples havc come 
from the same population. 


2. (a) Derive the sign test, stating clearly the assumptions made. 


(b) Describe the median test for the two-sample location problem. Find the 
distribution of the test statistic and compute its mean and variance under the null 
hypothesis. How is the test carried out in case of large samples ? 


3. Explain the main difference between parametric and non-parametric 
approaches to the theory of statistical inference. Derive the sign test for two 
sample problem. (Delhi Univ. B.Sc. (Stat. Hons.), 1988) 


4. Describe the sign test. 
X;,X2, ..., X19 is a random sample of size 10 from a population having 
distribution function F(x). Test the hypothesis’ H9 : F(72) = ; against the 


alternative hypothesis, H, : F(72) > 3: [Madras Univ. B.Sc., 1988) 
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5. Explain Median Test and how it is applied. 

The observations of a random sample of size 10 from a distribution which 
is symmetric about K.s. are 20-2, 24-1, 21-3, 17-2, 19-8, 16-5, 21-8, 18-7, 
17-1, 19-9, Use Wilcoxon’s Test to test the hypothesis Hy : K.s = 18 against 
H,: K.5 > 18 if a = 0-05. You may use the normal approximation. 

[Agra Univ. B.Sc., 1989) 

6. Describe the procedure in median test when there are two independent 
samples. What non-parametric test would you use when the two samples are 
related. 

7. Discuss the Mann-Whitney-Wilcoxon test for the equality of two 
population distribution functions. [Delhi Univ. B.Sc. (Stat. Hons.), 1986] 

8. What are the advantages and disadvantages of non-parametric methods 
over parametric methods ? 

Develop the following non-parametric tests, stating the underlying 
assumptions and the null hypotheses : 

(a) Median test 

(b) Mann-Whitney-Wilcoxon test. 

(Dethi Univ. B.Sc. (Stat. Hons.), 7993) 

9. Explain the main difference between the parametric and non-parametric 
approaches to the theory of statistical inference. What are the advantages of non- 
parametric tests ? Develop Median test and Mann-Whitney- Wilcoxon test. 

[Delhi Univ. B.Se. (Stat. Hons.), 1992, 1985] 

10. Distinguish between ‘sign test’ and ‘Wilcoxon signed rank test’. 
Describe the sign test for testing that the population median is Mo against the 
alternative that the median is M, (> Mo). 

11. Develop the Mann-Whitney-Wilcoxon test and obtain the mean and 
variance of the test statistic T. How is the test carried out for large samples ? 

12. Explaining the distinction between the parametric and non-parametric 
tests, write down the advantages of ‘non-parametric tests. Also write their 
disadvantages. 

Thirty observations as given below are obtained : 

24, 35, 12, 50, 60, 70, 68, 49, 80, 25, 69, 28, 28, 11, 83, 
31, 37, 34, 54, 75, 45, 95, 75, 26, 43, 57, 94, 48, 63, 45 

Test their randomness by considering the sequence of positive and negative 
signs. [Agra Univ. B.Sc., 1989] 

‘13. What are thé advantages and disadvantages of Non-Parametric Methods 
over Parametric Methods ? 

Derive the Wald-Wolfowitz run test for testing the equality of two 
distribution functions. [Delhi Univ. B.Sc. (Stat. Hons.), 1987] 

14, What are the advantagés of Non Parametric tests ? Define a run and the 
'cngth of a run. Describe the Run Test in detail for testing the equality of the 
«wo populations and extend the test when the ties occur. 

[Delhi Univ. B.Sc: (Stat. Hons:), 1983) 
1S. What.are runs ? Comment on their utility in non-parametric inference. 
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If R, and R, denote the number of runs of n, objects of one type and n2 
Objects of another type in a sample-of size n, + no, then find the probability that. 
R, + R, =r, for r even and r odd, and also:the mean and variance of R, + R2 
when all these n, + , observations arise from the same dist-ibution. 

[Delhi Univ. M.Sc. (Stat.), 1991) 

16. (¢) Explain how the run test can be used to test randomness. 

(it) In the median test with samples of size 9 and 7 respectively, from two 
populations find the probability density function of the random variable 
representing the number of values of the samples from the first population in 
the lower half of the combined sample. (Madras Univ. B.Sc., 1988) 

17. (a) The win-lose record of a certain basketball team for its last 50 
consecutive games was as follows :— 

WWWWWWLWWWWWWLWLWWWLLWWWW 

LWWWLLWWWWWWLLWWLLLWWLWWW 

Apply run test to test that sequence of wins and losses.is random. 

(b) Use an appropriate non-parametric test procedure to test for randomness 
the following set of 30 two-digit numbers : 

15, 17, Ol, 65, 69, 69, 58 41, 81, 16, 
16, 20, 00, 84, 22, 28, 26, 46, 66, 36, 
86, 66, 17, 34, 49, 85, 45, 51, 40, 10. 

18. At the beginning of the year a first grade class was randomly divided 
into two groups. One group was taught to read using a uniform method, where 
all students progressed from one stage to the next at the same time, following 
the teacher’s direction. The second group was taught to read using an individual 
method, where, each student progressed at his own rate according to a 
programmed work book, under supervision of the teacher. At the end of the year 
each student was given a reading ability test with the following results : 


First Group Second Group 
227 55 184 202 271 63 
176 234 147 14 151 284 
252 194 88 165 235 53 
149 247 161 171 147 228 
16 92 171 292 99 271 


Use the Wald-Wolfowitz run-test to test for the equality of the distribution: 
functions of the two groups. 

19. Using the number of runs above and bclow the median, test for 
randomness the following set of a table of 2-digit numbers : 

15, 77, 01, 65, 69, 69, 58, 40, 81, 16, 16 20, 00, 84, 22, 

28, 26, 46, 66, 36, 86, 66, 17, 43, 49, 85, 40, 51, 40, 10. 


16-9. Sequential Analysis - Introduction We have seen that in 
Neyman-Pearson theory of testing hypothesis, n, the samplé size is regarded as a 
fixed constant and keeping a fixed, we minimise B. But in the, sequential 
analysis theory propounded by A. Wald n, the sample size is not fixed but is 
regarded:as a random variable whereas both a and B are fixed constants. 

16-9-1. Sequential Probability Ratio Test (SPRT). The best 
known procedure in sequential testing is the Sequential Probability Ratio Test 
(SPRT) developed by A. Wald discussed below. 
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Suppose we ‘want to test the hypothesis, Hy : 8 = Qo against the 
alternative hypothesis, H; : 8 = 6,, for a distribution with p.d.f. fx, 6) For any 
positive integer m, the likelihood function of a sample x), x2, ..., X» from the 
population with p.d.f. f(x, 9) is given by 


Lim = IT f(x, 1) when H, is true, 
inl 


andby Lom = II f(x; 9) when Ho is true, 
iol 
and the likelihood ratio ,, is given by 


wee) oo 16-115) 


The SPRT for testing Ho against H, is defined as follows : 
At each stage of the experiment (at the mth trial for any integral value m), 
the likelihood ratio A,,, (m = 1, 2, ...) is computed. 
(i). If A,, 2A, we jerminate the process with the 
rejection of Ho 
(ii) If A,, <B, we terminate the process with the 


acceptance of Ho, and --o(16116) 
(iii) If B-<1,,<A, we continue sampling by 
taking an additional observation. 
Here A and B, (B < A) are the constants which are determined by the relation 
A= 1-8 5 a ..-(16-117) 


“1l-@ 


where o@ and f are the aad of tyne I error and type II error respectively. 
From computational point of view, it is much convenient to deal with 
log A, rather than A,,, since 


fi, 91) 
log An, -= log 8 hx,, @) = - 5 z; .--(16-118) 
= fis 0) 
where z; = log Ax;, 8) @,) ... (16-1182) 


In terms of z;’s, SPRT is defined as follows : 
i) If 2 2; 2 log A, reject Ho 
(ii) If D 2; < log B, reject H, 
(iii) If log B < yz; < log A, continue sampling by 
taking an additional observation. 


,. (16-119) 
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Remarks 1. In SPRT, we continue. taking additional observations unless 

the inequality 

B<ijA,<A => logB<22z;<1logA, 
is violated at either end. It has been proved that SPRT eventually terminates 
with probability one. 

2. Sequential schemes provide for a minimum amount of sampling and 
thus result is considerable saving in terms of inspéction, time and money. As 
compared with single sampling, sequential scheme requires on the average 33% 
to 50% less inspection for‘ the same degree of protection i.e., for the same 
values of a and B. 

16:9-2. Operating Characteristic (O.C.) Function of SPRT. 
The O.C. function L(8) is defined as 

L(®) = Probability of accepting Ho : 8 = @) when 6 is the true value of 
the parameter, 
and since the power function 
P(6)= Probability of rejecting Hy where 6 is the true value, we get 
L(®) = 1-P(8) .. (16-120) 

The O.C. function of a SPRT for testing Ho : © = 69 against the alternative 
H,:®=6,, in sampling from a population with density function f(x, 9) is 
given by 

| Afh®) — 1 
L(®) =7) —BMO) ” 
where for each value of 8, the value of 2(8) # 0, is to be determined so that 


: Bs a - 


»+(16-121) 


~ 


Se , 8o) 


where the constants A and B have already been defined in (16-117). It has been 
proved that under very simple conditions :on the nature of the function f(x, 8), 
there exists a unique value of 4(8) # 0 such that (16-122) is satisfied. 

16-9-3. Average Sample Number (A.S.N.). The sample size n in 
sequential testing is a random variable which can be determined in terms of the 
true density function f(x, 6). The A.S.N. function for the S.P:R.T. for testing 
Ho : 8 = 9o against H, : 8 = 9;, is given by 


(16-122) 


E(n) ac. ames ..-(16-123) 
_,.|f%,9)] ,_1-B ,__ 8B | 
where Z =\log fe bh me ,B= i= 6 .-:(16-123a) 


Example 16-11. Give the SP.R.T. for testing Hy: 9 =9%9 against 
H, : © = 61 (> 9), in sampling from a normal density. 


a ey|-} x= 8 ‘| —00 <x < 00 
8 WV2n oe 


where o is known. Also obtain its O.C. function and ASN. function. 
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Solution. ein 81) = exp |- a5 { (x; ~ 91)? - (x; -— 90)? } 


Kei Oo) 

1 * 
= exp|- Foi { (Oo ~ 01) (2x; - 8 - 6,)} »»(*) 
a2 Lx 8 _ 9, - 8 _ Oo + Oy hk 

z; = log ae. ahs 32 E Xi- 5 | saaQ™") 


0, - 0, — 90 or _ m(Qo + 01) 
o Co 


= ~ logA,, =z z;= 


Hence the S.P.R.T. for Ho : © = Oo against H, ; @ = 0, is given by 
(c.f. (16-119)] : 
(i) Reject Ho if 


0,-0 Oo +91) |. Z 
iste, 080) 0g Pa 


=> 


ees ie 6) + 98 
(ii) Accept Ho if 


“ Oo i m (989 + 9;) | 
7 i2, 7! $0, -05 we ( 78 }- 3 (81 > Oo) 


and (iii) Continue taking additional observations as long as 


nen }<%5% =" [Ex -@o+ Pi) : Pa 


l-a 
sates 


7D l-a 


_ oe( 5 B meas 8,) ee 
i‘ main 81) 


- 


O.C. Function. First of all we shall determine h = A(8) #0, from 


(16-122) ie., from 
ie Ria fix, 0) dx=1 


Statistical Inference - II (Sequential Analysis) 16-73 


1 1 (x — 0 1 
a =|. orl 3 ( o J [exe {- 2-00 


h 
x (-2x + Oo + oo} dx = 1, [On using (*)] 


=> =| exp [-35 (7 - 2x (@, - 0) 4 +6) 
oV2n ?~ 
+ 02+ (8,2 Oo2)h} ] dx=1 
If we take 
d = (8, ~ Oo) h + 0 ans 
A2 = (8 2 = 0o2)h + 6? 
then L.H.S. becomes 


1 = ~ ] 
exp|—- => (x - a fs 
GV 27x J. 20? * 


which being the total area under normal probability curve with mean A and 
variance a? is always unity, as desired. Thus h = h(8) is the solution of (***) 
and is given by 
(8,2— Oo*) h+ 6? = [0, — Oo) + 6]? 
=> (0;2—Oo7)h = (8; ~ Oo) 7h 2 + 20(8, — Ooh 
Since h = h(8) ¥0 and 6, # 9, on dividing throughout by (8; — 9o)h, we 
get 


(0, +09) = (6; -Oo) h +20 
-2 
= MW) = 
1—%o 
Substituting for h(8) in (16-121) we get the required expression for the 
O.C. function. 
A.S.N. function. We have 


- 


x,0:) 6,-6 Oo + 0, | 
Z = log San 1S _ ne) [From (**)] 
Q s ¢ 
o E(Z) = Lae [2E(x) - 0 ~ 0, ] 
8, ~-98 
=~? [26 ~ 0, — 8, | 


Substituting in (16-123), we get the required A.S.N. function. 
Example 16-12. Let X have the distribution : 
fix, 9) = 6 (1 — 8)'-4;5x=0,1;0<06<1 
For testing Ho : 8 = 8» against H, : © = 6,, construct S.P.R.T. and obtain 
its A.S.N. and O.C. functions. 
[Dethi Univ. B.Sc. (Stat. Hons.), 1993, 1985] : 
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Solution. We have 


N _ LG. XQ, «00 Xm | 1) 
si Lx, X2y ove, Xm \ Ho) 


m- x; 


Ex . 
fel! iw + x; m — Lx; 
+ (1 ~ 6;) | (0, (1 ~ 89) ) 


_ 0, ate 1-0, \- Pas 
~ | Qo 1 — 8 
log A,, = Lx; log (0:/0o).+ (m — Lx;) log ( T= 


ne 6, (1 ~ 9g) 1-6 
= 2 oe| @(1 01) ~8,) |+» log 7 0: my 


Hence SPRT for testing Ho: 8 = Oo against H, : 6 = 0,, is given by 


(uf. (16-119)] : 
(i) Accept Ho if log A,, < log a }- b, (say) 


hog. Sie ee b - — m log [(1 - 6,)/(1 - 8o)] | 
he 27S Tog [0301 ~ @oVBa(t -0,)]~-7mr SY 


(i) Reject H, (Accept H,) if log ky 2 og 1=# = a, (gay) 

sits hee a —m log [(1 — 0,)/(1 - 99)] 

C., i 2 = T mss ° 
=~ if 22 log [6,(1 — 69)/80(1 -.9,)] Tm (say) 
(iit) Continue sampling if 

b<logaA, <a => An<L2ij<Tm 
O.C. Function. O:C. function is .given by : 


L(6) = [A*® — 1] / [AX®) — B®] (cf. (16-121)] .. 
where for each value of 6, h(0) # 0 is to be determined such that 
E Fan) “we 1 (cf. (16-122)] 
= ar fix 0) =1 
a 21 (8) (=e) T° 6(1-6)'-* = 


f1-— 68, \AX8) - (8 
=> (=e | a-9+( )” .6 =] 


(a) 


.» (i) 
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The solution of this equation for h = h(8) is very tedious. From: practical 
point of view, instead of solving (ii).for h we ‘tegard h as:a parameter-and solve 
it for 6, thus giving 


| (‘ )*- te y"]--(: - 0, i 
8 1 - 0 1-0, 
=> Q spe 1G OATS Oy) 60h) ) (ii) 
(81/80) — [1 — 4)/C1 = Op) s (Say). ...(éii 
Using (i), we have 
L@) = for = 1(0, h), (say) (iv) 
[(1 — B)/o.)* — [B/(1 - a)" 7M), (Say). +s 


Various points on the 0.C. curve are obtained by assigning arbitrary 
values to ‘h’ and computing the corresponding values of ® and L(6) from (ii) 
and (iv) respectively. 

A.S.N. Function. 

Sf, 91) 0,) _ 1- B _ B 
aS me | Sg PRTG 


J\%, Vy) 


E(Z) = E, log log hare PS i) Six, 8) 


=> 2 ae (5 aaa Br -x 
= E08] (% (=e) ‘le Oa 


= (1-6) log teh @ . log G: 
1-0, 


6, (1-8 
= 6 lo eletuaey] + + toe Teh 1-0, .»-(v) 
A.S.N. is given by 
E(n) = _ L(®) log B: + {1 -L(6)). log A (vi) 


E(Z) 
Substituting the values of E(Z) and L(6) from (v) and-{iv) in (vi), we get 
the A.S.N. function. 
Remark. If A assumes negative values i.e., if instead of h we take —h 
where fh > 0, then 


- ] 1 —A* Af — 
LQ.-W) = Zea Bt (a= |= (fear | 2 
=> L(®,—h) =B*. L(6, h) .+ (Vii) 
[(1 - 8,)/(1 - @)]* - 2. 
= -*) =Tq eit — Ool* -@ a 


= O(h) ( 2 ‘ (vif) 
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Formulae (vii) and (viii) are very convenient .o use for obtaining the points 
on O.C. curve for arbitrary negative values of h. 


EXERCISE 16(d) 


1. (a) Describe Wald’s Sequential Probability Ra:io Test. 
(b) Explain how the sequential test procedure Jiffers from the Neyman- 
Pearson test procedure. 


2. Define the OC function and ASN function ia sequential analysis. 
Derive their approximate expressions for the sequential probability ratio test of a 
simple hypothesis against a simple alternative. 

3. Describe Wald’s S.P.R.T. Let X be a Bernoulli variate with p.d.f. 

f(x 9) = 67 (1 - 6)! -45x=0,1;0<56<1. 

Employ S.P.R.T. for testing Ho : © = 69 against H, : @ = 6,, and obtain its 
A.SN. and O.C. functions. 

[Delhi Univ. B.Sc. (Stat. Hons.), 1993, ’85) 

4. (a) Explain how the sequential test procedure differs from the Neyman- 
Pearson test procedure. 

Develop the S.P.R.T. for testing Ho : x = m, against H, : 7 = 1%, based on 
a random sample from a binomial population with parameters (n, 2), 2 being 
known. Obtain its O.C. and A.S.N, functions. 


(6) Obtain the sequential probability ratio test of the hypothesis Ho : 8 =4 
against H, : 0 = : for the distribution : 


67 (1 — 6)! -*, forx =0, 1 
fix; 8) =| 0 otherwise 
[Madras Univ. B.Sc., 1988] 
5. Develop Ss .P.R. test for testing Ho: 6 = 09 against H, : 8 = 0,, 
(8; > 89), where 6 is the parameter of a Poisson distribution. Find approximate 
expressions for OC function and ASN function of the test. 
(Delhi Univ. B.Sc. (Stat. Hons.), 1988} 
6. Describe S.P.R.T., its OC and ASN functions. 
Construct S.P.R.T. for testing Ho: 6 = Oo against H, : ® = 0,, (0< 8p < 9;), 


on the basis of a random sample drawn from the Pareto distribution with density 
function : 


6a® 
fix, 8) = sea» x 2 @:- 
Also obtain its O.C. function and A.S.N. function. 


[Delhi Univ. B.Sc. (Stat. Hons.), 1989) 


7. Explain how the sequential test procedure differs from the Neyman: 
Pearson test procedure 


Develop the S.P.R.T. for testing Ho : 6 = 89 against H, : 8 = 8, (> 9p), 
based on a random sample of size n from a population with p.c.f. 
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fix, 0) =3 e-*8 x>0,0> 0. 


Also obtain its A.S.N. and O.C. functions. 
[Delhi Univ. B.Sc. (Stat. Hons.), 1987] 
8. Let X have the p.d.f. 
Q@e-8*;x20,0>0 


x, 9)= 
fx, ®) 0 _, elsewhere 

For testing Ho : 8 = Qo against H, : 8 = 8,, construct the S.P-R.T. and 
obtain its ASN and OC functions. 

[Indian Civil Services (Main), 1989; 
Delhi Univ. B.Sc. (Stat. Hons.), 1982, 1986] 

9. (2) What is a sequential test ? How will you develop an optimum test of 
a specified strength for a simple null hypothesis versus a simple alternative ? 

(b) Find expressions for the sample size expected for termination of SPRT 
both under Ho and H,. Clearly state all the.assumptions made. 

(c) A random variable follows the normal distribution N [9, 07], where o? 
is known. Derive the SPRT for testing Ho : 8 = 09 against H, : 6 = 6;. Obtain 
the approximate expression for the OC function. 

[Indian Civil Services (Main), 1990) 

10. To test sequentially the hypothesis Ho that the distribution is given by 
P(X =-1) = P(X = 1) = P(X = 2) =4 against the alternative H, that it is given 
by P(X =-1) = P(X = 1) =1; P(X =2) =}, it is decided to continue sampling 


as long as 2 +B S, < se - where 5, =X, +X2+...+ X,, the X,’s 
being the successive observations. Compute the probability under Hy and under 
H, that the'‘procedure will terminate with the fourth observation or earlier. 


11. X,,X>, ..., X, be a sequence of i.i.d. observations from N(j1, 67), 
where pt is known, ao? being unknown. Obtain the SPRT for testing 
Hy : 6? = Go, against H, : 6? = 6,2 (> do). Also obtain its OC function and 
ASN. function. 


ADDITIONAL EXERCISE ON CHAPTER XVI 


1. (a) “An examiner may pass a dull student or may fail a good student”. 
Explain the above statement with reference to type-I and type-I errors. 


2. A single value x is drawn from a normal population with mean m and 
variance 25. The null hypothesis Hy : m = SO is accepted if x < 75, otherwise 
H, : m= 60 is considered true. Evaluate the type I and type II errors. 


3. Let p be the proportion of smokers in a certain city. You desire to test 
the hypothesis Ho : p = : against H, : p= 2. If you reject Ho when 60 persons 


or more are found smokers in a sample cf 100 persons, compute the significance 
level and power of the test. 


16.78 Fundamentals of Mathematical Statistics 


4. Let X,,X>, ..., X29 be a random sample of size 20 from a Poisson 
20 
distribution with mean 8. Show that the critical region defined by Lx; > 5, 


is'a uniformly most powerful critical region for testing Hp: ® = 1/10 against 
H,:08> 1/10. 

5. Let X,,X>, ...,X,_ denote a random sample from a normal distribution 
N(6, 16). Find the sample size n and a uniformly most powerful test of 
Hy): 8 = 25 against H, : 6 < 25, with power function K (8) so that approxi- 

nately K(25) = 0-10 and K(23) = 0-90. 
6. In testing Ho : 6 = Oo against H, : d = 0; ( #9o), for the distribution : 


fa)=h ex|- ' 3°). <x <0, >0) 


snow that the UMP test is of the form 
2x; 2constant and yx; < constant. 
7. X,,X2 is a random sample from a distribution with~ p.d.f. 


Six, 8) = xen, x> 0,6 > 0. The hypothesis Hp: 6 = 2 is tested against 


,: 6 > 2 and is rejected if and only if X, + X2 = 9-5. Obtain the power 
function and the significance level of the test. Also find the probability of type 
Il-error when 6 = 4. 


8. On the basis of a single observation x from the following p.d.f. 
fix, 0) = 3 e*® (x> 0; 0>0) 
the null hypothesis, Ho: = 1 against the alternative hypothesis H, : 6 = 4, is 
tested by using a set 
C = (x: x>3) 
as the critical region. Prove that the critical region C provides a most.powerful 
test of its size. What is the power of the test ? 

9. Let X be a single observation from the density f (x ; 6) = 26x + 1 —9, 
0<x<1,161< 1; zero otherwise. Find the best critical region of size a, for 
testing Ho : 8 = 0 against H, : 8-< 0, Express the power ‘function of this test in 
terms of a. Is the test uniformly most powerful ? Explain. 

10. X,,X2, ..., X, is a random sample of size n from N (6, 100). For 
testing Hy: 6 = 75 against H, : 6 > 75, the following test procedure is 
proposed : 

Reject Hy if x 2c; Accept Ho if x <c. 
Determine n and c so that the power function P(6) of the test satisfies 
P(75) =-159' and P(77) = 0-841. 
11. Let X,, Xs, ..., X, be a random sample from a distribution having 


p..f. 
[x(1~ x)]®=? 
fix, 9) = x 8, 6) »0<x<1,6>90 
=0, elsewhere 
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Show that the best critical region for testing Hp: 8 = 1 against #, :6=2 jis 


caf (a. X95 coer v,) e-s tl x, (1 - x) 


12. Let X be a single observation from the distribution with p.d.f. 
f(x:0) =6.e-%; 0<x< 0, (8 > 0) 
=Q, elsewhere. 
Obtain the best critical region of size a for testing Hy: = 1 against 
H,: ® = 2. Also obtain the power of this test. 
{[Dethi Univ. M.Sc. (Maths), 1990) 
13. Let (x), x2, .... X9) be a random sample from N (1, 9). To test the 
hypothesis Ho : p = 40 against H;, : p # 40, consider the following two critical 
regions : 
C,=(%:%24,} 


Cy = (x 21x -401 249) 
(i) Obtain the values of a; and a so that the size of each critical region is 
0-05. 
(ii) Calculate the power of the two critical regions when 41 = 39 and 
{4 = 41 and comment on the results. 
(Delhi Univ. M.A. (Eco.), 1992] 
14. (a) For a sample of size 25 from a normal population N(u, 25), 


X = 11-5. Test the hypothesis Hp : 4 = 10 against the alternative H,:p> 10. 
Calculate the power of the test for 1 = 11. 
[Dethi. Univ. M.A. (Eco.), 1988] 
(b) Let X ~N (ps, 25). The null and alterantive hypotheses are : 
Ho: p=10 and H,:p<10 
(i) Give the best test of size a = 0-05 for a sample of size 25. (No 
derivation is expected). 
(ii) Calculate the power of the test tor pi = 8. 
[Delhi Univ. M.A, (Eco.), 1990) 
15. X is normally distributed with o = 5 :nd it is desired to test 
Ho: a = 10S against H, : p =110, How large a sample shouid be taken if the 
probability of accepting Ho when H, is true is 0-02 and if a critical region of 
size 0-05 is used ? 
(Agra Univ. B.Sc. 1989) 
16. Let p be the probability that a given die shows an even number. To 
test Hy: p =3 against H; : p= ; ; the following procedure is adopted. Toss the 
die twice and accept Hy if both times it shOws even number. Find the 
probabilities of type I and type II errors. 
(Delihi Univ. M.C.A., 1990) 
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17. The p.d.f. of x is given by f(x) = ; O<'x <6. Let the null 


hypothesis be Hy: 8 = against the alternative hypothesis WH, : 9 > ‘. We 
have a random sample of one observation. The critical region is defined by 
C=(x:x>1). 

(i) Find the significance level of the test. 


(ii) Find the power of the test for 9 = 7/3 and 9 = 10/3. 
[Dethi Univ. M.A. (Eco.), 1991] 
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TABLE II 
POWERS, ROOTS AND RECIPROCALS 


tate} et ee eee 
n 


103823 
1110592 
117649 
125000 
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TABLE III 
POWERS, ROOTS AND RECIPROCALS 


Ae eee 


17:213 
17:325 
17-435 


132651 
, 140608 
, 148877 


54 157464 17-544 
$5 | ,166375 17'652 
S6 175616 17-758 
57 185193 17:863 
58 195112 17-967: 


89 205379 18-070 
60 216000 18-171 
61 226981 
62 238328 
63 250047 
64 262144 
65 274625 
68 287496 
67 300763 
68 314432 
69 328509 
70 343000 
71 357911 
72 373248 
73 389017 
14 405224 
15 421875 
16 438976 
17 -456533 
78 474552 
79 493039 
80 512000 
81 531441 
82 551368 
83 571787 
84 592704 
85 614125 
86 636056 
87 658503 
88 681472 
89. 704969 
90 729000 
91 7153571 
92 775688 
93 804357 
94 830584 
95 857375 
96 884736: 
97 912673 
98 941192 
‘99 970299 


[000] 10000 [1000000 
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TABLE IV 
AREAS UNDER NORMAL CURVE 
Normal probability curve is given by ; 


‘y — 2 
f= ; exp {- = (254) —0o <X¥ <0 Areas uncer Nomal Curve 
OV 2 S | 
and standard normal probability curve is given by 


1 1 

(2) = exp ( - 522), 0 <2 <ee 
Von : 

where Z 22) ~ vO. 1) 


The following table gives the shaded area in the diagram viz.. P(0<Z<z) 
for different values of z. 


1Z— 0 1 2 3 4 5 6 “7 8 oO 


boONLe@ CerAawW bOoKH-S 


e * ° 


CS ee Oe 
~@ Oor"aw 
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TABLE V 
ORDINATES OF THE NORMAL PROBABILITY CURVE 


The following table gives the ordinates of the standard normal probability 
curve, i.e., it givés the value of 


1 1 
(2) = exp (— 72), 00 < z < 00 
V2n 


for different values of } where 


z en N (0, 1) 


Obviously ¢ (-z) = $(z). 
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where 


P=P, (x? >a) =o 


| TABLE VI _ 
SIGNIFICANT VALUES 4?(a) OF CHI-SQUARE 
DISTRIBUTION. (RIGHT TAIL AREAS FOR GIVEN PROBABILITY a, 


AND-v IS DEGREES OF FREEDOM (4.f.) 


Degree.of “| zi 
freedom 0 = -99 
V) 
l -000157 
2 0201 
3 -115 
4 -297 
5 554 
6 - -872 
7 1-239 
8 1-646 
9 2-088 
10 2-558 
11 3-053 
12 3-571 
13 4-107 
14 4-660 
15 4-229 
16 5-812 
17 6-408 
18 7-015 
19 7-633 
20 8-260 
21 8-897 
22 9-542 
23 10-196 
24 10-856 
25 11-524 
26 12-198 
27 12-879 
28 13-565 
29 14-256 
30 14-953 


Probability (Level of significance) 


0-95 


00393 
-103 
352 
‘711 
1-145 
1-635 
2-167 
2-733 
3-325 
3-940 


4-575 
5-226 
5-892 
6-571 
7-261 
7-962 
8-672 
9.390 

10-117 

10-851 


11-591 
12-338 
13-091 
13-848 
14-611 
15-379 
16-151 
16-928 
17-708 
18-493 


0-50 


-455 
1-386 
2-366 
3-357 
4-351 
5-348 
6-346 
7-344 
8-343 
9-340 


10-341 
11-340 
12-340 
13-339 
14-339 
15-338 
16-338 
17-338 
18-338 
19-337 


20-337 
21337 
22-337 
23-337 
24-337 
25-336 
26-336 
27-336 
28-336 


29-336 


0-10 


2-706 
4-605 
6-251 
1-719 
9-236 
10-645 
12-017 
13-362 
14-684 
15-987 


17-275 
18-549 
19-812 
21-064 
22-307 
23-542 
24-769 
25-989 
27-204 
28-412 


29-615 
30-813 
32-007 
32-196 
34-382 
35-363 
36-741 
37-916 
39-°087 
40-256 


- 


0:05 


3-841 
5-991 
7-815 
9-488 
11-070 
12-592 
14-067 
15-507 
16-919 
18-307 


19-675 
21-026 
22-362 
23-685 
24-996 
26-296 
27-587, 
28-869 
30-144 
31-410 


32-671 
33-924 
35-172 
36-415 
37-65? 
38-885 
40-113 
41-337 
42-557 
43-773 


0-02 


5-214 

7-824 

9-837 
11-668 
13-388 
15-033 
16-622 
18-168 
19-679 
21-161 


22-618 
24-054 
25-472 
26-873 
28-259 
29-633 
30-995 
32-346 
33-687 
35-Q20° 


36-349 
37-659 
38-968 
40-270 
41-566 
41-856 
44-140 
45-419 
46-693 
47-962 


Note. For degrees of freedom: (v) greater than 30, the quantity 


’ 2x7 - ‘/2u — 1 may be used as a normal. variate with unit variance. 


0-01 


6-635 

9.210 
11-341 
13-277 
15-086 
16-812 
18-475 
20-090 
21.666 
23-209 


24-725 
26-217 
27-688 
29-141 
30-578 
32-000 
33-409 
34-805- 
36-191 
37-566 


38-932 
40-289 
41-638 
42-980 
44-314 
45-642 
46-963 
48-278 
49-588 
50-892 
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TABLE VII 
SIGNIFICANT VALUES 1t,(a) OF tsDISTRIBUTION 
(TWO TAIL AREAS) 
P [11> 1(a)] 20 
af, | Probability (Level of Significance) 
(vy 0-50 0-10 0-05 0-02 0-01 0-001 
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TABLE VIII 
SIGNIFICANT VALUES OF THE VARIANCE-RATiO 
F-DISTRIBUTION (RIGHT. TAIL AREAS) 
5 PER CENT POINTS 


1 2 3 4 5 6 8 12 24 oo 
V2 


1 161-4 199-5%215-7 224-6 230-2 234-0 238-9 243-9 249-0 254-3 
2 18-51 19-00 19-16 19-25 19-30 19-35 19-37 19-41 19-45 19-50 
3 10:13 9-55 9-28 9-12 9-01 8-94 8-84 8-74 8-64 8-55 
4 7-71 6:94 6-59 639 6-26 6-16 6-04 5-91 5-77 5-65 
5 661 5:79 5-41 5-19 £05 4-95 4-82 468 453 4-96 


6 5:99 5-14 4-76 4-53 439 4-28 4-15 4:00 3-84 3-67 
7 5:59 4:74 4-35 4-12 3-97 3-87 3-78 3-57 3-41 93-23 
8 5:32 4:46 4-07 3-84 3-69 3-58 3-44 3-28 3-12 2-93 
9 5-12 4-26 3-865 3-63 3-48 3-37 3-23 3:07 2:90 2-71 
0 4-96 410 3-71 3-48 3-33 3-22 3-07 2-91 2-74 2-54 


11 4-84 -3-98 3-59 3-365 3:20 3-09 2-95 2:79 2-61 2-40 
12 4:75 3-88 4-49 3-26 3-11 3-00 2-85 2-69 2-50 2-30 
13 4-67 3:80 5-41 3-18 3:02 2:92 2-77 2:60 2-42 2-21 
14 460 3-74 3:54 3-11 2:96 2-85 2-70 2-53 2-35 2-13 
15 4-54 3-68 3-29 3:06 2:90 2-79 2-64 2-48 2-29 2-07 
16 4:49 3-63 3-4 3-01 2-85 2-74 2-59 2-42 2-24 2-01 
17 4-45 3-59 3-20 2-96 2-81 2-70 2-55 2:38 2-19 1-96 
18 4-41 3:55 3-96 2-93 2:77 2-66 2-51 2:34 2-15 1-92 
19 4-38 3-52 3-13 2:90 2-74 2-63 2-48 2-31 2-11 1-88 
20 4-35 3-49 3-10 2-87 2-71 2:60 2-45 2-28 2-08 1-84 
21 4:32 3:47 3-07 2-84 268 2-57 2:42 2:25 2:05 1-81 
22 4-30 3-44 3:05 2-82 2:66 2-55 2:40 2-23 2-03 1-76 
23 4-28 3-42 3-03 2:80 2:64 2-53 2-38 2:20 2:00 1-76 
24 | 426 440 3-01 2-78 2-62 2-51 2:36 2-18 1-98 1-73 
25 4:24 3:38 2:99 2-76 2:60 2-49 2:34 2-16 1-96 1-71 
26 4:22 3:37 2-98 2-74 2:59 2-47 2-32 2:15 1-95 1-60 
27 4-21 335 2:96 2-73 2:57 2-46 2-30 2-13 1-93 1-67 
28 4:20 3-34 2:95 2-71 2:56 2-44 2-29 2-12 1-91 1-65 
29 4:18 3-33 2-93 2-70 2:54 2-43 2-28 2:10 1-90 1-64 
30 4:17 3-32 2-92 2-69 2-53 2-42 2-27 2-09 1-89 1-62 
40 4:08 3-23 2-84 2-61 2-45 2-34 2-18 2:00 1-79 1-51 
60 400 3-15 2-76 2-52 2:37 2:25 2-10 1-92 1-70 1-39 
120 3-92 3-87 2-68 2-45 2-29 2-17 2-02 1:83 1-62 1-25 


240 3-84 2:99 2.60 2-37 2-21 2-09 1-94 1-75 1-52 1-00 
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A 
Absolute moments 3°25 
Additional theorem of 
—expectation 6°4 
—probability 4°30 
Additive property of 
—Binomial variates 7°18 
—Chi-square 13°7 
—Gamma variates 8-70 
—Normal variates 8°26 
—Poisson variates 7°47 
Algebra of events 4-21 
—sets 4°15 
Alternative hypothesis . 12°6 
Analysis of variance 14°67, 16-54, 16°49 
A posteriori probability 4:69 
Approximate distributions . 
—Binomial for hyper-geometric 7:91 
—Normal for binomial 7:22 
—Normal for Chi-square 13-6 
—Normal for gamma 8-69 
—Normal for Poisson 7°56 
—Normal for t 14°14 
—Poisson for binomial. 7:40 
—Poisson for negative binomial 7°75 
A priori probability 4-69 
Arithmetic mean 2°6 
—Demerits 2°11 
—Merits 2°10 
—Properties 2°8 
—Weighted 2:11 
Array. 2:1 
Association of attributes 11:15 
Attributes 114-1 
Average sample number 16°71 
A.S.N. function 16°71 
Axiomatic definition of probability 4-17 
8 
Bartlett’s test 13°68 
Bayes’ theorem 4:69 
Bernoulli distribution 71 
Bernoulli trials 71 
Bernoulli law-of largo numbers 6-103 
Best linear. unbiased estimator 15°14 
Beta distribution of 
First kind 8-70 
Second kind 8-72 
Binomial distribution 71 
—Additive property 7°15 
—Characteristic function. 7°16 
—Cumulants 7°16 
—Factonial moments TAA 
—Mean deviation 7114 
—Mode : 7-12 
—Moment Generating Function 7°14 


—Recurrerice relation for moments 


7:9 
Bivariate frequency distribution 10°32 
Bivariate normal distribution 10°84 
—Conditiona distributions 10-90 
—Marginal distributions 10°88 
—N.g.f. 10-86 
—moments 10°91 
Blackwellisation 15°34 
Boole’s inequality 4-33 
Borel Cantelli lemma 6-115 
Buffon’s needle problem 4°83 
Cc 
Cauchy distribution 8-98- 
Central limit theorem 8-105 
—De Moivre's theorem 8-105 
—tLiaponouffs’ theorem 8-109 
—Lindberg-Levy theorem 8-107 
Central tendency 2°6' 
Charactenistic function 6°77 
—multivariate 6°84 
—Properties 6°78 
—Theorems 6°79 
—uniqueness theorem of 6°88 
Charlier’s checks 3°24 
Chebychev inequality 6°97 
Chi-square distnbution 13-1 
—non-central 13°69 
Classical definition of probability 4°3 
Class frequency 11:1 
Class limits 2°3 
Class sets 4°17 
—field 4:17 
—ring 74:17 
—o-feld 4°17 
—o-ring 4:17 
Co-efficient of dispersion 3°12 
—skewness 3°32 
—variation 3°12 
Complete sufficient statistic 15-32 
Complete family of distributions 15°31 
Composite hypothesis 16:2 
Compound distribution 8-116 
—Binomial distribution 8-116 
—Poisson distribution 8-117 
Conditional distribution function 5-46 
Conditional expectation 6-54 
—probability 4:35 
Conditional variance 6°54 
Confidence interval 15-82 
—imits 15-82 
—For difference of means 
12°41, 16°47 
—For means 12°32, 16°42 
—For proportions ‘p’ 12°13 
—For variances 16°52, 16:55 
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Consistent estimator 
—Invariance property 
—Sufficient conditions 

Consistent Data 

Contingency table 


Continuous distribution function™ 


—frequency distribution 
Continuous random variable 


Convergence in distribution (or law): 


Convergence in probability 
Convolution 
Correlation coefficient 
—imits 
—properties 
Correlation of ranks 
Correlation ratio 
Covariance 
Cramer-Rao's inequality 
Cramer's theorem 


Critical region 
Cumulants 
Cumulative frequency 
Curve fitting 
Curvilinear regression 

D 
Deciles 


Degree of freedom 
Degenerate random variable 
De-Moivre-Laplace theorem 
Dichotomy 
Discrete distribution function 
Dispersion 
Distribution discrete 
—Continuous 
Distribution function 
Distribution function of 
joint distribution 
Distributions : 
—Bernoulli 
—Beta 
—Binomial 
—Bivariate normal 
—Cauchy 
—Chi-square 
—Discrete uniform 
—Double exponential 
—Exponential 
—F-distribution 
—Gamma 


—Generalised power series 


—Geometric 
—Hypergeometric 
—Laplace 

—Logistic 

—Log normal 
—Multinomial 

~-Negative binomial 
—Non-central Chi-square 


—Non-Central F 
—Non-Central t 
—Normal 
—Order statistic 
Pareto 
—Pearsonian system 
—Poisson 
—Power series 
—Rectangular 
—Student’s t 
— t distribution 
—Tnangular 
—Uniform 
—Webul 


Efficient estimators, 
Elementary events 
Empirical probability 
Equally likely events 
Error function (N.D.)- 
Errors of first and second kind 
Estimation 

Estimator (unbiased) 
Event 

Exhaustive events 
Expectation 


—of continuous random:variable 


Exponential distribution 
F 


Factorial moments 
Factorization theorem (Neyman) 
Favourable events 
Fisher's Lemma 
—Z-distribution 
—Z transformation 
Fourier inversion theorem 
Frequency 
Frequency distribution: 
—polygon , 
—table 
F-distribution (Snedecor’s) 
—non-central 


G 


Gama Distribution 
—Additive property 
—Cumulant Gen. Function 
—Moment Gen.-Function 
Geometric distribution 
Geometric mean 
Geometrical Probability 
Goodness of fit 
Grouped frequency distribution 
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H 
Harmonic mean 2°25 
Hazoor Bazer's Theorem 15°54 
Helly Bray Treorem 6-90 
Histogram 2-4 
Hypergeometric distribution 7°88 


( 
Impossible event 4°19 
Independence of attributes 11-12 
Independent events 4:37 
intraclass correlation 10°81 
interval Estimation 15:82 
Inversion Theorem 6°87 
J 
Joint density function 5-44 
—Probability law $°41 
—Probability distribution 
function 5-42 
—Probability mass funstion §°41 
K 
Khintchin’s Theorem 6:104 
Koopman's Form 15°19 
Kurtosis 3°35 
<2 L 


Laplace double exponential distribution 


8°89 
Large sample tests 12°10 
Least square principle 9°1 
Leptokurtic 3°35 
Level of significance 12-7 
Liapounoff's theorem 8-109 
Likelihood ratio test 16°34 
Linear transformation 13-16 
Lindeberg-Levy theorem 8-107 
Lines of regression 10-49 
Log-normal distribution 8-65 

M 

Markoff’s theorem 6°104 
Mann Whitney U-test 14°62 
Marginal distribution function 5°43 


—probability function 
Mathematical expectation 6 

—of continuous r.v. 6 
Mean deviation 3 
Mean square deviation 3° 
Measures of 

+~Central tendency 2 

—Correlation A 0 
—Correlation ratio 


3 
—Dispersion 3°1 
—Kurtosis 3°35 
—Skewness 3°32 
Median 2°13 
—Derivation 2°19 
—Demerits 2°16 
—Merits 2°16 
Median test 16°64 
Mesokurtic curve 3°35 
Methods of estimation 15°52 
—Least square 15-73 
—Maximum Likelihood 15-52 
-—Minimum variance 15°69 
—Moments 15-69 
Mode 2:17 
—Derivation 2°19 
—Demenits 2:22 
—Mernits 2:22 
Moments 3°21 
—Absolute 3°25 
—Factorial 3°24 
—Sheppard correction 3°23 
Moment generating function 6-67 
—of binomial distribution 7°14 
—of bivariate normal 
distribution 10-86 
—of negative binomial 
distribution 7:74 
—of chi-sqare distribution 13°§ 
—of exponential distribution 8°86 
—of gamnia distribution 8:68 
—of geometric distribution 7-85 
—of multinomial distribution 7°96 
—of non-central chi-square 
distribution 13°76 
—of normal distribution 8°23 
—of Poisson distribution 7°47 
Moments of bivariate probability 
distribution 6°54 
Most efficient estimator 15-8 
Multinomial distribution 7°95 
Multiple correlation 
—coeflficient 10°111 
Multiple regression 10-105 
Multiplication law of probability 4:35 
" —expectation 6-6 
Multivariate Characteristic 
Function 6°84 
Mutually exclusive events 4-2 
MVB estimator 1§:24, 15:2 
N 
Negative binomial distribution 7°72 
—Cumulants 7°74 
—Moment Generating Function 7:74 
—Probability Generating 
Function 7°76 
Neyman and Pearson Lemma 16°7 
Non-parametric methods 16-59 


4 


Normal Distribution 8.17 
—Characteristics 8-20 
—Cumulant Generating Function 3-24 
—Importance 8-31 
—Mean Deviation 8. 28 
—Median 8-23 
—Mode 8.22 
—Moments 8-24 
—Moment Generating Function 8-23 

Normal Curve 8:29 
—equations 9-2 

Null hypothesis 12-6 

O 

Ogive 2-27 

Operating charecteristic(O.C.) curve 16-21 
O - C. function 16-71 

Order statistics 8. 136 
—Distribution function of X(p)_ 8 - 136 
—Joint p.d.f. of X(t), X(s) 8-138 
—p.d.f. of X/4) 8. 137 
—p.d.f. of X(n) * §8.-137 
—p.d.f. of Range 8-140 

Pp 

Paired t-test 

Pairwise independent events 4 -39 

Parameter 12-3 

Parameter space 15-1 

Pareto distribution 16-76 

Partial correlation. 10-103 
—Coefficient 10-114 
—regression coefficient 10-105 

Partition values 2-26 
—Graphical location 2:27 

Pearson B and coefficients 3-24 

Pearson's distribution 8-120 

Percentiles 2-26 

Platykurtic 3-35 

Poisson distribution 7-40 
— Additive Property 7-47 
—Characteristic function 7-47 
—Cumulants 7-47 
—lMode 7-44 
—Moments 7-47 


—-Moment Generating Function 7-47 
—Recurrence relation for eee 


- 46 

Poisson Process 7-42 
Power of test 16-5 
Power series distribution 7-101 
Probability 4-1 
—Addition law 4-20 
—Axiomatic 4-17 
—Bayes’ Theorem 4-69 
—Definitions of Terms 4-2 
—Empirical 4.4 
—Function .- 5-6 
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—Geometric 4-4 
—History 44. 
—Multiplicative Law 4-3 
Probability density function 5- i 
Probability distribution 5 | 
—conditional 4.3 
Probability generating function 6- 12 
—mass function 5-4 
Probable error 10-3 
Purposive sampling 12.4 
Q 
Quartile 2-26,5-14% 
—deviation 
R ‘ 
Random experiments 4:1 
—sampling 12) 
Random variables 5-| 
—discrete 5-4 
Range 3. 
Rank Correlation 10-3 
Rao-Cramer inequality 15-2 
Rao-Blackwell theorem 15-% 
Rectangular distribution 8-| 
Regression coefficients 10-8 
Regression (linear and nonlinear) 10-51 
—curve 10-@ 
—plane 10-16 
Relation between t and F 14-4 
F and x° 14-6 
Reproductive property 
(see additive property) 
Residual variance (regression) 10-16 
Root mean square deviation 3.2 
Run 16-6 
Run Test (Wald-Wolfowitz) 16 -6l 
Ss 
Sample correlation coefficient 14.3 
Sample partial correlation coefficient 14.2 
Sample multiple correlation 
coefficient 14.9 
Sample space 4 . 18 
Sample standard deviation 
—variance and covariance 12.2 
Sampling 12. 
—attributes 12 . il 
—variables 12 2 
Sampling distribution 12-9 
Scatter diagram 10-1 
Semi-interquartile range 3-1 
Sequential analysis 16 - 69 
—probability ratio test 16 -69 
—A.S.N. function . 16-71 
—O.C. Function 16-71 


INDEX 


Set é 
—complement 
disjoint 
—emply 
—equal 

Sign test 

Simple sampling 

Simple hypothesis 

Skewness 

Spearman's Rank Correlation 
Coefficient 
— Tied ranks 

Standard deviation 
—error 

Statistic 

Statistics, meaning of 

Statistical probability 

Stochastic Indeperidence 

Stratified sampling 

Student's t-distribution 

Sufficient statistics 
—Complete 
—Factorisation theorem 


T 
Tehebycheff inequality 


(see Chebychev's inequality) 


Testing of hypothesis 
—composite hypouiesis 
—critical region 
—distribution free methods 
—level of significance 
—most powerful test (MPT) 


—null hypothesis 
—Neyman-Pearson Lemma 
—non-parametric tests 
—sequential test 

(see sequential analysis) 
—two kinds of errors 


—uniformly most powerful test 
UMPT) 


Test for randomness 
—of significance 

Tipett random numbers 

tdistribution (Student's) 
Fisher 


non-central 14°43 
Transformation of 
—one-dimensional r.v. §:70 
—two-dimensional r.v. '§-73 
Triangular distribution _ 8-10 
Truncatéd distributions 8-151 
—Binomial Examples 8°54 
—Ca 8-55 
—Gamma 8-156 
—Normal Examples 8-153 
—Poisson 8-154, 8-155 
Type of Sampling ‘ 
U 
U-Test 16°66 
UMPT 16°7 
Unbiasedness of estimator 15-2 
Uniform distribution (Discrete) 8-1 
—continuous 
Uniqueness theorem of moment 
generating functions B:72 
Vv 
Variance 8°3 
Variance of residual 10-11 
Ww 
Wald-Wolfowtz Test 16°61 
Wald's SPRT ; 16°69 
Weak law of large numbers 6-101 
Weighted.mean 2°11 
Y 
Yates’ continuity correction 13°57 
Yule's coefficiency of association 11°17 
—colligation 11:17 
Yule'’s notation (multiple and 
partial correlation) 10-104 
Z 
Zero-one law 6°117 


